Add virtual memory support: PE_MMU, VA allocator, fabric MmuMapMsg

Implement VA/MMU layer (ADR-0011 Phase 1) enabling Triton kernels to use
contiguous virtual addresses on sharded tensors.

Key changes:
- PE_MMU component: hybrid inbox (MmuMapMsg) + sync translate() for PE_DMA
- VirtualAllocator + PEMemAllocator: free-list with coalescing
- MmuMapMsg/MmuUnmapMsg fabric path with SIP-level routing
- DPPolicy-based mapping: replicate=local, sharded=broadcast
- Tensor lifecycle: del + weakref cleanup, context manager
- Rename: TensorHandle.pa→addr, DmaReadCmd.src_pa→src_addr, ctx→torch

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

This commit is contained in:

Yangwook Kang

2026-03-26 00:01:47 -07:00

parent 62fb01ae18

commit 08812eda58

34 changed files with 2131 additions and 139 deletions

									
										benches/ipcq_allreduce.py
									
		+1
		-1
	
												View File
												
				@@ -1,2 +1,2 @@

				def run(ctx):

				def run(torch):

				    print("IPCQ all reduce kernel bench")