This website requires JavaScript.
Explore
Help
Register
Sign In
ywkang
/
kernbench2
Watch
1
Star
0
Fork
0
You've already forked kernbench2
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
32b29a1e5c01d8c7646e7e94c4267c3ed559cd8b
kernbench2
/
benches
T
History
mukesh
83ea97b05f
Composite GEMM: K-loop accumulator residency, pinned operands, sweep + deck
...
Co-Authored-By: Claude Opus 4.7 (1M context) <
noreply@anthropic.com
>
2026-05-13 15:00:41 -07:00
..
__init__.py
commit - release 1
2026-03-18 11:47:48 -07:00
ccl_allreduce.py
Intercube allreduce: pe0 cube-mesh reduce + multi-SIP ring/torus/mesh
2026-04-16 17:33:42 -07:00
gemm_single_pe.py
ADR-0026: DPPolicy intra-device only + ShardSpec structural coords
2026-04-14 13:02:19 -07:00
gpt3_qkv.py
ADR-0026: DPPolicy intra-device only + ShardSpec structural coords
2026-04-14 13:02:19 -07:00
ipcq_allreduce.py
Add virtual memory support: PE_MMU, VA allocator, fabric MmuMapMsg
2026-03-26 00:01:47 -07:00
loader.py
Add PE-level IPCQ collective infra + unified ccl_allreduce bench (ADR-0023)
2026-04-12 19:36:59 -07:00
matmul_composite.py
Composite GEMM: K-loop accumulator residency, pinned operands, sweep + deck
2026-05-13 15:00:41 -07:00
qkv_gemm_multi_pe.py
Reduce test time to 12s: shrink GEMM dims + enable pytest-xdist
2026-04-12 21:06:41 -07:00
qkv_gemm.py
Reduce test time to 12s: shrink GEMM dims + enable pytest-xdist
2026-04-12 21:06:41 -07:00
va_offset_verify.py
ADR-0026: DPPolicy intra-device only + ShardSpec structural coords
2026-04-14 13:02:19 -07:00