Logo
Explore Help
Register Sign In
ywkang/kernbench2
1
0
Fork 0
You've already forked kernbench2
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
a143925a12794f0c3f44faaeda4f3b4ca3031750
kernbench2/benches
T
History
mukesh 83ea97b05f Composite GEMM: K-loop accumulator residency, pinned operands, sweep + deck
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-13 15:00:41 -07:00
..
__init__.py
commit - release 1
2026-03-18 11:47:48 -07:00
ccl_allreduce.py
Intercube allreduce: pe0 cube-mesh reduce + multi-SIP ring/torus/mesh
2026-04-16 17:33:42 -07:00
gemm_single_pe.py
ADR-0026: DPPolicy intra-device only + ShardSpec structural coords
2026-04-14 13:02:19 -07:00
gpt3_qkv.py
ADR-0026: DPPolicy intra-device only + ShardSpec structural coords
2026-04-14 13:02:19 -07:00
ipcq_allreduce.py
Add virtual memory support: PE_MMU, VA allocator, fabric MmuMapMsg
2026-03-26 00:01:47 -07:00
loader.py
Add PE-level IPCQ collective infra + unified ccl_allreduce bench (ADR-0023)
2026-04-12 19:36:59 -07:00
matmul_composite.py
Composite GEMM: K-loop accumulator residency, pinned operands, sweep + deck
2026-05-13 15:00:41 -07:00
qkv_gemm_multi_pe.py
Reduce test time to 12s: shrink GEMM dims + enable pytest-xdist
2026-04-12 21:06:41 -07:00
qkv_gemm.py
Reduce test time to 12s: shrink GEMM dims + enable pytest-xdist
2026-04-12 21:06:41 -07:00
va_offset_verify.py
ADR-0026: DPPolicy intra-device only + ShardSpec structural coords
2026-04-14 13:02:19 -07:00
Powered by Gitea Version: 1.26.2 Page: 62ms Template: 3ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API