Logo
Explore Help
Register Sign In
ywkang/kernbench2
1
0
Fork 0
You've already forked kernbench2
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
a44f832be5939b7b9a218e40c36ec72fee66f1a0
kernbench2/benches
T
History
mukesh 83ea97b05f Composite GEMM: K-loop accumulator residency, pinned operands, sweep + deck
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-13 15:00:41 -07:00
..
__init__.py
commit - release 1
2026-03-18 11:47:48 -07:00
ccl_allreduce.py
Intercube allreduce: pe0 cube-mesh reduce + multi-SIP ring/torus/mesh
2026-04-16 17:33:42 -07:00
gemm_single_pe.py
ADR-0026: DPPolicy intra-device only + ShardSpec structural coords
2026-04-14 13:02:19 -07:00
gpt3_qkv.py
ADR-0026: DPPolicy intra-device only + ShardSpec structural coords
2026-04-14 13:02:19 -07:00
ipcq_allreduce.py
Add virtual memory support: PE_MMU, VA allocator, fabric MmuMapMsg
2026-03-26 00:01:47 -07:00
loader.py
Add PE-level IPCQ collective infra + unified ccl_allreduce bench (ADR-0023)
2026-04-12 19:36:59 -07:00
matmul_composite.py
Composite GEMM: K-loop accumulator residency, pinned operands, sweep + deck
2026-05-13 15:00:41 -07:00
qkv_gemm_multi_pe.py
Reduce test time to 12s: shrink GEMM dims + enable pytest-xdist
2026-04-12 21:06:41 -07:00
qkv_gemm.py
Reduce test time to 12s: shrink GEMM dims + enable pytest-xdist
2026-04-12 21:06:41 -07:00
va_offset_verify.py
ADR-0026: DPPolicy intra-device only + ShardSpec structural coords
2026-04-14 13:02:19 -07:00
Powered by Gitea Version: 1.26.2 Page: 60ms Template: 3ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API