Logo
Explore Help
Register Sign In
ywkang/kernbench2
1
0
Fork 0
You've already forked kernbench2
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
687c98086d1c689f6c307c45f549b8f94471241e
kernbench2/benches
T
History
mukesh 83ea97b05f Composite GEMM: K-loop accumulator residency, pinned operands, sweep + deck
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-13 15:00:41 -07:00
..
__init__.py
commit - release 1
2026-03-18 11:47:48 -07:00
ccl_allreduce.py
Intercube allreduce: pe0 cube-mesh reduce + multi-SIP ring/torus/mesh
2026-04-16 17:33:42 -07:00
gemm_single_pe.py
ADR-0026: DPPolicy intra-device only + ShardSpec structural coords
2026-04-14 13:02:19 -07:00
gpt3_qkv.py
ADR-0026: DPPolicy intra-device only + ShardSpec structural coords
2026-04-14 13:02:19 -07:00
ipcq_allreduce.py
Add virtual memory support: PE_MMU, VA allocator, fabric MmuMapMsg
2026-03-26 00:01:47 -07:00
loader.py
Add PE-level IPCQ collective infra + unified ccl_allreduce bench (ADR-0023)
2026-04-12 19:36:59 -07:00
matmul_composite.py
Composite GEMM: K-loop accumulator residency, pinned operands, sweep + deck
2026-05-13 15:00:41 -07:00
qkv_gemm_multi_pe.py
Reduce test time to 12s: shrink GEMM dims + enable pytest-xdist
2026-04-12 21:06:41 -07:00
qkv_gemm.py
Reduce test time to 12s: shrink GEMM dims + enable pytest-xdist
2026-04-12 21:06:41 -07:00
va_offset_verify.py
ADR-0026: DPPolicy intra-device only + ShardSpec structural coords
2026-04-14 13:02:19 -07:00
Powered by Gitea Version: 1.26.2 Page: 57ms Template: 3ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API