kernbench2/tests/allreduce_latency_plots/summary.csv
ywkang fca24feac5 Fix all remaining test failures: single-cube allreduce + matplotlib dep
- intercube_allreduce: add single-cube fast path that skips intra-SIP
  mesh reduce and goes directly to inter-SIP exchange. Fixes IPCQ
  deadlock when TP launches kernel on one cube per SIP.
- distributed.py: derive effective cube dims from tensor shard placement
  instead of hardcoding topology mesh size.
- pyproject.toml: add matplotlib>=3.7 to dependencies.
- pe_dma.py (prior commit): add MMU translation in pipeline DMA path.

577 passed, 0 failed (was 529 passed, 10 failed).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-27 21:25:31 -07:00

2.5 KiB

algorithm,sip_topology,n_sips,n_elem,bytes_per_pe,bytes_per_sip,latency_ns
intercube_allreduce,mesh_2d_no_wrap,6,8,16,256,3508.4249999999993
intercube_allreduce,mesh_2d_no_wrap,6,32,64,1024,3515.55
intercube_allreduce,mesh_2d_no_wrap,6,64,128,2048,3525.0499999999975
intercube_allreduce,mesh_2d_no_wrap,6,128,256,4096,3544.049999999992
intercube_allreduce,mesh_2d_no_wrap,6,512,1024,16384,3667.049999999992
intercube_allreduce,mesh_2d_no_wrap,6,1024,2048,32768,3837.049999999992
intercube_allreduce,mesh_2d_no_wrap,6,2048,4096,65536,4177.049999999992
intercube_allreduce,mesh_2d_no_wrap,6,4096,8192,131072,4857.049999999959
intercube_allreduce,mesh_2d_no_wrap,6,8192,16384,262144,6217.049999999945
intercube_allreduce,mesh_2d_no_wrap,6,16384,32768,524288,8937.049999999937
intercube_allreduce,mesh_2d_no_wrap,6,32768,65536,1048576,14377.049999999872
intercube_allreduce,mesh_2d_no_wrap,6,49152,98304,1572864,19817.049999999872
intercube_allreduce,ring_1d,6,8,16,256,3073.1299999999937
intercube_allreduce,ring_1d,6,32,64,1024,3079.8799999999947
intercube_allreduce,ring_1d,6,64,128,2048,3088.879999999992
intercube_allreduce,ring_1d,6,128,256,4096,3106.8799999999865
intercube_allreduce,ring_1d,6,512,1024,16384,3225.8799999999865
intercube_allreduce,ring_1d,6,1024,2048,32768,3391.8799999999865
intercube_allreduce,ring_1d,6,2048,4096,65536,3723.8799999999865
intercube_allreduce,ring_1d,6,4096,8192,131072,4387.879999999965
intercube_allreduce,ring_1d,6,8192,16384,262144,5715.879999999957
intercube_allreduce,ring_1d,6,16384,32768,524288,8371.879999999932
intercube_allreduce,ring_1d,6,32768,65536,1048576,13683.879999999903
intercube_allreduce,ring_1d,6,49152,98304,1572864,18995.879999999917
intercube_allreduce,torus_2d,6,8,16,256,2190.4799999999923
intercube_allreduce,torus_2d,6,32,64,1024,2196.479999999993
intercube_allreduce,torus_2d,6,64,128,2048,2204.4799999999905
intercube_allreduce,torus_2d,6,128,256,4096,2220.479999999985
intercube_allreduce,torus_2d,6,512,1024,16384,2325.479999999985
intercube_allreduce,torus_2d,6,1024,2048,32768,2471.479999999985
intercube_allreduce,torus_2d,6,2048,4096,65536,2763.479999999985
intercube_allreduce,torus_2d,6,4096,8192,131072,3347.4799999999777
intercube_allreduce,torus_2d,6,8192,16384,262144,4515.4799999999705
intercube_allreduce,torus_2d,6,16384,32768,524288,6851.479999999952
intercube_allreduce,torus_2d,6,32768,65536,1048576,11523.479999999923
intercube_allreduce,torus_2d,6,49152,98304,1572864,16195.479999999952
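
One way to read these numbers is to compare the topologies by their small-message latency and by the incremental cost per byte at the large end. The sketch below is illustrative only and is not part of the kernbench2 repository: it embeds the smallest and the two largest rows per topology from summary.csv, and the helper names `load_rows` and `summarize` are hypothetical. It assumes the file is comma-delimited with the header shown in the first row.

```python
import csv
import io

# Sample rows taken from summary.csv: the smallest and the two largest
# message sizes for each SIP topology (assumed comma-delimited).
SAMPLE = """\
algorithm,sip_topology,n_sips,n_elem,bytes_per_pe,bytes_per_sip,latency_ns
intercube_allreduce,mesh_2d_no_wrap,6,8,16,256,3508.4249999999993
intercube_allreduce,mesh_2d_no_wrap,6,32768,65536,1048576,14377.049999999872
intercube_allreduce,mesh_2d_no_wrap,6,49152,98304,1572864,19817.049999999872
intercube_allreduce,ring_1d,6,8,16,256,3073.1299999999937
intercube_allreduce,ring_1d,6,32768,65536,1048576,13683.879999999903
intercube_allreduce,ring_1d,6,49152,98304,1572864,18995.879999999917
intercube_allreduce,torus_2d,6,8,16,256,2190.4799999999923
intercube_allreduce,torus_2d,6,32768,65536,1048576,11523.479999999923
intercube_allreduce,torus_2d,6,49152,98304,1572864,16195.479999999952
"""


def load_rows(text):
    """Parse CSV text into dicts with typed size and latency fields."""
    rows = []
    for rec in csv.DictReader(io.StringIO(text)):
        rows.append({
            "topology": rec["sip_topology"],
            "bytes_per_sip": int(rec["bytes_per_sip"]),
            "latency_ns": float(rec["latency_ns"]),
        })
    return rows


def summarize(rows):
    """Per topology: smallest-message latency and slope between the two largest sizes."""
    by_topo = {}
    for r in rows:
        by_topo.setdefault(r["topology"], []).append(r)
    out = {}
    for topo, rs in by_topo.items():
        rs.sort(key=lambda r: r["bytes_per_sip"])
        small, prev, last = rs[0], rs[-2], rs[-1]
        slope = (last["latency_ns"] - prev["latency_ns"]) / (
            last["bytes_per_sip"] - prev["bytes_per_sip"]
        )
        out[topo] = {"base_latency_ns": small["latency_ns"], "ns_per_byte": slope}
    return out


summary = summarize(load_rows(SAMPLE))
for topo, s in sorted(summary.items()):
    print(f"{topo}: base {s['base_latency_ns']:.1f} ns, "
          f"{s['ns_per_byte'] * 1024:.2f} ns/KiB")
```

To run this against the full file, replace `SAMPLE` with the contents of summary.csv; the slope is then still taken between the two largest sizes of each topology. On the sample rows, torus_2d has both the lowest small-message latency and the lowest incremental cost per byte.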