Files
kernbench2/tests/allreduce_latency_plots/summary.csv
T
mukesh 04c912f53e Allreduce sweep: parametrized + xdist parallelism + topology diagram
Refactor the latency sweep from one giant test into 36 parametrized
cases that run in parallel under xdist (~6-8x faster: 1:49 instead of
~10 min). Each case writes a JSON row to a staging dir; conftest
sessionfinish hook aggregates rows on the controller node into
summary.csv and the per-topology + overview plots.

Aggregator gains a CSV fallback so plot-only tweaks no longer require
re-running the sweep.

Overview plot updates:
- 96 KB explicit x-axis marker with vertical dotted line
- horizontal theoretical 2D-torus reference (10600 ns)
- annotation showing both theoretical and simulated values at 96 KB
- drop overlapping 128 KB tick

New topology.png: 2x2 panel diagram showing device-level topology
(ring, torus 2x3, mesh 2x3) and the cube-level reduction inside SIP 0.
Wrap arrows anchor on box edges and arc outside rows/columns so they
do not overlap any SIP.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-27 16:43:19 -07:00

2.4 KiB

1algorithmsip_topologyn_sipsn_elembytes_per_pebytes_per_siplatency_ns
2intercube_allreducemesh_2d_no_wrap68162563508.4249999999993
3intercube_allreducemesh_2d_no_wrap6326410243515.55
4intercube_allreducemesh_2d_no_wrap66412820483525.0499999999975
5intercube_allreducemesh_2d_no_wrap612825640963544.049999999992
6intercube_allreducemesh_2d_no_wrap65121024163843667.049999999992
7intercube_allreducemesh_2d_no_wrap610242048327683837.049999999992
8intercube_allreducemesh_2d_no_wrap620484096655364177.049999999992
9intercube_allreducemesh_2d_no_wrap6409681921310724857.049999999959
10intercube_allreducemesh_2d_no_wrap68192163842621446217.049999999945
11intercube_allreducemesh_2d_no_wrap616384327685242888937.049999999937
12intercube_allreducemesh_2d_no_wrap63276865536104857614377.049999999872
13intercube_allreducemesh_2d_no_wrap64915298304157286419817.049999999872
14intercube_allreducering_1d68162563073.1299999999937
15intercube_allreducering_1d6326410243079.8799999999947
16intercube_allreducering_1d66412820483088.879999999992
17intercube_allreducering_1d612825640963106.8799999999865
18intercube_allreducering_1d65121024163843225.8799999999865
19intercube_allreducering_1d610242048327683391.8799999999865
20intercube_allreducering_1d620484096655363723.8799999999865
21intercube_allreducering_1d6409681921310724387.879999999965
22intercube_allreducering_1d68192163842621445715.879999999957
23intercube_allreducering_1d616384327685242888371.879999999932
24intercube_allreducering_1d63276865536104857613683.879999999903
25intercube_allreducering_1d64915298304157286418995.879999999917
26intercube_allreducetorus_2d68162562190.4799999999923
27intercube_allreducetorus_2d6326410242196.479999999993
28intercube_allreducetorus_2d66412820482204.4799999999905
29intercube_allreducetorus_2d612825640962220.479999999985
30intercube_allreducetorus_2d65121024163842325.479999999985
31intercube_allreducetorus_2d610242048327682471.479999999985
32intercube_allreducetorus_2d620484096655362763.479999999985
33intercube_allreducetorus_2d6409681921310723347.4799999999777
34intercube_allreducetorus_2d68192163842621444515.4799999999705
35intercube_allreducetorus_2d616384327685242886851.479999999952
36intercube_allreducetorus_2d63276865536104857611523.479999999923
37intercube_allreducetorus_2d64915298304157286416195.479999999952