a44f832be5
Allreduce + pe2pe + ipcq + pe_view auto-regenerated by test sweeps running against the new chunk-streaming wire timing (per-flit wormhole). Absolute numbers shift upward because bottleneck-link transit is now charged once per flit, replacing the previous cut-through subtraction at HBM CTRL.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
6.7 KiB
| line | hop | label | size_bytes | path | total_ns |
|---|---|---|---|---|---|
| 2 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 128 | ipcq | 42.8899999999976 |
| 3 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 128 | raw | 29.0199999999968 |
| 4 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 256 | ipcq | 48.1399999999976 |
| 5 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 256 | raw | 31.0199999999968 |
| 6 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 384 | ipcq | 50.3899999999976 |
| 7 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 384 | raw | 32.0199999999968 |
| 8 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 512 | ipcq | 52.6399999999976 |
| 9 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 512 | raw | 33.0199999999968 |
| 10 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 768 | ipcq | 57.1399999999976 |
| 11 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 768 | raw | 35.0199999999968 |
| 12 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 1024 | ipcq | 62.6399999999976 |
| 13 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 1024 | raw | 37.0199999999968 |
| 14 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 2048 | ipcq | 84.6399999999976 |
| 15 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 2048 | raw | 45.0199999999968 |
| 16 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 4096 | ipcq | 128.6399999999976 |
| 17 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 4096 | raw | 61.0199999999968 |
| 18 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 8192 | ipcq | 216.64000000000306 |
| 19 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 8192 | raw | 93.02000000000407 |
| 20 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 10240 | ipcq | 260.64000000000306 |
| 21 | h1_intra_horizontal | Intra-cube horizontal (pe0 to pe1) | 10240 | raw | 109.02000000000407 |
| 22 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 128 | ipcq | 42.8899999999976 |
| 23 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 128 | raw | 29.0199999999968 |
| 24 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 256 | ipcq | 48.1399999999976 |
| 25 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 256 | raw | 31.0199999999968 |
| 26 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 384 | ipcq | 50.3899999999976 |
| 27 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 384 | raw | 32.0199999999968 |
| 28 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 512 | ipcq | 52.6399999999976 |
| 29 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 512 | raw | 33.0199999999968 |
| 30 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 768 | ipcq | 57.1399999999976 |
| 31 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 768 | raw | 35.0199999999968 |
| 32 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 1024 | ipcq | 62.6399999999976 |
| 33 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 1024 | raw | 37.0199999999968 |
| 34 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 2048 | ipcq | 84.6399999999976 |
| 35 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 2048 | raw | 45.0199999999968 |
| 36 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 4096 | ipcq | 128.6399999999976 |
| 37 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 4096 | raw | 61.0199999999968 |
| 38 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 8192 | ipcq | 216.64000000000306 |
| 39 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 8192 | raw | 93.02000000000407 |
| 40 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 10240 | ipcq | 260.64000000000306 |
| 41 | h2_intra_vertical | Intra-cube vertical (pe0 to pe4) | 10240 | raw | 109.02000000000407 |
| 42 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 128 | ipcq | 81.15999999999804 |
| 43 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 128 | raw | 89.28999999999724 |
| 44 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 256 | ipcq | 88.65999999999804 |
| 45 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 256 | raw | 95.53999999999724 |
| 46 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 384 | ipcq | 90.90999999999804 |
| 47 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 384 | raw | 96.53999999999724 |
| 48 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 512 | ipcq | 93.15999999999804 |
| 49 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 512 | raw | 97.53999999999724 |
| 50 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 768 | ipcq | 97.65999999999804 |
| 51 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 768 | raw | 99.53999999999724 |
| 52 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 1024 | ipcq | 103.15999999999804 |
| 53 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 1024 | raw | 102.53999999999724 |
| 54 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 2048 | ipcq | 125.15999999999804 |
| 55 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 2048 | raw | 114.53999999999724 |
| 56 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 4096 | ipcq | 169.15999999999804 |
| 57 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 4096 | raw | 138.53999999999724 |
| 58 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 8192 | ipcq | 257.15999999999985 |
| 59 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 8192 | raw | 186.54000000000087 |
| 60 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 10240 | ipcq | 301.15999999999985 |
| 61 | h3_inter_cube_horizontal | Inter-cube horizontal (cube0 to cube1) | 10240 | raw | 210.54000000000087 |
| 62 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 128 | ipcq | 103.15999999999804 |
| 63 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 128 | raw | 111.28999999999724 |
| 64 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 256 | ipcq | 112.65999999999804 |
| 65 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 256 | raw | 119.53999999999724 |
| 66 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 384 | ipcq | 114.90999999999804 |
| 67 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 384 | raw | 120.53999999999724 |
| 68 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 512 | ipcq | 117.15999999999804 |
| 69 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 512 | raw | 121.53999999999724 |
| 70 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 768 | ipcq | 121.65999999999804 |
| 71 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 768 | raw | 123.53999999999724 |
| 72 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 1024 | ipcq | 127.15999999999804 |
| 73 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 1024 | raw | 126.53999999999724 |
| 74 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 2048 | ipcq | 149.15999999999804 |
| 75 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 2048 | raw | 138.53999999999724 |
| 76 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 4096 | ipcq | 193.15999999999804 |
| 77 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 4096 | raw | 162.53999999999724 |
| 78 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 8192 | ipcq | 281.15999999999985 |
| 79 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 8192 | raw | 210.54000000000087 |
| 80 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 10240 | ipcq | 325.15999999999985 |
| 81 | h4_inter_cube_vertical | Inter-cube vertical (cube0 to cube4) | 10240 | raw | 234.54000000000087 |
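
A minimal sketch for reading the sweep, assuming one wants to compare the `ipcq` and `raw` paths at a given hop and size. The dictionary holds a handful of rows copied from the table above, with `total_ns` rounded to two decimals for readability. The names `TOTAL_NS` and `ipcq_minus_raw` are hypothetical and not part of the test sweep code.

```python
# Selected rows copied from the table above: (hop, size_bytes, path) -> total_ns.
# Values are rounded to 2 decimals here; the CSV carries floating-point noise.
TOTAL_NS = {
    ("h1_intra_horizontal", 128, "ipcq"): 42.89,
    ("h1_intra_horizontal", 128, "raw"): 29.02,
    ("h1_intra_horizontal", 10240, "ipcq"): 260.64,
    ("h1_intra_horizontal", 10240, "raw"): 109.02,
    ("h3_inter_cube_horizontal", 128, "ipcq"): 81.16,
    ("h3_inter_cube_horizontal", 128, "raw"): 89.29,
    ("h3_inter_cube_horizontal", 10240, "ipcq"): 301.16,
    ("h3_inter_cube_horizontal", 10240, "raw"): 210.54,
}


def ipcq_minus_raw(hop: str, size_bytes: int) -> float:
    """Return total_ns(ipcq) - total_ns(raw) for one hop and payload size."""
    ipcq = TOTAL_NS[(hop, size_bytes, "ipcq")]
    raw = TOTAL_NS[(hop, size_bytes, "raw")]
    return round(ipcq - raw, 2)


if __name__ == "__main__":
    for hop in ("h1_intra_horizontal", "h3_inter_cube_horizontal"):
        for size in (128, 10240):
            print(hop, size, ipcq_minus_raw(hop, size))
```

On these rows the difference is positive for the intra-cube hop at both sizes, while for the inter-cube horizontal hop it is negative at 128 bytes and positive at 10240 bytes.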