mscclpp

mirror of https://github.com/microsoft/mscclpp.git synced 2026-07-17 01:07:19 +00:00

Files

Qinghua Zhou 2529774868 tests/ep: intranode send-side counts unique (token, dst_node) to match NCCL-EP

Previously total_send_tokens was Sigma over dst_rank of num_tokens_per_rank
which over-counts intra-node fan-out. NCCL-EP's ep_bench collapses
multiple destinations on the same node into one count; on a single-node
run that means total_send_tokens = number of tokens with at least one
valid expert. Switching to is_token_in_rank.any(dim=1).sum() makes the
send-side BW comparable to NCCL-EP's send: total_bw / nvl_bw line.

2026-04-29 23:31:47 +00:00

tests/ep: intranode send-side counts unique (token, dst_node) to match NCCL-EP

2026-04-29 23:31:47 +00:00