- Enable `computation-communication-overlap` and `sharding-matmul` in some configs through the existing PyTorch distributed mode.
- Use `torchrun --standalone` for single-node `torch.distributed` runs to avoid rendezvous conflicts on the fixed default port 29500.
- Update the runner's command-generation test expectation for the new single-node torchrun behavior.
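A minimal sketch of the single-node launch change described above; the script name and process count are illustrative, not from this commit:

```shell
# Before: an explicit rendezvous on the default port 29500, which collides
# when two single-node jobs start on the same host:
#   torchrun --nproc-per-node=8 train.py
#
# After: --standalone spins up a local rendezvous backend and picks a free
# port automatically, so concurrent single-node runs do not conflict.
torchrun --standalone --nproc-per-node=8 train.py
```

`--standalone` is only appropriate for single-node jobs; multi-node runs still need an explicit `--rdzv-endpoint`.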
a961ebd4