- 28 Mar, 2025 1 commit
-
-
songhexiang authored
In the SMs that calculate metadata in notify_dispatch, each warp calculates the metadata of one channel. The default configuration is 8 warps for 10 channels, so the loop needs two rounds. Maybe the number of warps can be set equal to the number of channels so that one round is enough.
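The round count in this message is a ceiling division of channels by warps. The sketch below only illustrates that arithmetic; it is not DeepEP code. The function names are hypothetical, and the strided channel-to-warp assignment is an assumption about how the loop is laid out.

```python
def rounds_needed(num_channels: int, num_warps: int) -> int:
    """Loop rounds needed when each warp handles one channel per round."""
    if num_channels < 0 or num_warps <= 0:
        raise ValueError("num_channels must be >= 0 and num_warps must be > 0")
    # Ceiling division without floating point.
    return (num_channels + num_warps - 1) // num_warps


def channels_for_warp(warp_id: int, num_channels: int, num_warps: int) -> list[int]:
    """Channels visited by one warp, assuming a strided loop (hypothetical layout)."""
    return list(range(warp_id, num_channels, num_warps))


if __name__ == "__main__":
    # Default configuration from the commit message: 8 warps, 10 channels.
    print(rounds_needed(10, 8))
    # Proposed configuration: one warp per channel.
    print(rounds_needed(10, 10))
    # Under the strided assumption, only warps 0 and 1 do work in the second round.
    for warp_id in range(8):
        print(warp_id, channels_for_warp(warp_id, 10, 8))
```

With 8 warps and 10 channels, six of the eight warps sit idle in the second round, which is the inefficiency the message points at.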
-
- 27 Mar, 2025 3 commits
-
-
Chenggang Zhao authored
-
Chenggang Zhao authored
-
Chenggang Zhao authored
-
- 25 Mar, 2025 3 commits
-
-
Chenggang Zhao authored
Super tiny fix typo
-
Chenggang Zhao authored
-
fzyzcjy authored
-
- 18 Mar, 2025 3 commits
-
-
Chenggang Zhao authored
Support zero-copy for low-latency combine
-
Chenggang Zhao authored
-
Chenggang Zhao authored
-
- 14 Mar, 2025 4 commits
-
-
Chenggang Zhao authored
-
Chenggang Zhao authored
Low latency kernels use rdma atomic to support AR
-
Shangyan Zhou authored
-
Shangyan Zhou authored
-
- 13 Mar, 2025 2 commits
-
-
Chenggang Zhao authored
Allow passing output tensor in low_latency_combine
-
Dmytro Dzhulgakov authored
-
- 11 Mar, 2025 1 commit
-
-
Chenggang Zhao authored
Update NVSHMEM to v3.2.5.
-
- 10 Mar, 2025 2 commits
-
-
Dmytro Dzhulgakov authored
-
Chenggang Zhao authored
-
- 06 Mar, 2025 2 commits
-
-
Chenggang Zhao authored
-
Chenggang Zhao authored
Fix AR bugs for normal kernels
-
- 05 Mar, 2025 3 commits
-
-
Chenggang Zhao authored
-
Shangyan Zhou authored
-
Chenggang Zhao authored
-
- 04 Mar, 2025 4 commits
-
-
Chenggang Zhao authored
-
Chenggang Zhao authored
-
Chenggang Zhao authored
-
Chenggang Zhao authored
-
- 03 Mar, 2025 3 commits
-
-
Chenggang Zhao authored
-
Chenggang Zhao authored
-
Chenggang Zhao authored
Update path
-
- 28 Feb, 2025 3 commits
-
-
youkaichao authored
-
Shangyan Zhou authored
fix installation
-
youkaichao authored
-
- 27 Feb, 2025 1 commit
-
-
Chenggang Zhao authored
-
- 26 Feb, 2025 2 commits
-
-
Chenggang Zhao authored
-
Chenggang Zhao authored
-
- 25 Feb, 2025 3 commits
-
-
haswelliris authored
-
Chenggang Zhao authored
-
Chenggang Zhao authored
-