- 29 Jul, 2025 1 commit
-
-
Void authored
Fix the address of dispatch_rdma_recv_count_buffer to avoid cleaning after each change in hidden_size/token_num. (#313) Signed-off-by:Yilin Zhang <18275976+yilin-void@users.noreply.github.com>
-
- 16 Jun, 2025 1 commit
-
-
Chenggang Zhao authored
* Add automatic warp count control for low-latency dispatch * Add automatic warp count control for low-latency combine * More assertions
-
- 11 Jun, 2025 1 commit
-
-
Chenggang Zhao authored
* Update README * Update `setup.py` * Fix headers * Add `DISABLE_NVSHMEM` for APIs * Fix launch * Fix TMA settings * Fix TMA usages * Fix dlink * Separate layout kernels * Update version * Add `is_sm90_compiled` * Fix tests * Add NVLink connection checks * Update README * Fix tests * Add some comments * Minor fix * Minor fix * Fix bugs
-
- 07 Apr, 2025 1 commit
-
-
Chenggang Zhao authored
-
- 18 Mar, 2025 2 commits
-
-
Chenggang Zhao authored
-
Chenggang Zhao authored
-
- 10 Mar, 2025 1 commit
-
-
Chenggang Zhao authored
-
- 06 Mar, 2025 1 commit
-
-
Chenggang Zhao authored
-
- 03 Mar, 2025 1 commit
-
-
Chenggang Zhao authored
-
- 25 Feb, 2025 1 commit
-
-
Chenggang Zhao authored
-