- 06 Jun, 2025 1 commit
-
-
Chenggang Zhao authored
* Update CMake files * Use TMA instead of LD/ST for intranode dispatch * Use TMA instead of LD/ST for intranode combine * Adjust configs * Test default configs as well * More warps for combine * Add inter-thread fence * Enable more warps * Do not use TMA for senders * Update configs * Remove useless wait
-
- 11 Apr, 2025 1 commit
-
-
Hao Lin authored
Signed-off-by:Hao Lin <linhaomails@gmail.com>
-
- 10 Apr, 2025 1 commit
-
-
fujianhao.fjh authored
-
- 25 Mar, 2025 1 commit
-
-
Chenggang Zhao authored
-
- 04 Mar, 2025 1 commit
-
-
Chenggang Zhao authored
-
- 03 Mar, 2025 1 commit
-
-
Chenggang Zhao authored
-
- 25 Feb, 2025 1 commit
-
-
Chenggang Zhao authored
-