- 21 Oct, 2025 1 commit
Zhengju Tang authored
* [Feature] Add GQA backward kernel with varlen input
* [Lint]
* [BugFix] Freeze the memory order of all atomic_add operations
* [Lint]
* [Lint]
* [BugFix] Use release order to boost performance
- 20 Oct, 2025 1 commit
Yu Cheng authored