- 13 May, 2025 1 commit
-
-
Tao He authored
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)
-
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)