Add sparse fine-tuning kernel for deepseek sparse attention to example (#1296)
* [EXAMPLE] add example for dsa sparse finetuning
* [Refactor]