[Dev] Add MLA and GQA decode examples (#109)
* [CI][Test] Add test cases for tilelang transform MultiVersionBuffer and WarpSpecialized
* Relax the mismatch ratio restrictions in the flash_linear_attention and mha tests
* [Dev] Add mha backward example
* [Dev] Add mla decode example
* bug fix
* Add triton impl
* Add gqa decode example
* [Dev] Add GQA decode example
* lint
* delete unused triton example
* set default profiler to 'auto'