Commit 9f442e8f authored by zhangqha's avatar zhangqha
Browse files

Update README.md

parent ebcba9f0
Pipeline #2432 canceled with stages
...@@ -8,6 +8,8 @@ MLAttention is an efficient MLA decoding kernel , optimized for variable-length ...@@ -8,6 +8,8 @@ MLAttention is an efficient MLA decoding kernel , optimized for variable-length
- BF16, FP16 - BF16, FP16
目前支持的实现方式: 目前支持的实现方式:
- OpenAI Triton - OpenAI Triton
正在开发的实现方式:
- Cutlass
``` ```
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment