Commit 9f442e8f authored by zhangqha's avatar zhangqha
Browse files

Update README.md

parent ebcba9f0
Pipeline #2432 canceled with stages
......@@ -8,6 +8,8 @@ MLAttention is an efficient MLA decoding kernel , optimized for variable-length
- BF16, FP16
目前支持的实现方式:
- OpenAI Triton
正在开发的实现方式:
- Cutlass
```
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment