Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
3e41992fecdc31ee60715bb350f18fec18ed6680
Switch branch/tag
vllm_cscc
vllm
v1
attention
backends
mla
flashmla_sparse.py
Find file
Blame
History
Permalink
[Attention] Use sparse prefill kernel for fp8 kv-cache in DeepSeek-v3.2 (#27532)
· 3e41992f
Lucas Wilkinson
authored
Dec 12, 2025
Signed-off-by:
Lucas Wilkinson
<
lwilkins@redhat.com
>
3e41992f
flashmla_sparse.py
38.7 KB
Edit
Web IDE
Replace flashmla_sparse.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace flashmla_sparse.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.