OpenDAS / vllm_cscc
b36adfa349cfab0e79f3d736d5e5413bd3ee19f5
vllm/platforms/cuda.py
[Perf] Set Flashinfer sparse MLA as default backend for FP8 kv cache (#37252) · b36adfa3
Wei Zhao authored Mar 17, 2026
Signed-off-by: wzhao18 <wzhao18.sz@gmail.com>
cuda.py · 23.3 KB