OpenDAS / vllm_cscc
vllm/platforms/cuda.py at 0885aa25646d55d270b6a518d36861de2bec90d1
[feature][Attention Backend] TurboQuant: 2-bit KV cache compression with 4x capacity #38479 · 0885aa25
wanglong3 authored Apr 18, 2026
cuda.py (23 KB)