Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
6832707e90d460bc1d1eec550e0035af72db7a27
Switch branch/tag
vllm_cscc
vllm
attention
backends
flash_attn.py
Find file
Blame
History
Permalink
[V1][Bugfix] Standardize quantized kv cache rejection for attention backends (#14221)
· 6832707e
Michael Goin
authored
Mar 06, 2025
Signed-off-by:
mgoin
<
mgoin64@gmail.com
>
6832707e
flash_attn.py
39.7 KB
Edit
Web IDE
Replace flash_attn.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace flash_attn.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.