Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
b8665383dfda143f02378833326689cd79a1f1b3
Switch branch/tag
vllm_cscc
vllm
engine
arg_utils.py
Find file
Blame
History
Permalink
enable skipping of SW attention layers when using FP8 KV cache (#33695)
· 98e7f223
Jonas M. Kübler
authored
Mar 27, 2026
Signed-off-by:
Jonas Kuebler
<
kuebj@amazon.com
>
98e7f223
arg_utils.py
95.9 KB
Edit
Web IDE
Replace arg_utils.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace arg_utils.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.