Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
98e7f223b9fb03537adc856e594a34a3cd018536
Switch branch/tag
vllm_cscc
tests
quantization
test_fp8.py
Find file
Blame
History
Permalink
enable skipping of SW attention layers when using FP8 KV cache (#33695)
· 98e7f223
Jonas M. Kübler
authored
Mar 27, 2026
Signed-off-by:
Jonas Kuebler
<
kuebj@amazon.com
>
98e7f223
test_fp8.py
17.7 KB
Edit
Web IDE
Replace test_fp8.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace test_fp8.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.