Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
"tests/vscode:/vscode.git/clone" did not exist on "09c2eb85ddd3b2585979f4cd9cc97168d86718b6"
8332078cfdbd5e44e527893b695e79052d008172
Switch branch/tag
vllm_cscc
tests
quantization
test_fp8.py
Find file
Blame
History
Permalink
enable skipping of SW attention layers when using FP8 KV cache (#33695)
· 98e7f223
Jonas M. Kübler
authored
Mar 27, 2026
Signed-off-by:
Jonas Kuebler
<
kuebj@amazon.com
>
98e7f223
test_fp8.py
17.7 KB
Edit
Web IDE
Replace test_fp8.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace test_fp8.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.