Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
5a1271d83a65be5ed8dc3e4c990ed42074197db3
Switch branch/tag
vllm_cscc
vllm
model_executor
layers
quantization
mxfp4.py
Find file
Blame
History
Permalink
[Quantization] fix attention quantization of gpt_oss model (#27334)
· 5a1271d8
xuebwang-amd
authored
Nov 12, 2025
Signed-off-by:
xuebwang-amd
<
xuebwang@amd.com
>
5a1271d8
mxfp4.py
45.6 KB
Edit
Web IDE
Replace mxfp4.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace mxfp4.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.