Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
6ef770df7c3f0d135c2f3a594c461949113aae91
Switch branch/tag
vllm_cscc
vllm
model_executor
layers
quantization
fp8.py
Find file
Blame
History
Permalink
[MoE] Fix output_shape calculation in Attention layer to handle 3D query inputs (#31596)
· 6ef770df
Andreas Karatzas
authored
Jan 02, 2026
Signed-off-by:
Andreas Karatzas
<
akaratza@amd.com
>
6ef770df
fp8.py
57.6 KB
Edit
Web IDE
Replace fp8.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace fp8.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.