OpenDAS / vllm_cscc @ e4ee6586721cd9e09ac50207cb5e754d7a4a773e
vllm/model_executor/layers/quantization/modelopt.py
Flashinfer_CUTLASS_MOE fuses quantization for TP (#27223) · fc16f1c4
Shu Wang authored Oct 31, 2025
Signed-off-by: Shu Wang <shuw@nvidia.com>
modelopt.py
68.2 KB