Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
"vllm/entrypoints/pooling/embed/serving.py" did not exist on "3729ed00ba4c5a9f351b1241b15a2ca3ca12a5e1"
72b1c2ae2c2d2782ef1850ca1b96a21f797f7af6
Switch branch/tag
vllm_cscc
vllm
model_executor
layers
quantization
modelopt.py
Find file
Blame
History
Permalink
[Bugfix] Use latency MOE backend as default for Flashinfer and other misc fixes (#27439)
· 72b1c2ae
Pavani Majety
authored
Nov 07, 2025
Signed-off-by:
Pavani Majety
<
pmajety@nvidia.com
>
72b1c2ae
modelopt.py
67.7 KB
Edit
Web IDE
Replace modelopt.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace modelopt.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.