Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
17f39880
Commit
17f39880
authored
Dec 17, 2025
by
Varun Sundar Rabindranath
Committed by
Kevin H. Luu
Dec 17, 2025
Browse files
[BugFix] Workspace allocation during profile run : DeepEPHighThroughput + DeepGEMM (#30899)
(cherry picked from commit
e3fc374a
)
parent
682c3858
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
4 additions
and
1 deletion
+4
-1
vllm/model_executor/layers/fused_moe/modular_kernel.py
vllm/model_executor/layers/fused_moe/modular_kernel.py
+4
-1
No files found.
vllm/model_executor/layers/fused_moe/modular_kernel.py
View file @
17f39880
...
...
@@ -795,7 +795,10 @@ class FusedMoEModularKernel(torch.nn.Module):
top_k
,
global_num_experts
,
local_num_experts
,
expert_tokens_meta
,
# expert_tokens_meta help in allocating optimal/minimal
# amount of workspace. Mark it None, so we allocate for
# the worst-case scenario.
expert_tokens_meta
=
None
,
)
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment