OpenDAS / vllm_cscc
Commit d2f4a71c (Unverified)
Authored Feb 05, 2026 by Pavani Majety
Committed by GitHub, Feb 05, 2026

[Bugfix] Kimi-K2 grouped_topk usage for Flashinfer monolithic kernels. (#33858)

Signed-off-by: Pavani Majety <pmajety@nvidia.com>

Parent: 2abd9759
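For readers unfamiliar with the term in the title, the following is a minimal pure-Python sketch of grouped top-k expert selection. It is not the vLLM or FlashInfer kernel. The function names, the example scores, and the choice to score a group by its best expert are all assumptions made for illustration; real implementations may score groups differently. The point it illustrates is that with a single group, grouped top-k selects the same experts as plain top-k.

```python
def plain_topk(scores, k):
    """Indices of the k highest scores, best first (ties go to the lower index)."""
    order = sorted(range(len(scores)), key=lambda i: (-scores[i], i))
    return order[:k]


def grouped_topk(scores, k, n_group, topk_group):
    """Keep the best `topk_group` groups, then take the top-k experts among them.

    Assumption for this sketch: a group's score is its best expert's score.
    """
    if len(scores) % n_group != 0:
        raise ValueError("number of experts must be divisible by n_group")
    size = len(scores) // n_group
    # Score each contiguous group of experts.
    group_best = [max(scores[g * size:(g + 1) * size]) for g in range(n_group)]
    kept = set(plain_topk(group_best, topk_group))
    # Only experts that belong to a kept group remain candidates.
    candidates = [i for i in range(len(scores)) if i // size in kept]
    order = sorted(candidates, key=lambda i: (-scores[i], i))
    return order[:k]


# Hypothetical router scores for 8 experts.
scores = [0.1, 0.9, 0.3, 0.8, 0.7, 0.2, 0.6, 0.5]

print(plain_topk(scores, 2))
print(grouped_topk(scores, 2, n_group=1, topk_group=1))  # one group: same as plain top-k
print(grouped_topk(scores, 2, n_group=4, topk_group=1))  # restricted to the best group
```

Under these assumptions, a configuration with `n_group = 1` and `topk_group = 1` can go through the grouped path without changing which experts are chosen.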
Showing 1 changed file with 3 additions and 11 deletions

vllm/model_executor/models/deepseek_v2.py (+3 -11)
@@ -295,14 +295,6 @@ class DeepseekV2MoE(nn.Module):
                 prefix=f"{prefix}.shared_experts",
             )
 
-        n_group = getattr(config, "n_group", 1)
-        topk_group = getattr(config, "topk_group", 1)
-        use_grouped_topk = True
-        if (n_group, topk_group) == (1, 1):
-            n_group = None
-            topk_group = None
-            use_grouped_topk = False
-
         self.experts = SharedFusedMoE(
             shared_experts=self.shared_experts,
             gate=self.gate,
@@ -313,9 +305,9 @@ class DeepseekV2MoE(nn.Module):
             reduce_results=False,
             renormalize=config.norm_topk_prob,
             quant_config=quant_config,
-            use_grouped_topk=use_grouped_topk,
-            num_expert_group=n_group,
-            topk_group=topk_group,
+            use_grouped_topk=True,
+            num_expert_group=getattr(config, "n_group", 1),
+            topk_group=getattr(config, "topk_group", 1),
             prefix=f"{prefix}.experts",
             scoring_func=getattr(config, "scoring_func", "softmax"),
             # we do scaling outside, set factor to 1.0 to avoid double mul
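The behavioral difference in this diff can be sketched in isolation. The snippet below lifts the removed and added argument logic into two standalone helpers and runs them against stand-in config objects. The helper names and the `SimpleNamespace` configs (including the values 8 and 4) are hypothetical and exist only for this illustration. Only the `getattr` defaults and the `(1, 1)` special case come from the diff itself.

```python
from types import SimpleNamespace


def grouping_args_before(config):
    """Logic removed by the commit: turn grouped top-k off when both values are 1."""
    n_group = getattr(config, "n_group", 1)
    topk_group = getattr(config, "topk_group", 1)
    use_grouped_topk = True
    if (n_group, topk_group) == (1, 1):
        n_group = None
        topk_group = None
        use_grouped_topk = False
    return {
        "use_grouped_topk": use_grouped_topk,
        "num_expert_group": n_group,
        "topk_group": topk_group,
    }


def grouping_args_after(config):
    """Logic added by the commit: always request grouped top-k, defaulting to 1."""
    return {
        "use_grouped_topk": True,
        "num_expert_group": getattr(config, "n_group", 1),
        "topk_group": getattr(config, "topk_group", 1),
    }


# Hypothetical configs, not taken from any real model file.
single_group = SimpleNamespace(n_group=1, topk_group=1)
multi_group = SimpleNamespace(n_group=8, topk_group=4)
no_fields = SimpleNamespace()  # neither attribute set, so the defaults apply

for name, cfg in [("single", single_group), ("multi", multi_group), ("unset", no_fields)]:
    print(name, grouping_args_before(cfg), grouping_args_after(cfg))
```

In this sketch the two versions agree for the multi-group config and differ only when both values are 1 or unset: the old code passes `use_grouped_topk=False` with `None` group sizes, while the new code passes `use_grouped_topk=True` with both sizes equal to 1.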