Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
e9e95d0f
Commit
e9e95d0f
authored
Feb 04, 2026
by
zhuwenwen
Browse files
[perf] use optimized topk_softmax + renormalize (lightop)
parent
06e16a27
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
18 additions
and
7 deletions
+18
-7
vllm/model_executor/layers/fused_moe/router/fused_topk_router.py
...del_executor/layers/fused_moe/router/fused_topk_router.py
+18
-7
No files found.
vllm/model_executor/layers/fused_moe/router/fused_topk_router.py
View file @
e9e95d0f
...
...
@@ -9,6 +9,8 @@ from vllm._aiter_ops import rocm_aiter_ops
from
vllm.distributed.eplb.eplb_state
import
EplbLayerState
from
vllm.model_executor.layers.fused_moe.config
import
RoutingMethodType
from
vllm.model_executor.layers.fused_moe.router.base_router
import
BaseRouter
import
vllm.envs
as
envs
from
lightop
import
op
as
op
def
vllm_topk_softmax
(
...
...
@@ -18,6 +20,15 @@ def vllm_topk_softmax(
gating_output
:
torch
.
Tensor
,
renormalize
:
bool
=
False
,
)
->
tuple
[
torch
.
Tensor
,
...]:
if
envs
.
VLLM_USE_TOPK_RENORM
and
renormalize
is
True
:
op
.
topk_softmax
(
topk_weights
,
topk_indices
,
token_expert_indices
,
gating_output
,
renormalize
,
)
else
:
ops
.
topk_softmax
(
topk_weights
,
topk_indices
,
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment