Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
2dc182c0
Commit
2dc182c0
authored
Apr 22, 2026
by
laibao
Browse files
[BUGFIX] 为 Marlin MoE 路径新增 silu_and_mul_opt 优先执行及回退机制
parent
3842b316
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
11 additions
and
5 deletions
+11
-5
vllm/model_executor/layers/fused_moe/fuse_moe_w16a16_marlin.py
...model_executor/layers/fused_moe/fuse_moe_w16a16_marlin.py
+11
-5
No files found.
vllm/model_executor/layers/fused_moe/fuse_moe_w16a16_marlin.py
View file @
2dc182c0
...
...
@@ -375,11 +375,17 @@ def fused_experts_impl_w16a16_marlin(hidden_states: torch.Tensor,
top_k_num
,
config_marlin_0
,
)
if
(
envs
.
VLLM_USE_FUSE_SILU_AND_MUL
and
intermediate_cache1
.
dtype
==
intermediate_cache2
.
dtype
==
torch
.
float16
):
from
lightop
import
fuse_silu_and_mul
fuse_silu_and_mul
(
intermediate_cache1
,
intermediate_cache2
)
if
envs
.
VLLM_USE_FUSE_SILU_AND_MUL
:
try
:
op
.
silu_and_mul_opt
(
intermediate_cache2
,
intermediate_cache1
)
except
Exception
:
try
:
from
lightop
import
fuse_silu_and_mul
fuse_silu_and_mul
(
intermediate_cache1
,
intermediate_cache2
)
except
Exception
:
torch
.
ops
.
_C
.
silu_and_mul
(
intermediate_cache2
,
intermediate_cache1
)
else
:
torch
.
ops
.
_C
.
silu_and_mul
(
intermediate_cache2
,
intermediate_cache1
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment