OpenDAS / vllm_cscc
Commit 4d479e7e, authored Jul 25, 2025 by zhuwenwen
Merge branch 'v0.9.2-dev-wm' into 'v0.9.2-dev'
[fix] Resolve deepseek error. See merge request dcutoolkit/deeplearing/vllm!162
Parents: b35835a1, cf4c84b5
Showing 1 changed file with 19 additions and 0 deletions

vllm/_custom_ops.py (+19 -0)
@@ -2366,6 +2366,25 @@ def moe_fused_gate(
         routed_scaling_factor,
     )
 
+if hasattr(torch.ops._moe_C, "moe_fused_gate"):
+
+    @register_fake("_moe_C::moe_fused_gate")
+    def moe_fused_gate_fake(
+        input_tensor: torch.Tensor,
+        bias: torch.Tensor,
+        num_expert_group: int,
+        topk_group: int,
+        topk: int,
+        n_share_experts_fusion: int,
+        routed_scaling_factor: int,
+    ):
+        return torch.empty((input_tensor.size(0), topk),
+                           dtype=input_tensor.dtype,
+                           device=input_tensor.device), \
+            torch.empty((input_tensor.size(0), topk),
+                        dtype=input_tensor.dtype,
+                        device=input_tensor.device)
+
 if hasattr(torch.ops._C, "weight_packed_linear"):
...
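The change above follows a guard-then-register pattern: the fake implementation is attached only when the compiled op exists, and it returns tensors with the right shape, dtype, and device without running the kernel. The sketch below illustrates that pattern in plain Python with no dependencies. It is not the `torch.library` API: `TensorMeta`, `REAL_OPS`, `FAKE_IMPLS`, `trace`, and this local `register_fake` are hypothetical stand-ins introduced only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass(frozen=True)
class TensorMeta:
    """Hypothetical stand-in for a tensor that carries metadata but no data."""
    shape: Tuple[int, ...]
    dtype: str
    device: str


# Hypothetical registries: compiled kernels and their metadata-only twins.
REAL_OPS: Dict[str, Callable] = {"_moe_C::moe_fused_gate": lambda *args: None}
FAKE_IMPLS: Dict[str, Callable] = {}


def register_fake(name: str):
    """Toy decorator that records a fake implementation under an op name."""
    def decorator(fn: Callable) -> Callable:
        FAKE_IMPLS[name] = fn
        return fn
    return decorator


# Guarded registration, mirroring the hasattr(...) check in the commit:
# nothing is registered when the compiled op is missing from the build.
if "_moe_C::moe_fused_gate" in REAL_OPS:

    @register_fake("_moe_C::moe_fused_gate")
    def moe_fused_gate_fake(input_tensor, bias, num_expert_group, topk_group,
                            topk, n_share_experts_fusion,
                            routed_scaling_factor):
        # Two outputs of shape (num_tokens, topk), as in the commit's fake.
        out = TensorMeta((input_tensor.shape[0], topk),
                         input_tensor.dtype, input_tensor.device)
        return out, out


def trace(name: str, *args):
    """Resolve output metadata through the fake impl, never the real kernel."""
    return FAKE_IMPLS[name](*args)
```

A tracer only needs the output metadata, so calling `trace("_moe_C::moe_fused_gate", x, bias, 8, 4, 8, 0, 1)` with a `TensorMeta` of shape `(4, 256)` yields two `(4, 8)` results without touching the kernel.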