sglang
Commit a564e001
Authored May 28, 2025 by fzyzcjy; committed by GitHub May 27, 2025

Fix DeepEP error in Qwen 3 MoE models (#6673)

Parent: 2103b806
Showing 1 changed file with 9 additions and 6 deletions
python/sglang/srt/layers/moe/ep_moe/token_dispatcher.py
...
...
@@ -93,17 +93,20 @@ class DeepEPBuffer:
                     ),
                     num_rdma_bytes,
                 )
+
+        if deepep_mode == DeepEPMode.normal:
+            num_qps_per_rank = DeepEPConfig.get_instance().num_sms // 2
+        elif deepep_mode in [DeepEPMode.low_latency, DeepEPMode.auto]:
+            num_qps_per_rank = num_experts // group.size()
+        else:
+            raise NotImplementedError
+
         cls._buffer = Buffer(
             group,
             num_nvl_bytes,
             num_rdma_bytes,
             low_latency_mode=deepep_mode.enable_low_latency(),
-            num_qps_per_rank=(
-                max(
-                    num_experts // group.size(),
-                    DeepEPConfig.get_instance().num_sms // 2,
-                )
-            ),
+            num_qps_per_rank=num_qps_per_rank,
         )
         return cls._buffer
...
...
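To make the behavioral change in this hunk easier to see, here is a minimal standalone sketch that compares the removed and added ways of computing num_qps_per_rank. It is an illustration only: the DeepEPMode enum below is a hypothetical stand-in that lists just the three members named in the diff, the two helper functions are not part of sglang, and the input numbers are made up.

```python
from enum import Enum


class DeepEPMode(Enum):
    # Hypothetical stand-in for sglang's DeepEPMode (assumption: only the
    # three members that appear in the diff are modeled here).
    normal = "normal"
    low_latency = "low_latency"
    auto = "auto"


def old_num_qps_per_rank(num_experts: int, group_size: int, num_sms: int) -> int:
    """Removed logic: one formula for every mode."""
    return max(num_experts // group_size, num_sms // 2)


def new_num_qps_per_rank(
    mode: DeepEPMode, num_experts: int, group_size: int, num_sms: int
) -> int:
    """Added logic: the value depends on the DeepEP mode."""
    if mode == DeepEPMode.normal:
        return num_sms // 2
    elif mode in [DeepEPMode.low_latency, DeepEPMode.auto]:
        # Experts per rank, no longer raised to num_sms // 2.
        return num_experts // group_size
    else:
        raise NotImplementedError


# Made-up inputs, chosen so that the two formulas disagree.
num_experts, group_size, num_sms = 128, 16, 24

before = old_num_qps_per_rank(num_experts, group_size, num_sms)
after_low_latency = new_num_qps_per_rank(
    DeepEPMode.low_latency, num_experts, group_size, num_sms
)
after_normal = new_num_qps_per_rank(
    DeepEPMode.normal, num_experts, group_size, num_sms
)
print(before, after_low_latency, after_normal)
```

With these inputs the removed code would pass 12 in every mode, while the added code passes 8 (experts per rank) in low_latency and auto modes and 12 in normal mode. The two versions only differ when num_experts // group.size() and num_sms // 2 are unequal.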