Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
zhaoyu6
sglang
Commits
e7e5a305
"docs/vscode:/vscode.git/clone" did not exist on "094f716bf2cdc213f2b812dbb489fbf6f4a4423c"
Unverified
Commit
e7e5a305
authored
Jul 31, 2025
by
Baizhou Zhang
Committed by
GitHub
Aug 01, 2025
Browse files
Update batch size limitation of dsv3_router_gemm kernel to 16 (#8051)
parent
dd7ca006
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
2 deletions
+1
-2
python/sglang/srt/models/deepseek_v2.py
python/sglang/srt/models/deepseek_v2.py
+1
-2
No files found.
python/sglang/srt/models/deepseek_v2.py
View file @
e7e5a305
...
...
@@ -252,8 +252,7 @@ class MoEGate(nn.Module):
# NOTE: For some unknown reason, router_gemm seems degrade accept length.
if
(
_is_cuda
and
not
self
.
is_nextn
and
hidden_states
.
shape
[
0
]
<
4
and
hidden_states
.
shape
[
0
]
<=
16
and
hidden_states
.
shape
[
1
]
==
7168
and
self
.
weight
.
shape
[
0
]
==
256
and
_device_sm
>=
90
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment