sglang, commit 849957bc
fix: tmp revert gpt oss tp sharding on hopper (#9469)
Authored Aug 21, 2025 by Yineng Zhang
Committed by GitHub, Aug 21, 2025
Parent: cded039b
Showing 1 changed file with 6 additions and 3 deletions
python/sglang/srt/models/gpt_oss.py (+6 -3)
@@ -793,9 +793,12 @@ class GptOssForCausalLM(nn.Module):
             intermediate_size % mxfp4_block == 0
         ), f"{intermediate_size=} must be divisible by {mxfp4_block=}"
         intermediate_size_block = intermediate_size // mxfp4_block
-        per_rank_intermediate_size_block = math.ceil(
-            intermediate_size_block / moe_tp_size
-        )
+        if _is_sm100_supported:
+            per_rank_intermediate_size_block = math.ceil(
+                intermediate_size_block / moe_tp_size
+            )
+        else:
+            per_rank_intermediate_size_block = intermediate_size_block // moe_tp_size
         per_rank_intermediate_size = per_rank_intermediate_size_block * mxfp4_block
 
         # Calculate common slicing bounds for current rank
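To make the effect of the hunk concrete, here is a minimal standalone sketch of the per-rank size arithmetic it changes. The function name `per_rank_intermediate_size` and the example values (`intermediate_size=2880`, `mxfp4_block=32`, `moe_tp_size=4`) are assumptions chosen for illustration and are not taken from the commit. The flag `_is_sm100_supported` is passed in as a plain boolean argument, because how it is computed lies outside the hunk.

```python
import math


def per_rank_intermediate_size(
    intermediate_size: int,
    mxfp4_block: int,
    moe_tp_size: int,
    is_sm100_supported: bool,
) -> int:
    """Hypothetical helper mirroring the arithmetic in the hunk above."""
    assert (
        intermediate_size % mxfp4_block == 0
    ), f"{intermediate_size=} must be divisible by {mxfp4_block=}"
    intermediate_size_block = intermediate_size // mxfp4_block

    if is_sm100_supported:
        # Path kept for SM100: round the per-rank block count up.
        per_rank_block = math.ceil(intermediate_size_block / moe_tp_size)
    else:
        # Path restored for other GPUs: round the per-rank block count down.
        per_rank_block = intermediate_size_block // moe_tp_size

    return per_rank_block * mxfp4_block


if __name__ == "__main__":
    # Illustrative values only: 2880 / 32 = 90 blocks, split across 4 ranks.
    for sm100 in (True, False):
        size = per_rank_intermediate_size(2880, 32, 4, sm100)
        print(f"is_sm100_supported={sm100}: per-rank size {size}")
```

The two branches agree whenever the block count is divisible by `moe_tp_size` and differ by exactly one block (`mxfp4_block` elements) otherwise. With ceiling division the per-rank sizes sum to at least `intermediate_size`; with floor division they sum to at most `intermediate_size`.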