zhaoyu6 / sglang
Commit 5239d795 (unverified), authored Apr 21, 2025 by fzyzcjy, committed by GitHub Apr 20, 2025
Speedup shared expert weight construction by avoid cloning (#5188)
Parent: f0815419
Showing 1 changed file with 1 addition and 1 deletion
python/sglang/srt/models/deepseek_v2.py (+1, -1)
...
...
@@ -1628,7 +1628,7 @@ class DeepseekV2ForCausalLM(nn.Module):
                         f"mlp.experts."
                         f"{self.config.n_routed_experts + num_repeat}"
                         f".{suffix}",
-                        weights_dict[shared_expert_weight_name].clone(),
+                        weights_dict[shared_expert_weight_name],
                     )
                 )
             names_to_remove += [shared_expert_weight_name]
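
The effect of dropping `.clone()` can be illustrated with a minimal sketch. It is not the sglang loader: the helper `fused_expert_entries`, the key `mlp.shared_experts.down_proj.weight`, the tensor shape, and the expert counts are all hypothetical and chosen only for illustration. It assumes PyTorch is installed and that the code consuming these entries only reads from the tensors, which appears to be the premise of this commit.

```python
import torch


def fused_expert_entries(weights_dict, shared_name, suffix, n_routed_experts, n_repeat, clone):
    """Re-register one shared-expert weight under extra routed-expert names.

    Hypothetical helper that mirrors the shape of the changed line only.
    """
    entries = []
    for num_repeat in range(n_repeat):
        tensor = weights_dict[shared_name]
        entries.append(
            (
                f"mlp.experts.{n_routed_experts + num_repeat}.{suffix}",
                # clone() allocates and fills a new buffer on every iteration;
                # passing the tensor itself only adds another reference.
                tensor.clone() if clone else tensor,
            )
        )
    return entries


# Illustrative values, not taken from any real checkpoint.
shared_name = "mlp.shared_experts.down_proj.weight"
weights_dict = {shared_name: torch.zeros(4, 4)}
source_ptr = weights_dict[shared_name].data_ptr()

cloned = fused_expert_entries(weights_dict, shared_name, "down_proj.weight", 8, 2, clone=True)
aliased = fused_expert_entries(weights_dict, shared_name, "down_proj.weight", 8, 2, clone=False)

# Without clone, every entry points at the original storage.
print(all(t.data_ptr() == source_ptr for _, t in aliased))
# With clone, no entry shares storage with the original.
print(any(t.data_ptr() == source_ptr for _, t in cloned))
```

Sharing the tensor is only safe while nothing mutates the entries in place, since a write through one name would be visible through all of them.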
...
...