Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
b41aeb34
Unverified
Commit
b41aeb34
authored
Dec 24, 2025
by
Pleaplusone
Committed by
GitHub
Dec 24, 2025
Browse files
[Bugfix][ROCm] Fix load issue on deepseek quark quantization when shared expert enabled (#31261)
Signed-off-by:
ganyi
<
ygan@amd.com
>
parent
ddfac703
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
11 additions
and
8 deletions
+11
-8
vllm/model_executor/models/deepseek_v2.py
vllm/model_executor/models/deepseek_v2.py
+11
-8
No files found.
vllm/model_executor/models/deepseek_v2.py
View file @
b41aeb34
...
...
@@ -1598,7 +1598,11 @@ class DeepseekV2ForCausalLM(
# Determine split axis based on op type
# gate/up: ColumnParallel → split along dim 0
# down: RowParallel → split along dim 1
split_dim
=
1
if
"down_proj.weight"
in
name
else
0
split_dim
=
(
1
if
(
"down_proj.weight"
in
name
and
loaded_weight
.
ndim
>
1
)
else
0
)
total
=
loaded_weight
.
shape
[
split_dim
]
assert
total
%
num_chunks
==
0
,
(
f
"Shared expert weight dim
{
total
}
"
...
...
@@ -1611,14 +1615,13 @@ class DeepseekV2ForCausalLM(
weight_to_load
=
loaded_weight
if
is_fusion_moe_shared_experts_layer
:
if
split_dim
==
0
:
weight_to_load
=
loaded_weight
[
j
*
chunk_size
:
(
j
+
1
)
*
chunk_size
,
:
]
chunk_slice
=
slice
(
j
*
chunk_size
,
(
j
+
1
)
*
chunk_size
)
if
loaded_weight
.
ndim
==
1
:
weight_to_load
=
loaded_weight
[
chunk_slice
]
elif
split_dim
==
0
:
weight_to_load
=
loaded_weight
[
chunk_slice
,
:]
else
:
weight_to_load
=
loaded_weight
[
:,
j
*
chunk_size
:
(
j
+
1
)
*
chunk_size
]
weight_to_load
=
loaded_weight
[:,
chunk_slice
]
# Synthesize an expert-style name so expert mapping
# can route it
chunk_name
=
name
.
replace
(
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment