Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
change
sglang
Commits
8abe8dea
Unverified
Commit
8abe8dea
authored
Aug 29, 2025
by
Yineng Zhang
Committed by
GitHub
Aug 29, 2025
Browse files
fix: dsv3 lite q_lora_rank none (#9815)
parent
1e85589d
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
12 additions
and
4 deletions
+12
-4
python/sglang/srt/models/deepseek_v2.py
python/sglang/srt/models/deepseek_v2.py
+12
-4
No files found.
python/sglang/srt/models/deepseek_v2.py
View file @
8abe8dea
...
...
@@ -2414,18 +2414,26 @@ class DeepseekV2ForCausalLM(nn.Module):
)
num_hidden_layers
=
1
if
is_nextn
else
self
.
config
.
num_hidden_layers
for
layer_id
in
range
(
num_hidden_layers
):
if
is_nextn
:
layer
=
self
.
model
.
decoder
else
:
layer
=
self
.
model
.
layers
[
layer_id
]
for
module
in
[
layer
.
self_attn
.
fused_qkv_a_proj_with_mqa
,
layer
.
self_attn
.
q_b_proj
,
module_list
=
[
layer
.
self_attn
.
kv_b_proj
,
layer
.
self_attn
.
o_proj
,
]:
]
if
self
.
config
.
q_lora_rank
is
not
None
:
module_list
.
append
(
layer
.
self_attn
.
fused_qkv_a_proj_with_mqa
)
module_list
.
append
(
layer
.
self_attn
.
q_b_proj
)
else
:
module_list
.
append
(
layer
.
self_attn
.
kv_a_proj_with_mqa
)
module_list
.
append
(
layer
.
self_attn
.
q_proj
)
for
module
in
module_list
:
requant_weight_ue8m0_inplace
(
module
.
weight
,
module
.
weight_scale_inv
,
weight_block_size
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment