Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
change
sglang
Commits
66d5d042
"python/vscode:/vscode.git/clone" did not exist on "31a7d50964ae1c9d5693661567e7a3e034383bd7"
Unverified
Commit
66d5d042
authored
Sep 03, 2025
by
Elfie Guo
Committed by
GitHub
Sep 03, 2025
Browse files
Minor update regarding issue #9704 (#9733)
parent
73179b76
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
5 additions
and
3 deletions
+5
-3
python/sglang/srt/models/deepseek_v2.py
python/sglang/srt/models/deepseek_v2.py
+5
-3
No files found.
python/sglang/srt/models/deepseek_v2.py
View file @
66d5d042
...
@@ -1678,9 +1678,11 @@ class DeepseekV2AttentionMLA(nn.Module):
...
@@ -1678,9 +1678,11 @@ class DeepseekV2AttentionMLA(nn.Module):
latent_cache_buf
=
forward_batch
.
token_to_kv_pool
.
get_key_buffer
(
latent_cache_buf
=
forward_batch
.
token_to_kv_pool
.
get_key_buffer
(
self
.
attn_mha
.
layer_id
self
.
attn_mha
.
layer_id
)
)
latent_cache
=
latent_cache_buf
[
latent_cache
=
(
forward_batch
.
prefix_chunk_kv_indices
[
i
]
latent_cache_buf
[
forward_batch
.
prefix_chunk_kv_indices
[
i
]]
].
contiguous
()
.
contiguous
()
.
to
(
q
.
dtype
)
)
kv_a_normed
,
k_pe
=
latent_cache
.
split
(
kv_a_normed
,
k_pe
=
latent_cache
.
split
(
[
self
.
kv_lora_rank
,
self
.
qk_rope_head_dim
],
dim
=-
1
[
self
.
kv_lora_rank
,
self
.
qk_rope_head_dim
],
dim
=-
1
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment