OpenDAS / vllm_cscc
Commit 9f1710f1 (Unverified)
Authored Mar 07, 2025 by Ying Zhong
Committed by GitHub, Mar 06, 2025
Fix mla prefill context performance (#13897)
Signed-off-by: ZhongYingMatrix <zhongyingmatrix@gmail.com>
Parent: e642ec96
Showing 2 changed files with 2 additions and 2 deletions

vllm/attention/backends/mla/common.py       +1 -1
vllm/v1/attention/backends/mla/common.py    +1 -1
vllm/attention/backends/mla/common.py @ 9f1710f1

@@ -1308,7 +1308,7 @@ class MLACommonImpl(MLAAttentionImpl[T], Generic[T]):
             )
             kv_c_normed = workspace[:toks]\
-                [..., :self.kv_lora_rank].unsqueeze(1)
+                [..., :self.kv_lora_rank]
             k_pe = workspace[:toks]\
                 [..., self.kv_lora_rank:].unsqueeze(1)
...
vllm/v1/attention/backends/mla/common.py @ 9f1710f1

@@ -874,7 +874,7 @@ class MLACommonImpl(MLAAttentionImpl[M], Generic[M]):
             )
             kv_c_normed = workspace[:toks]\
-                [..., :self.kv_lora_rank].unsqueeze(1)
+                [..., :self.kv_lora_rank]
             k_pe = workspace[:toks]\
                 [..., self.kv_lora_rank:].unsqueeze(1)
...
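
Both hunks make the same one-line change: `kv_c_normed` no longer gets `.unsqueeze(1)`, while `k_pe` keeps it. The diff does not say why this helps prefill-context performance, so the sketch below shows only the effect on tensor shapes. It tracks shapes as plain tuples instead of using a tensor library. It assumes `workspace[:toks]` is 2-D with a last dimension of `kv_lora_rank + rope_dim`. That layout, the name `rope_dim`, the helper functions, and the sizes 8, 512, and 64 are illustrative assumptions, not taken from the commit.

```python
# Shape-only sketch: tuples stand in for tensor shapes.
# All sizes and helper names here are hypothetical, chosen for illustration.

def slice_last(shape, start, stop):
    """Shape after indexing [..., start:stop] on the last dimension."""
    size = len(range(*slice(start, stop).indices(shape[-1])))
    return shape[:-1] + (size,)

def unsqueeze(shape, dim):
    """Shape after inserting a size-1 dimension at position dim (dim >= 0)."""
    return shape[:dim] + (1,) + shape[dim:]

toks, kv_lora_rank, rope_dim = 8, 512, 64
# Assumed shape of workspace[:toks]: one row per token.
workspace_rows = (toks, kv_lora_rank + rope_dim)

# Before the commit: slice, then add a singleton dimension.
kv_c_normed_old = unsqueeze(slice_last(workspace_rows, None, kv_lora_rank), 1)
# After the commit: slice only, the result stays 2-D.
kv_c_normed_new = slice_last(workspace_rows, None, kv_lora_rank)
# Unchanged by the commit: k_pe keeps its singleton dimension.
k_pe = unsqueeze(slice_last(workspace_rows, kv_lora_rank, None), 1)

print(kv_c_normed_old)  # (8, 1, 512)
print(kv_c_normed_new)  # (8, 512)
print(k_pe)             # (8, 1, 64)
```

Under these assumptions, the only difference after the commit is that `kv_c_normed` stays 2-D, with one row per token, when it is passed to later code.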