OpenDAS / vllm_cscc
Commit 980a1724 (unverified)
Authored May 20, 2025 by Percy
Committed by GitHub, May 20, 2025

[Kernel] update comment for KV shape in unified triton attn (#18099)

Signed-off-by: haochengxia <xhc_1007@163.com>

Parent: e1f5a71e
Showing 1 changed file with 2 additions and 2 deletions
vllm/attention/ops/triton_unified_attention.py (+2 -2)

@@ -31,8 +31,8 @@ def apply_softcap(S, x):
 def kernel_unified_attention_2d(
         output_ptr,  # [num_tokens, num_query_heads, head_size]
         query_ptr,  # [num_tokens, num_query_heads, head_size]
-        key_cache_ptr,  # [num_blks, num_kv_heads, head_size // x, blk_size, x]
-        value_cache_ptr,  # [num_blks, num_kv_heads, head_size, blk_size]
+        key_cache_ptr,  # [num_blks, blk_size, num_kv_heads, head_size]
+        value_cache_ptr,  # [num_blks, blk_size, num_kv_heads, head_size]
         block_tables_ptr,  # [num_seqs, max_num_blocks_per_seq]
         seq_lens_ptr,  # [num_seqs]
         alibi_slopes_ptr,  # [num_query_heads]
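To make the updated shape comment concrete, here is a minimal sketch of how one token's key (or value) vector could be located in a cache laid out as [num_blks, blk_size, num_kv_heads, head_size]. It assumes a contiguous, row-major buffer; the helper name `kv_cache_offset` and the example block table are hypothetical and do not come from the commit. The actual kernel may compute addresses from explicit strides instead.

```python
def kv_cache_offset(block_table, token_pos, kv_head,
                    blk_size, num_kv_heads, head_size):
    """Flat offset of the first element of one token's K (or V) vector.

    Assumption: the cache is contiguous and row-major with shape
    [num_blks, blk_size, num_kv_heads, head_size].
    """
    logical_blk = token_pos // blk_size      # which block of the sequence
    slot = token_pos % blk_size              # position inside that block
    physical_blk = block_table[logical_blk]  # row of block_tables for this seq
    return ((physical_blk * blk_size + slot) * num_kv_heads + kv_head) * head_size


# Hypothetical sequence: logical blocks 0 and 1 map to physical blocks 7 and 2.
block_table = [7, 2]
offset = kv_cache_offset(block_table, token_pos=5, kv_head=1,
                         blk_size=4, num_kv_heads=2, head_size=8)
print(offset)  # 152
```

With this layout the head_size elements of a single (token, head) pair are adjacent in memory, which is not the case for the two shapes in the old comments, where blk_size sits among the innermost dimensions.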