Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
5a9db327
Commit
5a9db327
authored
Sep 10, 2024
by
zhuwenwen
Browse files
update fa interface param
parent
50ed6d0a
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
4 additions
and
4 deletions
+4
-4
vllm/attention/backends/rocm_flash_attn.py
vllm/attention/backends/rocm_flash_attn.py
+4
-4
No files found.
vllm/attention/backends/rocm_flash_attn.py
View file @
5a9db327
...
...
@@ -449,8 +449,8 @@ class ROCmFlashAttentionImpl(AttentionImpl):
v
=
value
,
cu_seqlens_q
=
prefill_meta
.
seq_start_loc
,
cu_seqlens_k
=
prefill_meta
.
seq_start_loc
,
max_seqlen
s
_q
=
prefill_meta
.
max_prefill_seq_len
,
max_seqlen
s
_k
=
prefill_meta
.
max_prefill_seq_len
,
max_seqlen_q
=
prefill_meta
.
max_prefill_seq_len
,
max_seqlen_k
=
prefill_meta
.
max_prefill_seq_len
,
softmax_scale
=
self
.
scale
,
causal
=
True
,
)
...
...
@@ -526,8 +526,8 @@ class ROCmFlashAttentionImpl(AttentionImpl):
v
=
value
,
cu_seqlens_q
=
prefill_meta
.
seq_start_loc
,
cu_seqlens_k
=
prefill_meta
.
seq_start_loc
,
max_seqlen
s
_q
=
prefill_meta
.
max_prefill_seq_len
,
max_seqlen
s
_k
=
prefill_meta
.
max_prefill_seq_len
,
max_seqlen_q
=
prefill_meta
.
max_prefill_seq_len
,
max_seqlen_k
=
prefill_meta
.
max_prefill_seq_len
,
softmax_scale
=
self
.
scale
,
causal
=
True
,
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment