Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
change
sglang
Commits
22a6b9fc
"torch_sparse/view.py" did not exist on "57852a6664ce58324dbeab0db13837ffb9005930"
Unverified
Commit
22a6b9fc
authored
Jun 12, 2025
by
Binyao Jiang
Committed by
GitHub
Jun 12, 2025
Browse files
Remove unnecessary metadata_expand.max_seq_len_k operations in fa3 to… (#7140)
parent
b02df20a
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
0 additions
and
10 deletions
+0
-10
python/sglang/srt/layers/attention/flashattention_backend.py
python/sglang/srt/layers/attention/flashattention_backend.py
+0
-10
No files found.
python/sglang/srt/layers/attention/flashattention_backend.py
View file @
22a6b9fc
...
...
@@ -394,7 +394,6 @@ class FlashAttentionBackend(AttentionBackend):
dtype
=
torch
.
int32
,
)
metadata_expand
.
max_seq_len_q
=
1
metadata_expand
.
max_seq_len_k
=
self
.
speculative_step_id
+
1
metadata_expand
.
cu_seqlens_q
=
torch
.
arange
(
0
,
metadata_expand
.
cache_seqlens_int32
.
numel
()
+
1
,
...
...
@@ -550,9 +549,6 @@ class FlashAttentionBackend(AttentionBackend):
),
(
1
,
0
),
)
metadata_expand
.
max_seq_len_k
=
(
metadata_expand
.
cache_seqlens_int32
.
max
().
item
()
)
self
.
forward_metadata_spec_decode_expand
=
metadata_expand
elif
forward_batch
.
forward_mode
.
is_extend_or_draft_extend_or_mixed
():
metadata
.
cache_seqlens_int32
=
seqlens_in_batch
.
to
(
torch
.
int32
)
...
...
@@ -1421,9 +1417,6 @@ class FlashAttentionBackend(AttentionBackend):
]
)
metadata_expand
.
max_seq_len_q
=
1
metadata_expand
.
max_seq_len_k
=
(
self
.
speculative_step_id
+
1
)
# , do this in replay
metadata_expand
.
cu_seqlens_q
=
(
self
.
draft_decode_metadata_topk_expand
[
"cu_seqlens_q"
][
:
bs
*
self
.
topk
+
1
...
...
@@ -1766,9 +1759,6 @@ class FlashAttentionBackend(AttentionBackend):
dtype
=
torch
.
int32
,
)
)
metadata_expand
.
max_seq_len_k
=
(
metadata_expand
.
cache_seqlens_int32
.
max
().
item
()
)
elif
forward_mode
.
is_draft_extend
():
metadata
=
self
.
draft_extend_metadata
[
bs
]
metadata
.
cache_seqlens_int32
.
copy_
(
seq_lens
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment