Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
d4d27517
Unverified
Commit
d4d27517
authored
Dec 17, 2025
by
Michael Goin
Committed by
GitHub
Dec 16, 2025
Browse files
Update note comment for flashinfer attention warmup (#30711)
Signed-off-by:
mgoin
<
mgoin64@gmail.com
>
parent
009a7738
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
4 deletions
+3
-4
vllm/model_executor/warmup/kernel_warmup.py
vllm/model_executor/warmup/kernel_warmup.py
+3
-4
No files found.
vllm/model_executor/warmup/kernel_warmup.py
View file @
d4d27517
...
@@ -49,13 +49,12 @@ def kernel_warmup(worker: "Worker"):
...
@@ -49,13 +49,12 @@ def kernel_warmup(worker: "Worker"):
except
NotImplementedError
:
except
NotImplementedError
:
return
False
return
False
# NOTE: we add check for empty attn_groups to avoid errors when
# deploying models such as E instances and encoder-only models.
# As for those models, worker.model_runner.attn_groups is empty.
# This change is made during EPD feature development.
if
(
if
(
not
worker
.
model_runner
.
is_pooling_model
not
worker
.
model_runner
.
is_pooling_model
and
worker
.
model_runner
.
attn_groups
and
worker
.
model_runner
.
attn_groups
# NOTE: This should be `any` instead of `all` but other hybrid attention
# backends don't support this dummy run. Once we remove
# `build_for_cudagraph_capture`, we can change it to `any`.
and
all
(
and
all
(
_is_flashinfer_backend
(
group
.
backend
)
_is_flashinfer_backend
(
group
.
backend
)
for
groups
in
worker
.
model_runner
.
attn_groups
for
groups
in
worker
.
model_runner
.
attn_groups
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment