Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
ac0bb2c3
Unverified
Commit
ac0bb2c3
authored
Nov 12, 2025
by
Lukas Geiger
Committed by
GitHub
Nov 12, 2025
Browse files
[Core] Cache `vllm_is_batch_invariant` (#28304)
Signed-off-by:
Lukas Geiger
<
lukas.geiger94@gmail.com
>
parent
f31419ed
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
0 deletions
+2
-0
vllm/model_executor/layers/batch_invariant.py
vllm/model_executor/layers/batch_invariant.py
+2
-0
No files found.
vllm/model_executor/layers/batch_invariant.py
View file @
ac0bb2c3
...
...
@@ -4,6 +4,7 @@ import contextlib
import
os
from
collections
import
namedtuple
from
collections.abc
import
Callable
from
functools
import
cache
from
typing
import
Any
import
torch
...
...
@@ -857,6 +858,7 @@ def get_batch_invariant_attention_block_size() -> AttentionBlockSize:
return
AttentionBlockSize
(
block_m
=
16
,
block_n
=
16
)
@
cache
def
vllm_is_batch_invariant
():
env_key
=
"VLLM_BATCH_INVARIANT"
is_overridden
=
False
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment