Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
10546f92
Unverified
Commit
10546f92
authored
Feb 02, 2026
by
Roger Wang
Committed by
GitHub
Feb 03, 2026
Browse files
[Bugfix] Fix mm budget setting for Qwen Omni models (#33634)
Signed-off-by:
Roger Wang
<
hey@rogerw.io
>
parent
e69c990c
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
5 additions
and
0 deletions
+5
-0
vllm/multimodal/budget.py
vllm/multimodal/budget.py
+5
-0
No files found.
vllm/multimodal/budget.py
View file @
10546f92
...
...
@@ -72,9 +72,14 @@ class MultiModalBudget:
mm_counts
=
dict
.
fromkeys
(
active_modalities
,
1
),
)
# Some models (e.g., Qwen3Omni with use_audio_in_video=True) share
# placeholders between modalities, so not all active modalities will
# have their own entry in the returned dict. We filter to only include
# modalities that have independent placeholder tokens.
mm_max_toks_per_item
=
{
modality
:
all_mm_max_toks_per_item
[
modality
]
for
modality
in
active_modalities
if
modality
in
all_mm_max_toks_per_item
}
encoder_compute_budget
,
encoder_cache_size
=
compute_mm_encoder_budget
(
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment