Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
18e7cbbb
Unverified
Commit
18e7cbbb
authored
Feb 03, 2026
by
Cyrus Leung
Committed by
GitHub
Feb 03, 2026
Browse files
[Bugfix] Fix startup hang for Granite Speech (#33699)
Signed-off-by:
DarkLight1337
<
tlleungac@connect.ust.hk
>
parent
f0d52517
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
8 additions
and
8 deletions
+8
-8
vllm/multimodal/budget.py
vllm/multimodal/budget.py
+8
-8
No files found.
vllm/multimodal/budget.py
View file @
18e7cbbb
...
...
@@ -54,6 +54,7 @@ class MultiModalBudget:
self
.
max_model_len
=
model_config
.
max_model_len
self
.
max_num_reqs
=
scheduler_config
.
max_num_seqs
with
set_default_torch_num_threads
():
# Avoid hang during startup
cache
=
mm_registry
.
processor_only_cache_from_config
(
vllm_config
)
processor
=
mm_registry
.
create_processor
(
model_config
,
cache
=
cache
)
...
...
@@ -64,7 +65,6 @@ class MultiModalBudget:
modality
for
modality
,
limit
in
mm_limits
.
items
()
if
limit
>
0
}
with
set_default_torch_num_threads
():
# Avoid hang during startup
all_mm_max_toks_per_item
=
get_mm_max_toks_per_item
(
model_config
,
mm_registry
,
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment