Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
54d244f2
Unverified
Commit
54d244f2
authored
Apr 09, 2026
by
Cyrus Leung
Committed by
GitHub
Apr 09, 2026
Browse files
[UX] Improve error message for MM input too long (#39409)
Signed-off-by:
DarkLight1337
<
tlleungac@connect.ust.hk
>
parent
6c749399
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
3 deletions
+3
-3
vllm/v1/engine/input_processor.py
vllm/v1/engine/input_processor.py
+3
-3
No files found.
vllm/v1/engine/input_processor.py
View file @
54d244f2
...
...
@@ -405,11 +405,11 @@ class InputProcessor:
decoder_mm_positions
=
prompt_input
[
"mm_placeholders"
]
for
modality
,
mm_positions
in
decoder_mm_positions
.
items
():
for
mm_position
in
mm_positions
:
embed
_length
=
mm_position
.
get_num_embeds
()
if
embed
_length
>
self
.
mm_encoder_cache_size
:
num_
embed
s
=
mm_position
.
get_num_embeds
()
if
num_
embed
s
>
self
.
mm_encoder_cache_size
:
raise
ValueError
(
f
"The
{
prompt_type
}
prompt contains a(n)
{
modality
}
item "
f
"with
length
{
embed_length
}
, which exceeds the "
f
"with
{
num_embeds
}
embedding tokens
, which exceeds the "
f
"pre-allocated encoder cache size "
f
"
{
self
.
mm_encoder_cache_size
}
. Please reduce the input "
f
"size or increase the encoder cache size "
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment