OpenDAS / vllm_cscc

Commit ddbb0d23 (Unverified)
Authored Mar 10, 2026 by Nick Hill
Committed by GitHub, Mar 10, 2026
Parent: 9efc3bdc

[Model Runner V2] Fix mm input embeddings lookup (#36588)

Signed-off-by: Nick Hill <nickhill123@gmail.com>

Showing 1 changed file with 4 additions and 1 deletion

vllm/v1/worker/gpu/model_states/default.py (+4 -1)

@@ -98,8 +98,11 @@ class DefaultModelState(ModelState):
             req_states.prefill_len.np[input_batch.idx_mapping_np],
             req_states.num_computed_prefill_tokens[input_batch.idx_mapping_np],
         )
+        # Use unpadded input_ids to match is_mm_embed size (num_tokens).
+        # input_batch.input_ids may be padded for CUDA graphs.
+        input_ids_unpadded = input_batch.input_ids[: input_batch.num_tokens]
         inputs_embeds = self.encoder_runner.get_inputs_embeds(
-            input_batch.input_ids, mm_embeds, is_mm_embed
+            input_ids_unpadded, mm_embeds, is_mm_embed
         )
         return inputs_embeds[: input_batch.num_tokens_after_padding]
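
The code comment in the diff says that `is_mm_embed` has `num_tokens` entries while `input_batch.input_ids` may be longer because of CUDA graph padding. The following is a minimal pure-Python sketch of why slicing first matters. It is not vLLM's implementation: the `get_inputs_embeds` function here is a hypothetical stand-in with an extra `embed_table` parameter, the explicit length check is an assumption about how the mismatch would surface, and all values are made up for illustration.

```python
def get_inputs_embeds(input_ids, mm_embeds, is_mm_embed, embed_table):
    """Hypothetical stand-in: look up text embeddings, then overwrite
    the positions flagged in is_mm_embed with multimodal embeddings."""
    if len(input_ids) != len(is_mm_embed):
        raise ValueError(
            f"input_ids has {len(input_ids)} entries "
            f"but is_mm_embed has {len(is_mm_embed)}"
        )
    embeds = [embed_table[token_id] for token_id in input_ids]
    mm_iter = iter(mm_embeds)
    for pos, is_mm in enumerate(is_mm_embed):
        if is_mm:
            embeds[pos] = next(mm_iter)
    return embeds


# Illustrative values only: 4 real tokens in a buffer padded to 8.
num_tokens = 4
padded_input_ids = [11, 0, 0, 12, 0, 0, 0, 0]
is_mm_embed = [False, True, True, False]  # sized to num_tokens
mm_embeds = ["img0", "img1"]
embed_table = {0: "pad", 11: "t11", 12: "t12"}

# Before the fix: the padded buffer is passed and the sizes disagree.
try:
    get_inputs_embeds(padded_input_ids, mm_embeds, is_mm_embed, embed_table)
    padded_call_failed = False
except ValueError:
    padded_call_failed = True

# After the fix: slice to num_tokens first so the sizes match.
input_ids_unpadded = padded_input_ids[:num_tokens]
result = get_inputs_embeds(input_ids_unpadded, mm_embeds, is_mm_embed, embed_table)
```

In this toy version the padded call is rejected, while the unpadded call places the two multimodal embeddings at positions 1 and 2. The commit's final slice to `num_tokens_after_padding` is unchanged by the fix and is not modeled here.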