Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
f54f8512
Unverified
Commit
f54f8512
authored
Oct 15, 2025
by
wang.yuqi
Committed by
GitHub
Oct 15, 2025
Browse files
[Model][2/N] Improve all pooling task | Support multi-vector retrieval (#25370)
Signed-off-by:
wang.yuqi
<
noooop@126.com
>
parent
d4d1a602
Changes
41
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
7 additions
and
6 deletions
+7
-6
vllm/v1/worker/gpu_model_runner.py
vllm/v1/worker/gpu_model_runner.py
+7
-6
No files found.
vllm/v1/worker/gpu_model_runner.py
View file @
f54f8512
...
@@ -1926,15 +1926,16 @@ class GPUModelRunner(LoRAModelRunnerMixin, KVConnectorModelRunnerMixin):
...
@@ -1926,15 +1926,16 @@ class GPUModelRunner(LoRAModelRunnerMixin, KVConnectorModelRunnerMixin):
supported_tasks
=
list
(
model
.
pooler
.
get_supported_tasks
())
supported_tasks
=
list
(
model
.
pooler
.
get_supported_tasks
())
if
(
if
self
.
scheduler_config
.
chunked_prefill_enabled
:
self
.
scheduler_config
.
chunked_prefill_enabled
if
"token_embed"
in
supported_tasks
:
and
"encode"
in
supported_tasks
supported_tasks
.
remove
(
"token_embed"
)
)
:
if
"token_classify"
in
supported_tasks
:
supported_tasks
.
remove
(
"
encode
"
)
supported_tasks
.
remove
(
"
token_classify
"
)
logger
.
debug_once
(
logger
.
debug_once
(
"Chunked prefill is not supported with "
"Chunked prefill is not supported with "
"encode task which using ALL pooling. "
"token_embed and token_classify tasks "
"which using ALL pooling. "
"Please turn off chunked prefill by "
"Please turn off chunked prefill by "
"`--no-enable-chunked-prefill` before using it."
"`--no-enable-chunked-prefill` before using it."
)
)
...
...
Prev
1
2
3
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment