OpenDAS / vllm_cscc

Commit cf349c4a (Unverified)
Authored Nov 17, 2024 by Isotr0py
Committed by GitHub, Nov 16, 2024

[Bugfix][CPU] Fix CPU embedding runner with tensor parallel (#10394)

Signed-off-by: Isotr0py <2037008807@qq.com>

Parent: 905d0f0a

Showing 1 changed file with 4 additions and 0 deletions

vllm/worker/cpu_embedding_model_runner.py (+4 -0)
...
@@ -66,6 +66,10 @@ class CPUEmbeddingModelRunner(
         hidden_states = model_executable(**execute_model_kwargs)
 
+        # Only perform pooling in the driver worker.
+        if not self.is_driver_worker:
+            return []
+
         return [
             self.model.pooler(hidden_states=hidden_states,
                               pooling_metadata=model_input.pooling_metadata)
...
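
The pattern in this hunk can be sketched in isolation. The following is a minimal toy, not vLLM code: `ToyRunner`, `forward`, `pool`, and `run_all_ranks` are hypothetical names invented for illustration, and the mean pooling is only a stand-in for `self.model.pooler(...)`. It assumes rank 0 is the driver worker. As in the diff, every worker runs the forward pass, but only the driver pools and returns a result; the other workers return an empty list.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ToyRunner:
    """Hypothetical stand-in for a per-rank embedding model runner."""

    is_driver_worker: bool

    def forward(self, token_ids: List[int]) -> List[float]:
        # Toy "model": every rank computes hidden states, mirroring the
        # model_executable(**execute_model_kwargs) call in the diff.
        return [float(t) for t in token_ids]

    def pool(self, hidden_states: List[float]) -> float:
        # Toy mean pooling; a stand-in for self.model.pooler(...).
        return sum(hidden_states) / len(hidden_states)

    def execute_model(self, token_ids: List[int]) -> List[float]:
        hidden_states = self.forward(token_ids)
        # Only perform pooling in the driver worker.
        if not self.is_driver_worker:
            return []
        return [self.pool(hidden_states)]


def run_all_ranks(token_ids: List[int], world_size: int = 2) -> List[List[float]]:
    # Assumption for this sketch: rank 0 is the driver worker.
    runners = [ToyRunner(is_driver_worker=(rank == 0)) for rank in range(world_size)]
    return [runner.execute_model(token_ids) for runner in runners]
```

Calling `run_all_ranks([1, 2, 3])` yields one pooled value from the driver and an empty list from the second rank, so a caller that collects outputs only from the driver sees a single result per request.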