Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
77490c6f
Unverified
Commit
77490c6f
authored
Jun 15, 2024
by
Cyrus Leung
Committed by
GitHub
Jun 14, 2024
Browse files
[Core] Remove duplicate processing in async engine (#5525)
parent
48f589e1
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
13 deletions
+1
-13
vllm/engine/async_llm_engine.py
vllm/engine/async_llm_engine.py
+1
-13
No files found.
vllm/engine/async_llm_engine.py
View file @
77490c6f
...
...
@@ -580,21 +580,9 @@ class AsyncLLMEngine:
if
arrival_time
is
None
:
arrival_time
=
time
.
time
()
if
self
.
engine_use_ray
:
processed_inputs
=
await
self
.
engine
.
process_model_inputs_async
\
.
remote
(
# type: ignore
request_id
=
request_id
,
inputs
=
inputs
,
lora_request
=
lora_request
)
else
:
processed_inputs
=
await
self
.
engine
.
process_model_inputs_async
(
request_id
=
request_id
,
inputs
=
inputs
,
lora_request
=
lora_request
)
stream
=
self
.
_request_tracker
.
add_request
(
request_id
,
inputs
=
processed_
inputs
,
inputs
=
inputs
,
params
=
params
,
arrival_time
=
arrival_time
,
lora_request
=
lora_request
,
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment