Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
1ed30424
"vllm/vscode:/vscode.git/clone" did not exist on "babf52dade78ff3b1bea6cb6e9f4151dfd630251"
Commit
1ed30424
authored
Apr 27, 2025
by
lizhigong
Browse files
fix zero scheduler on v0.8.4
parent
351d607d
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
7 deletions
+2
-7
vllm/zero_overhead/v0/llm_engine.py
vllm/zero_overhead/v0/llm_engine.py
+2
-7
No files found.
vllm/zero_overhead/v0/llm_engine.py
View file @
1ed30424
...
...
@@ -16,7 +16,7 @@ from vllm.logger import init_logger
from
vllm.executor.executor_base
import
ExecutorBase
from
vllm.inputs
import
INPUT_REGISTRY
from
vllm.inputs.data
import
ProcessorInputs
from
vllm.inputs.parse
import
is_encoder_decoder
_inputs
from
vllm.inputs.parse
import
split_enc_dec
_inputs
from
vllm.inputs.preprocess
import
InputPreprocessor
from
vllm.inputs.registry
import
InputRegistry
from
vllm.lora.request
import
LoRARequest
...
...
@@ -573,12 +573,7 @@ class ZeroOverheadEngine(LLMEngine):
seq_id
=
next
(
self
.
seq_counter
)
eos_token_id
=
self
.
input_preprocessor
.
get_eos_token_id
(
lora_request
)
if
is_encoder_decoder_inputs
(
processed_inputs
):
decoder_inputs
=
processed_inputs
[
"decoder"
]
encoder_inputs
=
processed_inputs
[
"encoder"
]
else
:
decoder_inputs
=
processed_inputs
encoder_inputs
=
None
encoder_inputs
,
decoder_inputs
=
split_enc_dec_inputs
(
processed_inputs
)
seq
=
ZeroOverheadSequence
(
seq_id
,
decoder_inputs
,
block_size
,
eos_token_id
,
lora_request
,
prompt_adapter_request
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment