OpenDAS / vllm_cscc
Commit 5f671cb4 (Unverified)
Authored Jan 28, 2025 by Robert Shaw; committed by GitHub, Jan 29, 2025

[V1] Improve Error Message for Unsupported Config (#12535)

Co-authored-by: Michael Goin <michael@neuralmagic.com>

Parent: bd02164c
Showing 1 changed file with 7 additions and 2 deletions

vllm/platforms/cuda.py (+7 -2)
...
@@ -120,13 +120,18 @@ class CudaPlatformBase(Platform):
         if parallel_config.worker_cls == "auto":
             if scheduler_config.is_multi_step:
                 if envs.VLLM_USE_V1:
-                    raise NotImplementedError
+                    raise NotImplementedError(
+                        "Multi-step scheduling is not supported (and not "
+                        "needed) on VLLM V1. Please launch without "
+                        "--num-scheduler-steps.")
                 else:
                     parallel_config.worker_cls = \
                         "vllm.worker.multi_step_worker.MultiStepWorker"
             elif vllm_config.speculative_config:
                 if envs.VLLM_USE_V1:
-                    raise NotImplementedError
+                    raise NotImplementedError(
+                        "Speculative decoding is not yet supported on VLLM V1."
+                    )
                 else:
                     parallel_config.worker_cls = \
                         "vllm.spec_decode.spec_decode_worker.create_spec_worker"
...
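To see the effect of the change in isolation, the branch structure of the hunk can be sketched as a standalone function. This is only an illustration under stated assumptions: `StubConfig` and `select_worker_cls` are hypothetical names that do not exist in vLLM, the three config objects in the diff are flattened into one stub, and the `use_v1` argument stands in for `envs.VLLM_USE_V1`. The branches elided by `...` in the diff are not reproduced.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class StubConfig:
    """Hypothetical stand-in for the config fields the hunk reads."""
    worker_cls: str = "auto"
    is_multi_step: bool = False
    speculative_config: Optional[object] = None


def select_worker_cls(cfg: StubConfig, use_v1: bool) -> str:
    """Mirror the two branches shown in the diff (hypothetical helper)."""
    if cfg.worker_cls != "auto":
        # An explicitly configured worker class is left alone.
        return cfg.worker_cls
    if cfg.is_multi_step:
        if use_v1:
            # Before the commit this was a bare `raise NotImplementedError`.
            raise NotImplementedError(
                "Multi-step scheduling is not supported (and not "
                "needed) on VLLM V1. Please launch without "
                "--num-scheduler-steps.")
        return "vllm.worker.multi_step_worker.MultiStepWorker"
    if cfg.speculative_config:
        if use_v1:
            raise NotImplementedError(
                "Speculative decoding is not yet supported on VLLM V1.")
        return "vllm.spec_decode.spec_decode_worker.create_spec_worker"
    # Remaining branches are elided in the diff; the stub keeps "auto".
    return cfg.worker_cls


if __name__ == "__main__":
    try:
        select_worker_cls(StubConfig(is_multi_step=True), use_v1=True)
    except NotImplementedError as exc:
        print(f"Unsupported config: {exc}")
```

With the bare `raise NotImplementedError` from the parent commit, `str(exc)` would be empty, so a user hitting this path would see only the exception type. The messages added here name the unsupported feature and, for multi-step scheduling, the flag to drop.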