Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
15310b51
Unverified
Commit
15310b51
authored
Aug 22, 2024
by
Michael Goin
Committed by
GitHub
Aug 22, 2024
Browse files
[Bugfix] Use LoadFormat values for `vllm serve --load-format` (#7784)
parent
57792ed4
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
6 deletions
+3
-6
vllm/engine/arg_utils.py
vllm/engine/arg_utils.py
+3
-6
No files found.
vllm/engine/arg_utils.py
View file @
15310b51
...
@@ -9,8 +9,8 @@ import torch
...
@@ -9,8 +9,8 @@ import torch
import
vllm.envs
as
envs
import
vllm.envs
as
envs
from
vllm.config
import
(
CacheConfig
,
DecodingConfig
,
DeviceConfig
,
from
vllm.config
import
(
CacheConfig
,
DecodingConfig
,
DeviceConfig
,
EngineConfig
,
LoadConfig
,
Lo
RAConfig
,
Model
Config
,
EngineConfig
,
LoadConfig
,
Lo
adFormat
,
LoRA
Config
,
ObservabilityConfig
,
ParallelConfig
,
ModelConfig
,
ObservabilityConfig
,
ParallelConfig
,
PromptAdapterConfig
,
SchedulerConfig
,
PromptAdapterConfig
,
SchedulerConfig
,
SpeculativeConfig
,
TokenizerPoolConfig
)
SpeculativeConfig
,
TokenizerPoolConfig
)
from
vllm.executor.executor_base
import
ExecutorBase
from
vllm.executor.executor_base
import
ExecutorBase
...
@@ -214,10 +214,7 @@ class EngineArgs:
...
@@ -214,10 +214,7 @@ class EngineArgs:
'--load-format'
,
'--load-format'
,
type
=
str
,
type
=
str
,
default
=
EngineArgs
.
load_format
,
default
=
EngineArgs
.
load_format
,
choices
=
[
choices
=
[
f
.
value
for
f
in
LoadFormat
],
'auto'
,
'pt'
,
'safetensors'
,
'npcache'
,
'dummy'
,
'tensorizer'
,
'bitsandbytes'
],
help
=
'The format of the model weights to load.
\n\n
'
help
=
'The format of the model weights to load.
\n\n
'
'* "auto" will try to load the weights in the safetensors format '
'* "auto" will try to load the weights in the safetensors format '
'and fall back to the pytorch bin format if safetensors format '
'and fall back to the pytorch bin format if safetensors format '
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment