xdb4_94051 / vllm
Commit b9bcdc71 (unverified)
Authored Dec 11, 2023 by Woosuk Kwon; committed by GitHub on Dec 11, 2023
Change the load format to pt for Mixtral (#2028)
Parent: 4ff02039
Showing 1 changed file with 10 additions and 0 deletions
vllm/config.py
@@ -119,6 +119,16 @@ class ModelConfig:
                 # Force ROCm to load from pt weights if nothing specific is set
                 if load_format == "auto":
                     load_format = "pt"
+
+        # FIXME(woosuk): This is a temporary hack. Support safetensor weights.
+        architectures = getattr(self.hf_config, "architectures", [])
+        if "MixtralForCausalLM" in architectures and load_format != "pt":
+            logger.info(
+                "Currently, only 'pt' format is supported for Mixtral. "
+                "Changing the format to 'pt'. This may re-download the "
+                "weights if you have downloaded the safetensor weights.")
+            load_format = "pt"
+
         self.load_format = load_format
 
     def _verify_tokenizer_mode(self) -> None:
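The override in this hunk can be exercised in isolation. The following is a minimal sketch, not the code from vllm/config.py: `resolve_load_format` is a hypothetical standalone helper, and the `SimpleNamespace` objects stand in for a Hugging Face config. It assumes only what the diff shows, namely that a config whose `architectures` list contains "MixtralForCausalLM" has its load format forced to "pt".

```python
import logging
from types import SimpleNamespace

logger = logging.getLogger(__name__)


def resolve_load_format(hf_config, load_format: str) -> str:
    """Hypothetical helper mirroring the Mixtral override in the diff."""
    # A config without an `architectures` attribute is treated as an empty list.
    architectures = getattr(hf_config, "architectures", [])
    if "MixtralForCausalLM" in architectures and load_format != "pt":
        logger.info(
            "Currently, only 'pt' format is supported for Mixtral. "
            "Changing the format to 'pt'.")
        return "pt"
    # Every other architecture keeps the requested format unchanged.
    return load_format


# Stand-in configs for illustration; real ones come from the model's config.
mixtral_cfg = SimpleNamespace(architectures=["MixtralForCausalLM"])
llama_cfg = SimpleNamespace(architectures=["LlamaForCausalLM"])
bare_cfg = SimpleNamespace()  # no `architectures` attribute at all

mixtral_result = resolve_load_format(mixtral_cfg, "safetensors")
llama_result = resolve_load_format(llama_cfg, "safetensors")
bare_result = resolve_load_format(bare_cfg, "auto")
```

With these inputs, only the Mixtral config is switched to "pt"; the other two keep the format they were given. Note that, as in the diff, the `getattr` default covers a missing attribute but not an attribute explicitly set to `None`.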