OpenDAS / vllm_cscc
Commit 4e0b233d
Authored Oct 23, 2024 by zhuwenwen
update lm_head weight to support llama3.2
Parent: aba40fda
Showing 1 changed file with 2 additions and 2 deletions
--- a/vllm/model_executor/model_loader/utils.py
+++ b/vllm/model_executor/model_loader/utils.py
@@ -30,7 +30,7 @@ def get_model_architecture(
             os.environ['LLAMA_NN'] = '0'
         else:
             os.environ['LLAMA_NN'] = '1'
-        if architectures == ['BloomForCausalLM']:
+        if architectures == ['BloomForCausalLM'] or architectures == ['LlamaForCausalLM']:
             os.environ['LM_TN'] = '1'
         else:
             os.environ['LM_TN'] = '0'
@@ -50,7 +50,7 @@ def get_model_architecture(
             os.environ['AWQ_PAD'] = '0'
     else:
         os.environ['LLAMA_NN'] = '0'
-        os.environ['LM_TN'] = '0'
+        os.environ['LM_TN'] = '1'
         os.environ['GEMM_PAD'] = '0'
         os.environ['FA_PAD'] = '0'
         os.environ['AWQ_PAD'] = '0'
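The first hunk widens the condition that sets LM_TN to '1' so that it also matches LlamaForCausalLM. A minimal sketch of that selection logic follows, assuming `architectures` is a list of architecture names as in the diff. The helper names `lm_tn_flag` and `apply_lm_tn` and the constant `_LM_TN_ARCHITECTURES` are hypothetical and do not appear in the commit; what LM_TN controls downstream is not shown in this diff.

```python
import os

# Hypothetical constant: the two exact architecture lists the new condition accepts.
_LM_TN_ARCHITECTURES = (["BloomForCausalLM"], ["LlamaForCausalLM"])


def lm_tn_flag(architectures):
    """Return '1' if `architectures` equals one of the accepted lists, else '0'.

    This mirrors the `==` comparisons in the diff: the whole list must match,
    so a list with more than one entry yields '0'.
    """
    return "1" if architectures in _LM_TN_ARCHITECTURES else "0"


def apply_lm_tn(architectures, environ=os.environ):
    """Write the flag into an environment mapping (os.environ by default)."""
    environ["LM_TN"] = lm_tn_flag(architectures)
    return environ["LM_TN"]
```

Passing a plain dict as `environ` lets the logic be exercised without touching the real process environment. Because the comparison is whole-list equality, a configuration that lists more than one architecture would still get '0' on this branch.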