Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
a869baca
Unverified
Commit
a869baca
authored
May 26, 2025
by
Cyrus Leung
Committed by
GitHub
May 26, 2025
Browse files
[Bugfix] Fix Llama GGUF initialization (#18717)
Signed-off-by:
DarkLight1337
<
tlleungac@connect.ust.hk
>
parent
82e2339b
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
1 deletion
+1
-1
vllm/model_executor/models/llama.py
vllm/model_executor/models/llama.py
+1
-1
No files found.
vllm/model_executor/models/llama.py
View file @
a869baca
...
@@ -208,7 +208,7 @@ class LlamaAttention(nn.Module):
...
@@ -208,7 +208,7 @@ class LlamaAttention(nn.Module):
quant_config
:
Optional
[
QuantizationConfig
])
->
None
:
quant_config
:
Optional
[
QuantizationConfig
])
->
None
:
is_neox_style
=
True
is_neox_style
=
True
is_gguf
=
quant_config
and
quant_config
.
get_name
()
==
"gguf"
is_gguf
=
quant_config
and
quant_config
.
get_name
()
==
"gguf"
if
is_gguf
and
self
.
config
.
model_type
==
"llama"
:
if
is_gguf
and
config
.
model_type
==
"llama"
:
is_neox_style
=
False
is_neox_style
=
False
self
.
rotary_emb
=
get_rope
(
self
.
rotary_emb
=
get_rope
(
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment