Commit a869baca, authored by Cyrus Leung, committed by GitHub

[Bugfix] Fix Llama GGUF initialization (#18717)


Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
parent 82e2339b
@@ -208,7 +208,7 @@ class LlamaAttention(nn.Module):
                          quant_config: Optional[QuantizationConfig]) -> None:
         is_neox_style = True
         is_gguf = quant_config and quant_config.get_name() == "gguf"
-        if is_gguf and self.config.model_type == "llama":
+        if is_gguf and config.model_type == "llama":
             is_neox_style = False
         self.rotary_emb = get_rope(
......
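
The one-line change above reads `model_type` from the `config` argument instead of `self.config`. The sketch below illustrates the likely failure mode, under the assumption (not stated in the commit) that the attention module never stores `config` on `self`. `FakeQuantConfig` and `AttentionSketch` are hypothetical stand-ins written for this illustration, not vLLM classes, and the `use_self_config` switch exists only to compare the two forms side by side.

```python
from types import SimpleNamespace


class FakeQuantConfig:
    """Hypothetical stand-in that exposes only get_name()."""

    def __init__(self, name: str) -> None:
        self._name = name

    def get_name(self) -> str:
        return self._name


class AttentionSketch:
    """Hypothetical stand-in: config is passed in but never saved on self."""

    def __init__(self, config, quant_config, use_self_config=False):
        self.is_neox_style = self._pick_rope_style(
            config, quant_config, use_self_config)

    def _pick_rope_style(self, config, quant_config, use_self_config):
        is_neox_style = True
        is_gguf = quant_config and quant_config.get_name() == "gguf"
        if use_self_config:
            # Pre-fix form: self.config is only evaluated when is_gguf is
            # truthy, so the missing attribute surfaces only for GGUF models.
            matches = is_gguf and self.config.model_type == "llama"
        else:
            # Post-fix form: read model_type from the argument.
            matches = is_gguf and config.model_type == "llama"
        if matches:
            is_neox_style = False
        return is_neox_style


llama_cfg = SimpleNamespace(model_type="llama")

# Post-fix: GGUF Llama selects the non-NeoX rotary style.
fixed = AttentionSketch(llama_cfg, FakeQuantConfig("gguf"))

# Pre-fix: the same inputs fail because self.config was never set.
try:
    AttentionSketch(llama_cfg, FakeQuantConfig("gguf"), use_self_config=True)
    prefix_error = None
except AttributeError as exc:
    prefix_error = exc
```

Because `and` short-circuits, the pre-fix expression never touches `self.config` when no GGUF quantization config is supplied, which would explain why only GGUF initialization was affected.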