Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
598bd74c
Unverified
Commit
598bd74c
authored
Sep 02, 2025
by
nathan
Committed by
GitHub
Sep 02, 2025
Browse files
Fix weights loading for Apertus (#24100)
Signed-off-by:
Nathan Ranchin
<
nranchin@student.ethz.ch
>
parent
24177984
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
6 additions
and
0 deletions
+6
-0
vllm/model_executor/models/apertus.py
vllm/model_executor/models/apertus.py
+6
-0
No files found.
vllm/model_executor/models/apertus.py
View file @
598bd74c
...
@@ -415,6 +415,12 @@ class ApertusModel(nn.Module):
...
@@ -415,6 +415,12 @@ class ApertusModel(nn.Module):
(
".qkv_proj"
,
".v_proj"
,
"v"
),
(
".qkv_proj"
,
".v_proj"
,
"v"
),
]
]
params_dict
=
dict
(
self
.
named_parameters
())
params_dict
=
dict
(
self
.
named_parameters
())
# we need to load the buffers for beta and eps (XIELU)
for
name
,
buffer
in
self
.
named_buffers
():
if
name
.
endswith
(
".beta"
)
or
name
.
endswith
(
".eps"
):
params_dict
[
name
]
=
buffer
loaded_params
:
set
[
str
]
=
set
()
loaded_params
:
set
[
str
]
=
set
()
for
name
,
loaded_weight
in
weights
:
for
name
,
loaded_weight
in
weights
:
if
"rotary_emb.inv_freq"
in
name
:
if
"rotary_emb.inv_freq"
in
name
:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment