Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
cfea9c04
Unverified
Commit
cfea9c04
authored
Nov 23, 2024
by
Chen Wu
Committed by
GitHub
Nov 22, 2024
Browse files
[Model] Fix Baichuan BNB online quantization (#10572)
Signed-off-by:
Chen Wu
<
cntryroa@gmail.com
>
parent
7d8ffb34
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
15 additions
and
0 deletions
+15
-0
vllm/model_executor/models/baichuan.py
vllm/model_executor/models/baichuan.py
+15
-0
No files found.
vllm/model_executor/models/baichuan.py
View file @
cfea9c04
...
...
@@ -350,6 +350,21 @@ class BaiChuanBaseForCausalLM(nn.Module, SupportsLoRA, SupportsPP):
embedding_modules
=
{}
embedding_padding_modules
=
[]
# BitandBytes specific attributes
default_bitsandbytes_target_modules
=
[
".W_pack."
,
".o_proj."
,
".down_proj."
,
".up_proj."
,
".gate_proj."
,
".up_proj."
,
]
bitsandbytes_stacked_params_mapping
=
{
# shard_name, weight_name, index
"gate_proj"
:
(
"gate_up_proj"
,
0
),
"up_proj"
:
(
"gate_up_proj"
,
1
),
}
def
__init__
(
self
,
*
,
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment