OpenDAS / vllm_cscc
Commit d919ecc7 (Unverified)
Authored Jun 15, 2024 by Alexander Matveev
Committed by GitHub, Jun 15, 2024

add gptq_marlin test for bug report https://github.com/vllm-project/vllm/issues/5088 (#5145)

Parent: e691918e
Showing 1 changed file with 3 additions and 0 deletions

tests/models/test_gptq_marlin.py @ d919ecc7
@@ -40,6 +40,9 @@ MODELS = [
    ("TheBloke/TinyLlama-1.1B-Chat-v1.0-GPTQ", "gptq-8bit-128g-actorder_True"),
    # 8-bit, act_order==True, group_size=32
    ("TheBloke/TinyLlama-1.1B-Chat-v1.0-GPTQ", "gptq-8bit-32g-actorder_True"),
    # 4-bit, act_order==True, group_size=128
    ("TechxGenus/gemma-1.1-2b-it-GPTQ", "main")
]
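
Each entry in the hunk pairs a model repository with a revision, and the comments spell out the quantization settings (bit width, act_order, group size). For the TheBloke entries those settings also appear in the revision name, while the new gemma entry uses "main" and relies on its comment alone. The following is a minimal sketch of that relationship, not code from the test file: the `GPTQRevision` type, the `parse_revision` helper, and the regular expression are hypothetical, and the naming pattern is inferred only from the entries visible in this diff.

```python
import re
from typing import NamedTuple, Optional


class GPTQRevision(NamedTuple):
    """Quantization settings encoded in a revision name (hypothetical helper type)."""
    bits: int
    group_size: int
    act_order: bool


# Assumed pattern, inferred only from names such as "gptq-8bit-32g-actorder_True".
_REVISION_PATTERN = re.compile(r"^gptq-(\d+)bit-(\d+)g-actorder_(True|False)$")


def parse_revision(revision: str) -> Optional[GPTQRevision]:
    """Return the settings encoded in the revision name, or None if it encodes none."""
    match = _REVISION_PATTERN.match(revision)
    if match is None:
        return None
    return GPTQRevision(
        bits=int(match.group(1)),
        group_size=int(match.group(2)),
        act_order=match.group(3) == "True",
    )


# The (model, revision) pairs visible in the hunk.
MODELS = [
    ("TheBloke/TinyLlama-1.1B-Chat-v1.0-GPTQ", "gptq-8bit-128g-actorder_True"),
    ("TheBloke/TinyLlama-1.1B-Chat-v1.0-GPTQ", "gptq-8bit-32g-actorder_True"),
    ("TechxGenus/gemma-1.1-2b-it-GPTQ", "main"),
]

for model, revision in MODELS:
    print(model, revision, parse_revision(revision))
```

Under these assumptions the "main" revision parses to `None`, which is why the added entry needs its explanatory comment.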