Commit 7c35a179 authored by yangql's avatar yangql
Browse files

Update test_quant.py

parent f8070792
......@@ -14,7 +14,7 @@ def gptq():
bits=4, # 4 or 8
group_size=128,
damp_percent=0.01,
desc_act=False, # set to False can significantly speed up inference but the perplexity may slightly bad
desc_act=True, # set to False can significantly speed up inference but the perplexity may slightly bad
static_groups=False,
sym=True,
true_sequential=True,
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment