Commit 43f75f75 authored by AllentDan, committed by GitHub

fix auto_awq readme (#228)

* fix auto_awq readme

* hide w_sym option
parent 902a3e16
......@@ -182,8 +182,8 @@ LMDeploy uses AWQ algorithm for model weight quantization
```
python3 -m lmdeploy.lite.apis.auto_awq \
--model $HF_MODEL \
--w_bits 4 \ # Bit number for weight quantization
--w_sym False \ # Whether to use symmetric quantization for weights
--w_group_size 128 \ # Group size for weight quantization statistics
--work_dir $WORK_DIR \ # Directory saving quantization parameters from Step 1
```
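Note that a trailing `\` followed by a space and a `#` comment does not continue the line in a POSIX shell, so the annotated command above is likely meant as documentation rather than something to paste verbatim. A minimal sketch of a copy-pasteable form follows. It assumes only the flags visible in both hunks (`--model`, `--w_bits`, `--w_group_size`, `--work_dir`) and uses placeholder paths that are purely illustrative. `--w_sym` is left out because the commit message says that option is being hidden.

```shell
# Placeholder values (assumptions for illustration); export real paths beforehand.
HF_MODEL="${HF_MODEL:-/path/to/hf_model}"
WORK_DIR="${WORK_DIR:-/path/to/work_dir}"

# Build the argument list one flag at a time so each line can carry a comment.
set -- python3 -m lmdeploy.lite.apis.auto_awq
set -- "$@" --model "$HF_MODEL"
set -- "$@" --w_bits 4              # Bit number for weight quantization
set -- "$@" --w_group_size 128      # Group size for weight quantization statistics
set -- "$@" --work_dir "$WORK_DIR"  # Directory saving quantization parameters from Step 1

# Dry run: print the assembled command. Replace this line with "$@" to execute it.
printf '%s ' "$@"; echo
```

Printing the command first makes it easy to check the flag values before starting a long quantization run.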
......
......@@ -180,6 +180,7 @@ LMDeploy uses the [AWQ](https://arxiv.org/abs/2306.00978) algorithm for model weight
```
python3 -m lmdeploy.lite.apis.auto_awq \
--model $HF_MODEL \
--w_bits 4 \ # Bit number for weight quantization
--w_group_size 128 \ # Group size for weight quantization statistics
--work_dir $WORK_DIR \ # Directory saving quantization parameters from Step 1
......