Commits · e641dd86824554ec6174b8ddaf7de487dc730fd1 · OpenDAS / Lmdeploy

13 Nov, 2023 1 commit

[Docs] Update Supported Matrix (#679) · e641dd86

pppppM authored Nov 13, 2023

* update supported matrix

* change the default shard size when saving quantized weights

* baichuan2 kv8

e641dd86

03 Nov, 2023 1 commit
- [Fix] Qwen's quantization results are abnormal & Baichuan cannot be quantized (#605) · c15fbf47
  pppppM authored Nov 03, 2023
```
* fix awq

* adapt new qwen code

* adapt qwen 14b and baichuan2 7b

* add docstring

* add runtime error for qwen
```
  c15fbf47
25 Oct, 2023 1 commit

Add more user-friendly CLI (#541) · 169d5169

RunningLeon authored Oct 25, 2023

* add

* import fire in main

* wrap to speed up fire cli

* update

* update docs

* update docs

* fix

* resolve commennts

* resolve confict and add test for cli

169d5169

24 Aug, 2023 1 commit
- [Fix] Fix llama2 70b & qwen quantization error (#273) · d5cb0be2
  pppppM authored Aug 24, 2023
```
* fix llama2 70b

* fix qwen quantization

* remove pdb

* add faq
```
  d5cb0be2
11 Aug, 2023 1 commit

[Feature] Support AWQ (#108) · d3dbe179

pppppM authored Aug 11, 2023

* support kv cache offload

* add dataloader docstring

* complete gitignore

* refactor collect mod fn

* add calibration

* fix lint

* add observers and quantizers

* fix lints

* add global available mixin

* fix lints

* split batch inference

* support smoothquant and awq

* update export kv scales

* fix lints

* fix some bugs

* update weight only usage

* update usage

* auto mapping and support smooth internlm

* trust remote code

* fix num head key error

* fix bias error

* align shape and pack order with llm-awq

* modified according to LZHgrla's comments.

* update gitignore

* fix kv qparams export error

* update usage

* decouple calibrate and awq

* update docstrings

* update api name

* update readme

* update readme

* update readme

* update readme

* update kv_qparams and readme

* fix typos

d3dbe179