- 12 Dec, 2023 1 commit
-
-
Lyu Han authored
* simplify the header of the benchmark table * miss comma * fix lint
-
- 13 Nov, 2023 1 commit
-
-
pppppM authored
-
- 25 Oct, 2023 1 commit
-
-
RunningLeon authored
* add * import fire in main * wrap to speed up fire cli * update * update docs * update docs * fix * resolve commennts * resolve confict and add test for cli
-
- 12 Oct, 2023 1 commit
-
-
AllentDan authored
-
- 29 Aug, 2023 1 commit
-
-
tpoisonooo authored
* fix(kvint8): update doc * style(lmdeploy): format * style(kv_qparams.py): linting * fix lint * Update kv_int8.md * Update kv_int8.md --------- Co-authored-by:AllentDan <AllentDan@yeah.net>
-
- 21 Aug, 2023 1 commit
-
-
tpoisonooo authored
-
- 17 Aug, 2023 1 commit
-
-
tpoisonooo authored
* Update quantization.md * docs(quantization): update description * docs(README): rename quantization files
-
- 14 Aug, 2023 1 commit
-
-
tpoisonooo authored
* feat(quantization): kv cache use asymmetric
-
- 26 Jul, 2023 1 commit
-
-
Xin Li authored
* translate quantization doc * revise
-