- 25 Oct, 2023 1 commit
-
-
RunningLeon authored
* add * import fire in main * wrap to speed up fire cli * update * update docs * update docs * fix * resolve commennts * resolve confict and add test for cli
-
- 24 Aug, 2023 1 commit
-
-
WRH authored
* support decode * unit test and benckmark and improve * remove some drafts * enable numerical test * minor * add some benchmark data * add more output * update interface * remove debugs * format * update docstring * remove print and add benchmark results * use logits & add main * fix rb * dump large * update test * update test decode * add decimal
-
- 11 Aug, 2023 1 commit
-
-
pppppM authored
* support kv cache offload * add dataloader docstring * complete gitignore * refactor collect mod fn * add calibration * fix lint * add observers and quantizers * fix lints * add global available mixin * fix lints * split batch inference * support smoothquant and awq * update export kv scales * fix lints * fix some bugs * update weight only usage * update usage * auto mapping and support smooth internlm * trust remote code * fix num head key error * fix bias error * align shape and pack order with llm-awq * modified according to LZHgrla's comments. * update gitignore * fix kv qparams export error * update usage * decouple calibrate and awq * update docstrings * update api name * update readme * update readme * update readme * update readme * update kv_qparams and readme * fix typos
-
- 07 Aug, 2023 1 commit
-
-
WRH authored
* add some dist utils * add model utils * add termio and basicstreamer * typo * fix world size * refactor chat and tested llama1 * add internlm adapter and support stoping criteria * concat with id for internlm * update docstring * update and support llama2 * typo * move docs to docs * update docstring of session manager * update docstring * update docs * fix accel none in model * fix and add test for tensor broadcast * fix session using typing to check type * add docstrings and comprehensive condition test * unit test for dist * fix session * split unittests of utils * typo * update control flow of accel * move test model * remove main in unittest * remove some log * remove some comments
-
- 21 Jul, 2023 1 commit
-
-
Kevin Wang authored
* [Fix] fix issue 127 * 优化防止接口更改 * 如果没有deepspeed用python启动需要手动加载到GPU上 * rollback the changes about max_out_tokens and delelte torch > 2.0 if statement * support kernel injection with customized deepspeed * spelling error * Update chat.py --------- Co-authored-by:wangruohui <12756472+wangruohui@users.noreply.github.com>
-
- 20 Jul, 2023 1 commit
-
-
WRH authored
-
- 17 Jul, 2023 1 commit
-
-
Kevin Wang authored
* [Fix] fix attempted_relative_import * use try...except...else
-
- 14 Jul, 2023 1 commit
-
-
A60 authored
-
- 12 Jul, 2023 1 commit
-
-
WRH authored
* add some docstrings. * update docstring. fix * ignore magic methods
-
- 11 Jul, 2023 1 commit
-
-
WRH authored
* previous merged * add chinese * support torch<2 * add a docstring * fix typo * rename torch submodule * rename to pytorch * rename in readme
-