- 25 Oct, 2023 1 commit
RunningLeon authored
* add * import fire in main * wrap to speed up fire cli * update * update docs * update docs * fix * resolve comments * resolve conflict and add test for cli
- 23 Jul, 2023 1 commit
lvhan028 authored
* refactor model.py and support baichuan-7b * remove model_name * remove hard session_len * export tokenizer.py to target dir * remove model_name from client * remove model_name * update * correct throughput equation * fix session.response * update serving.md * update readme * update according to review comments * update * update * update * update
- 17 Jul, 2023 1 commit
Jaylin Lee authored
* [bugfix] Fix some docs' bug in 'serving' * [bugfix] Fix some docs' bug in 'serving'
- 11 Jul, 2023 1 commit
tpoisonooo authored
* docs(serving.md): typo * docs(README): quantization
- 05 Jul, 2023 1 commit
lvhan028 authored
* add performance * use png * update * update * update * update * update