- 23 Jul, 2023 1 commit
-
-
lvhan028 authored
* refactor model.py and support baichuan-7b * remove model_name * remove hard session_len * export tokenizer.py to target dir * remove model_name from client * remove model_name * update * correct throughput equation * fix session.response * update serving.md * update readme * update according to review comments * update * update * update * update
-
- 17 Jul, 2023 1 commit
-
-
Jaylin Lee authored
* [bugfix] Fix some docs' bug in 'serving' * [bugfix] Fix some docs' bug in 'serving'
-
- 11 Jul, 2023 1 commit
-
-
tpoisonooo authored
* docs(serving.md): typo * docs(README): quantization
-
- 05 Jul, 2023 1 commit
-
-
lvhan028 authored
* add performance * use png * update * update * update * update * update
-