- 26 Jul, 2023 1 commit
Xin Li authored
* translate quantization doc
* revise
- 23 Jul, 2023 1 commit
lvhan028 authored
* refactor model.py and support baichuan-7b
* remove model_name
* remove hard session_len
* export tokenizer.py to target dir
* remove model_name from client
* remove model_name
* update
* correct throughput equation
* fix session.response
* update serving.md
* update readme
* update according to review comments
* update
* update
* update
* update
- 17 Jul, 2023 1 commit
Jaylin Lee authored
* [bugfix] Fix some docs' bug in 'serving'
* [bugfix] Fix some docs' bug in 'serving'
- 14 Jul, 2023 1 commit
lvhan028 authored
* move turbomind.md to docs/en
* update link
* update link
- 13 Jul, 2023 1 commit
del-zhenwu authored
- 11 Jul, 2023 2 commits
tpoisonooo authored
* docs(serving.md): typo
* docs(README): quantization
q.yao authored
* update contrib
* update links
- 05 Jul, 2023 1 commit
lvhan028 authored
* add performance
* use png
* update
* update
* update
* update
* update