- 22 Feb, 2024 1 commit
-
-
RunningLeon authored
* add lmdeploy pytorch model * fix * speed up encoding and decoding * fix * change tokenizer
-
- 01 Feb, 2024 1 commit
-
-
RunningLeon authored
* fix * update * fix internlm1 * fix docs * remove sys
-
- 18 Jan, 2024 1 commit
-
-
RunningLeon authored
* update * update docs * add engine_config and gen_config in eval_config * update * fix * fix * fix * fix docstr * fix url
-
- 17 Jan, 2024 1 commit
-
-
RunningLeon authored
* update * fix * fix * fix
-
- 21 Dec, 2023 1 commit
-
-
RunningLeon authored
[Feature] Update configs for evaluating chat models like qwen, baichuan, llama2 using turbomind backend (#721) * add llama2 test * fix * test qwen chat-7b * test w4 * add baichuan2 * update * update * update configs and docs * update
-
- 21 Nov, 2023 1 commit
-
-
Lyu Han authored
* integrate turbomind python api * update * update user guide * update * fix according to reviewer's comments * fix error * fix linting * update user guide * remove debug log --------- Co-authored-by:Songyang Zhang <tonysy@users.noreply.github.com>
-