"app/vscode:/vscode.git/clone" did not exist on "5208cf09b13a5630ce967e0bccd1821ac8d5309a"
- 11 Sep, 2023 2 commits
- 08 Sep, 2023 1 commit
-
-
WRH authored
* support baichuan2-chat * update args from generation config * update deploy.py * update readme * tested with tp * step-1 when last id is eos * add news --------- Co-authored-by:chenxin <chenxin@pjlab.org.cn>
-
- 01 Sep, 2023 1 commit
-
-
AllentDan authored
* add incremental decoding for turbomind * update TIS * fix triton post processing * update doc * fix typo * SentencePieceTokenizer incremental decode, add qwen message prompt * docstring * update bot
-
- 22 Aug, 2023 1 commit
-
-
AllentDan authored
* add restful api * refine * add simple doc * lint * add uvicorn requirement * more args * add llama2 * docstring * update doc * save * refine * lint * better decode * add v1/embedding * add GenerateRequest * add llama2 chat template * correct profiling * update documents * add length judge * add faq * update doc and rename req_que to req_queue * fix md link, use get_logger, fix sequence_end bug * use another doc link for go to avoid lint error * add api_client.py * update doc * update doc * update function interface * update FAQ * resolve comments
-
- 21 Aug, 2023 1 commit
-
-
AllentDan authored
* pass args like meta_prompt to model * update chatbot * update * rollback * update llama2 and qwen * refine
-
- 18 Aug, 2023 1 commit
-
-
Li Zhang authored
* qwen support * dynamic ntk & logn attn * fix ntk & add chat template * fix ntk scaling & stop words * fix lint * add tiktoken to requirements.txt * fix tokenizer, set model format automatically * update model.py * update readme * fix lint
-
- 16 Aug, 2023 1 commit
-
-
Lyu Han authored
-
- 23 Jul, 2023 1 commit
-
-
lvhan028 authored
* refactor model.py and support baichuan-7b * remove model_name * remove hard session_len * export tokenizer.py to target dir * remove model_name from client * remove model_name * update * correct throughput equation * fix session.response * update serving.md * update readme * update according to review comments * update * update * update * update
-
- 20 Jul, 2023 1 commit
-
-
q.yao authored
* add llama2 template * update readme and fix lint * update readme * add bos * add bos * remove bos * Update model.py --------- Co-authored-by:grimoire <yaoqian@pjlab.org.cn>
-
- 19 Jul, 2023 2 commits
- 14 Jul, 2023 2 commits
- 12 Jul, 2023 1 commit
-
-
lvhan028 authored
* add docstring * update * update * fix according to review results
-
- 05 Jul, 2023 1 commit
-
-
lvhan028 authored
* update internlm model * update * update * update * update * update temperature, topk and top_p * update * update * loosen log level
-
- 30 Jun, 2023 1 commit
-
-
lvhan028 authored
* change llmdeploy to lmdeploy * update logo * update readme
-
- 25 Jun, 2023 1 commit
-
-
lvhan028 authored
* remove constraints on model name * remove duplicate model converter * add profile * get eos and bos from server * update stop_words * update sequence_length when the last generated token is eos_id * fix * fix * check-in models * valicate model_name * make stop_words as property * debug profiling * better stats * fix assistant reponse * update profile serving * update * update
-