1. 25 Oct, 2023 1 commit
    • RunningLeon's avatar
      Add more user-friendly CLI (#541) · 169d5169
      RunningLeon authored
      * add
      
      * import fire in main
      
      * wrap to speed up fire cli
      
      * update
      
      * update docs
      
      * update docs
      
      * fix
      
      * resolve commennts
      
      * resolve confict and add test for cli
      169d5169
  2. 11 Oct, 2023 1 commit
  3. 01 Sep, 2023 1 commit
    • AllentDan's avatar
      Decode generated token_ids incrementally (#309) · 9bfe03c6
      AllentDan authored
      * add incremental decoding for turbomind
      
      * update TIS
      
      * fix triton post processing
      
      * update doc
      
      * fix typo
      
      * SentencePieceTokenizer incremental decode, add qwen message prompt
      
      * docstring
      
      * update bot
      9bfe03c6
  4. 30 Aug, 2023 1 commit
  5. 24 Aug, 2023 1 commit
  6. 22 Aug, 2023 1 commit
    • AllentDan's avatar
      Add Restful API (#223) · d5c10e7a
      AllentDan authored
      * add restful api
      
      * refine
      
      * add simple doc
      
      * lint
      
      * add uvicorn requirement
      
      * more args
      
      * add llama2
      
      * docstring
      
      * update doc
      
      * save
      
      * refine
      
      * lint
      
      * better decode
      
      * add v1/embedding
      
      * add GenerateRequest
      
      * add llama2 chat template
      
      * correct profiling
      
      * update documents
      
      * add length judge
      
      * add faq
      
      * update doc and rename req_que to req_queue
      
      * fix md link, use get_logger, fix sequence_end bug
      
      * use another doc link for go to avoid lint error
      
      * add api_client.py
      
      * update doc
      
      * update doc
      
      * update function interface
      
      * update FAQ
      
      * resolve comments
      d5c10e7a