1. 01 Dec, 2023 2 commits
  2. 29 Nov, 2023 1 commit
  3. 27 Nov, 2023 1 commit
  4. 11 Sep, 2023 1 commit
    • Support codellama (#359) · 65c662f9
      Lyu Han authored
      * tmp
      
      * add demo for codellama inference
      
      * update
      
      * update
      
      * update
      
      * update codellama.md
      
      * export rope_theta
      
      * update
      
      * update doc
      
      * fix client.py
      
      * define SamplingParam
      
      * rollback 'end'
      
      * rotary_emb_base to rotary_embedding_base
      
      * change to baichuan2-7b
  5. 24 Aug, 2023 1 commit
  6. 22 Aug, 2023 1 commit
    • Add Restful API (#223) · d5c10e7a
      AllentDan authored
      * add restful api
      
      * refine
      
      * add simple doc
      
      * lint
      
      * add uvicorn requirement
      
      * more args
      
      * add llama2
      
      * docstring
      
      * update doc
      
      * save
      
      * refine
      
      * lint
      
      * better decode
      
      * add v1/embedding
      
      * add GenerateRequest
      
      * add llama2 chat template
      
      * correct profiling
      
      * update documents
      
      * add length judge
      
      * add faq
      
      * update doc and rename req_que to req_queue
      
      * fix md link, use get_logger, fix sequence_end bug
      
      * use another doc link for go to avoid lint error
      
      * add api_client.py
      
      * update doc
      
      * update doc
      
      * update function interface
      
      * update FAQ
      
      * resolve comments
  7. 18 Aug, 2023 2 commits
  8. 20 Jul, 2023 1 commit
  9. 18 Jul, 2023 1 commit
  10. 06 Jul, 2023 2 commits
  11. 05 Jul, 2023 1 commit
  12. 01 Jul, 2023 1 commit
    • Add lint action (#32) · fe46dac2
      AllentDan authored
      * temp
      
      * fix lint
      
      * csrc->src
      
      * remove clang-format
      
      * skip .rst
      
      * skip doc
      
      * clang-format
      
      version
      
      version
      
      * mat_B
  13. 25 Jun, 2023 1 commit
  14. 18 Jun, 2023 1 commit