1. 08 May, 2025 1 commit
  2. 16 Apr, 2025 1 commit
  3. 29 Aug, 2024 2 commits
  4. 21 Aug, 2024 1 commit
  5. 20 Aug, 2024 2 commits
  6. 13 Jul, 2024 1 commit
  7. 23 Feb, 2024 3 commits
  8. 31 Jan, 2024 4 commits
  9. 24 Jan, 2024 1 commit
  10. 11 Jan, 2024 1 commit
  11. 22 Dec, 2023 2 commits
  12. 30 Nov, 2023 2 commits
  13. 23 Nov, 2023 2 commits
  14. 22 Nov, 2023 5 commits
  15. 16 Nov, 2023 1 commit
  16. 15 Nov, 2023 1 commit
  17. 25 Oct, 2023 1 commit
    • Add more user-friendly CLI (#541) · 169d5169
      RunningLeon authored
      * add
      
      * import fire in main
      
      * wrap to speed up fire cli
      
      * update
      
      * update docs
      
      * update docs
      
      * fix
      
      * resolve comments
      
      * resolve conflict and add test for cli
  18. 19 Oct, 2023 1 commit
  19. 12 Oct, 2023 2 commits
  20. 25 Sep, 2023 1 commit
  21. 20 Sep, 2023 1 commit
    • Support InternLM 20B (#440) · df7955de
      Lyu Han authored
      * better profiler
      
      * wait for releasing mem
      
      * remove fire
      
      * remove support for multiple model benchmark
      
      * comments
      
      * support actual seqlen
      
      * change chat template
      
      * update
      
      * fix ut
      
      * int->size_t
      
      * output more details
      
      * correct tp
      
      * rollback
      
      * update
      
      * update readme
      
      * add 'internlm-chat' as the default tag for internlm chat models
      
      * rollback tokenizer
      
      ---------
      Co-authored-by: AllentDan <AllentDan@yeah.net>
      Co-authored-by: grimoire <yaoqian@pjlab.org.cn>
  22. 11 Sep, 2023 1 commit
    • Support codellama (#359) · 65c662f9
      Lyu Han authored
      * tmp
      
      * add demo for codellama inference
      
      * update
      
      * update
      
      * update
      
      * update codellama.md
      
      * export rope_theta
      
      * update
      
      * update doc
      
      * fix client.py
      
      * define SamplingParam
      
      * rollback 'end'
      
      * rotary_emb_base to rotary_embedding_base
      
      * change to baichuan2-7b
  23. 08 Sep, 2023 1 commit
  24. 06 Sep, 2023 1 commit
  25. 05 Sep, 2023 1 commit