1. 04 Jul, 2023 1 commit
    • tpoisonooo's avatar
      docs(project): add quantization test results (#46) · 197b3ee1
      tpoisonooo authored
      * docs(README): update description
      
      * docs(project): add quantization test results
      
      * docs(README): reorder
      
      * docs(quantization): add more description
      
      * docs(README): remove openmmlab badge
      
      * docs(README): scale up image
      
      * docs(dir): add zh_cn subdir
      197b3ee1
  2. 03 Jul, 2023 1 commit
  3. 01 Jul, 2023 2 commits
  4. 30 Jun, 2023 2 commits
  5. 29 Jun, 2023 1 commit
    • AllentDan's avatar
      Add webui (#27) · 0cc48011
      AllentDan authored
      * add webui
      
      * update readme
      
      * resolve comments
      
      * readme
      0cc48011
  6. 28 Jun, 2023 1 commit
    • tpoisonooo's avatar
      feat(src): add kv cache int8 quantization (#22) · cc93136e
      tpoisonooo authored
      * feat(src): add int8 and compile passed
      
      * feat(kernels): fix
      
      * feat(llama): update kernel
      
      * feat(src): add debug
      
      * fix(kernel): k_cache use int8_t pointer
      
      * style(llama): clean code
      
      * feat(deploy.py): revert to enable fmha
      
      * style(LlamaV2): clean code
      
      * feat(deploy.py): add default quant policy
      cc93136e
  7. 20 Jun, 2023 2 commits
  8. 18 Jun, 2023 1 commit