1. 16 Aug, 2023 3 commits
  2. 15 Aug, 2023 2 commits
  3. 14 Aug, 2023 7 commits
  4. 11 Aug, 2023 1 commit
    • pppppM's avatar
      [Feature] Support AWQ (#108) · d3dbe179
      pppppM authored
      * support kv cache offload
      
      * add dataloader docstring
      
      * complete gitignore
      
      * refactor collect mod fn
      
      * add calibration
      
      * fix lint
      
      * add observers and quantizers
      
      * fix lints
      
      * add global available mixin
      
      * fix lints
      
      * split batch inference
      
      * support smoothquant and awq
      
      * update export kv scales
      
      * fix lints
      
      * fix some bugs
      
      * update weight only usage
      
      * update usage
      
      * auto mapping and support smooth internlm
      
      * trust remote code
      
      * fix num head key error
      
      * fix bias error
      
      * align shape and pack order with llm-awq
      
      * modified according to LZHgrla's comments.
      
      * update gitignore
      
      * fix kv qparams export error
      
      * update usage
      
      * decouple calibrate and awq
      
      * update docstrings
      
      * update api name
      
      * update readme
      
      * update readme
      
      * update readme
      
      * update readme
      
      * update kv_qparams and readme
      
      * fix typos
      d3dbe179
  5. 10 Aug, 2023 1 commit
  6. 07 Aug, 2023 5 commits
  7. 04 Aug, 2023 1 commit
  8. 03 Aug, 2023 3 commits
  9. 01 Aug, 2023 1 commit
  10. 31 Jul, 2023 4 commits
  11. 28 Jul, 2023 1 commit
  12. 27 Jul, 2023 4 commits
  13. 26 Jul, 2023 3 commits
  14. 25 Jul, 2023 2 commits
  15. 24 Jul, 2023 2 commits