Merge branch 'v0.9.2-dev-w8a8-marlin' into 'v0.9.2-dev'
feat: integrate w8a8_marlin, enabled via -q slimquant_marlin; optimize w4a8_marlin code
See merge request dcutoolkit/deeplearing/vllm!240