Merge branch 'v0.9.2-dev-marlin' into 'v0.9.2-dev'
fix: 优化w4a8 marlin 中 weight重排耗时 See merge request dcutoolkit/deeplearing/vllm!200
Showing
Please register or sign in to comment
fix: 优化w4a8 marlin 中 weight重排耗时 See merge request dcutoolkit/deeplearing/vllm!200