Merge branch 'v0.11.0-dev_yql_2.11' into 'v0.11.0-dev'
修复CompressedTensors的w8a16的支持,新增awq_marlin gemm的qwen72B的支持 See merge request dcutoolkit/deeplearing/vllm!429
Showing
Please register or sign in to comment
修复CompressedTensors的w8a16的支持,新增awq_marlin gemm的qwen72B的支持 See merge request dcutoolkit/deeplearing/vllm!429