修复CompressedTensors的w8a16的支持,新增awq_marlin gemm的qwen72B的支持 See merge request dcutoolkit/deeplearing/vllm!429
Attach a file by drag & drop or click to upload