Merge branch 'v0.9.2-dev-update' into 'v0.9.2-dev'
Fix bitwise operations in w8a8 triton config selection that could trigger torch compile compilation errors; fix the position of smquant w8a8 weight post-processing. See merge request dcutoolkit/deeplearing/vllm!320