Merge branch 'gptq_fix' into 'v0.5.0-dtk24.04.1'
fix gptq performance degradation when batch size>4 issue See merge request dcutoolkit/deeplearing/vllm!5
Showing
Please register or sign in to comment
fix gptq performance degradation when batch size>4 issue See merge request dcutoolkit/deeplearing/vllm!5