Merge branch '1117_deepseek_reduce_fusenorm_quant_push_1' into 'v0.9.2-dev'
deepseekv2-w4a8支持custom-rms-quant融合 See merge request dcutoolkit/deeplearing/vllm!259
Showing
This diff is collapsed.
This diff is collapsed.
Please register or sign in to comment