Merge branch 'v0.9.2-dev_fix' into 'v0.9.2-dev'
feat: Support enable rms quant and shared expert overlap at same time. See merge request dcutoolkit/deeplearing/vllm!352
Showing
Please register or sign in to comment
feat: Support enable rms quant and shared expert overlap at same time. See merge request dcutoolkit/deeplearing/vllm!352