Merge branch 'v0.15.1-dev_GLM4.7_moe_call_RQ' into 'v0.15.1-dev'
perf: GLM4.7增加MOE调用rmsQuant, fix: 修掉fused_moe向后传递None导致的报错 See merge request dcutoolkit/deeplearing/vllm!505
Showing
Please register or sign in to comment
perf: GLM4.7增加MOE调用rmsQuant, fix: 修掉fused_moe向后传递None导致的报错 See merge request dcutoolkit/deeplearing/vllm!505