Merge branch 'v0.8.5.post1-dev-wm' into 'v0.8.5.post1-dev'
[feat] Add moe_fused_gate kernel support for quantized models, controlled by the VLLM_ENABLE_MOE_FUSED_GATE environment variable (enabled by default)

See merge request dcutoolkit/deeplearing/vllm!127
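
For illustration only, a default-on environment switch of this kind could be read roughly as follows. This is a minimal sketch, not the code from the merge request: the helper name `moe_fused_gate_enabled` and the set of values treated as "off" are assumptions; only the variable name VLLM_ENABLE_MOE_FUSED_GATE and its default-on behavior come from the commit message.

```python
import os

# Hypothetical helper (not from the merge request): reports whether the
# fused-gate path should be used. Unset means enabled, matching the
# "enabled by default" wording of the commit message.
def moe_fused_gate_enabled(environ=None):
    env = os.environ if environ is None else environ
    raw = env.get("VLLM_ENABLE_MOE_FUSED_GATE", "1")
    # Assumption: these spellings disable the feature; anything else enables it.
    return raw.strip().lower() not in ("0", "false", "off", "no")
```

Under these assumptions, exporting `VLLM_ENABLE_MOE_FUSED_GATE=0` before launching would turn the switch off, and leaving the variable unset would keep it on.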