Merge branch 'v0.11.0-dev-moe_tune' into 'v0.11.0-dev'
feat: add Marlin W16A16 fused MoE behind VLLM_USE_MARLIN_W16A16_MOE See merge request dcutoolkit/deeplearing/vllm!300
Showing
Please register or sign in to comment
feat: add Marlin W16A16 fused MoE behind VLLM_USE_MARLIN_W16A16_MOE See merge request dcutoolkit/deeplearing/vllm!300