This change uses the updated Marlin MoE kernel from vLLM to support MoE with activation sorting and groups.