feat: SLA Planner configuration support for DEP and TEP VLLM (#4783)
Extends the MoE planner profiler to support TEP (tensor expert parallel) and DEP (data expert parallel) configurations with the vLLM backend.