Merge branch 'v0.11.0-dev-moe_tune' into 'v0.11.0-dev'
Add new benchmark configurations for gfx936_80cu with E=512,N=64 and E=512,N=128 Qwen3-Next-80B-A3B-Instruct nn tp4 tp8 moe json See merge request dcutoolkit/deeplearing/vllm!283
Showing
Please register or sign in to comment