feat: add dp attention support for Qwen 2/3 MoE models, fixes #6088 (#6121)
Co-authored-by:King.Zevin <zevin@mail.ustc.edu.cn> Co-authored-by:
Yi Zhang <1109276519@qq.com>
Showing
Please register or sign in to comment
Co-authored-by:King.Zevin <zevin@mail.ustc.edu.cn> Co-authored-by:
Yi Zhang <1109276519@qq.com>