"vscode:/vscode.git/clone" did not exist on "b6c14ec0b4f3d7f744c734a3835298b3242a2b90"
feat: add dp attention support for Qwen 2/3 MoE models, fixes #6088 (#6121)
Co-authored-by:King.Zevin <zevin@mail.ustc.edu.cn> Co-authored-by:
Yi Zhang <1109276519@qq.com>
Showing
Please register or sign in to comment