"docs/vscode:/vscode.git/clone" did not exist on "40d0e7411dbeb276befd33c4485115ac3d4d7f2a"
[Bugfix] Fix mismatch between global and local attention heads in tensor-parallel mode for param2moe model (#39707)

Signed-off-by: bhargav-patel-29 <bhargav.patel@tihiitb.org>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>