[Bugfix][MoE] Only unpad routed output before shared expert add or routed output transform (#40865)
Signed-off-by:Netanel Haber <58652339+netanel-haber@users.noreply.github.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
Showing
Please register or sign in to comment