[MoE] Fix output_shape calculation in Attention layer to handle 3D query inputs (#31596)
Signed-off-by:
Andreas Karatzas <akaratza@amd.com>
Showing
Please register or sign in to comment
Signed-off-by:
Andreas Karatzas <akaratza@amd.com>