[Model] Mamba2 preallocate SSM output tensor to avoid d2d copy overhead (#21075)
Signed-off-by:Chih-Chieh Yang <7364402+cyang49@users.noreply.github.com> Signed-off-by:
Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com>
Showing
Please register or sign in to comment