[Model] Reduce redundant computations in mamba2 blocks for Bamba-9B (#15423)
Signed-off-by:Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com> Co-authored-by:
Yu Chin Fabian Lim <flim@sg.ibm.com>
Showing
Please register or sign in to comment