[V1] - Split Prefill and Decode for Mamba1 models (#22653)
Signed-off-by: amirk <amirk@ai21.com>
Signed-off-by: asafg <asafg@ai21.com>
Co-authored-by: asafg <asafg@ai21.com>
Co-authored-by: Asaf Joseph Gardin <39553475+Josephasafg@users.noreply.github.com>