[Model][Spec Decode] Nemotron-H MTP and Mamba Speculative Decoding Support (#33726)
Signed-off-by:Shahar Mor <smor@nvidia.com> Signed-off-by:
Benjamin Chislett <bchislett@nvidia.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Shahar Mor <smor@nvidia.com> Co-authored-by:
Roi Koren <roik@nvidia.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com>
Showing
Please register or sign in to comment