[LoRA][Spec Decode] Support LoRA for Nemotron-H MTP models (#32265)
Signed-off-by: Daniel Serebrenik <daserebrenik@nvidia.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>