[NVIDIA] Support SiluMul + NVFP4 quant fusion (#23671)
Signed-off-by: jindih <jindih@nvidia.com>
Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>
Co-authored-by: jindih <jindih@nvidia.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Luka Govedic <lgovedic@redhat.com>