[NVIDIA] Add support for cudnn fp4 gemm via flashinfer (#26107)
Signed-off-by: kaixih <kaixih@nvidia.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: mgoin <mgoin64@gmail.com>