[Kernel] Added flashinfer fp8 per-tensor gemms (#22895)
Signed-off-by:Julien Lin <jullin@nvidia.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
Showing
Please register or sign in to comment
Signed-off-by:Julien Lin <jullin@nvidia.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>