[Kernel] Optimize SM120 CUTLASS blockwise FP8 GEMM (#37970)
Signed-off-by: Necofish <liuxiangyang@mail.ustc.edu.cn>
Co-authored-by: Michael Goin <mgoin64@gmail.com>