[Perf] SM100 - add swap AB optimization to CUTLASS FP8 GEMM (#27284)
Signed-off-by:Faqin Zhong <faqin.zhong@gmail.com> Co-authored-by:
Faqin Zhong <zhofaqin@amazon.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
Showing
Please register or sign in to comment