Create col-major and tma-aligned x_scale for deep_gemm.gemm_fp8_fp8_bf16_nt (#4515)
Co-authored-by:
Zhang Kaihong <zhangkaihong.zkh@alibaba-inc.com>
Showing
Please register or sign in to comment
Co-authored-by:
Zhang Kaihong <zhangkaihong.zkh@alibaba-inc.com>