Modify the mfma_16x16x16->mfma_32x32x8 for int8_dequant to match the kernal of...
Modify the mfma_16x16x16->mfma_32x32x8 for int8_dequant to match the kernal of gemm_xdl_fp16_pk_i4_v3
Showing
Please register or sign in to comment
Modify the mfma_16x16x16->mfma_32x32x8 for int8_dequant to match the kernal of gemm_xdl_fp16_pk_i4_v3