"...composable_kernel.git" did not exist on "fb7b460960a1f692ac3ef20febc34ee54b8ae52b"
Do not hardcode the function parameter, use template instead. (#72)
* Do not hardcode the function parameter, use template instead. * [What] Remove AThreadTransferSrcResetCoordinateAfterRun and BThreadTransferSrcResetCoordinateAfterRun in host API [Why] "C_Shuffle" version is supposed to be similar to the vanilla one * Fix typo Let DeviceGemmXdl_C_Shuffle use kernel_gemm_xdlops_v3r1
Showing
Please register or sign in to comment