Batched Gemm with C Permute (#305)
* init commit
* add c_permute
* add mnk padding
* fixed comments
* Fixed comments
Co-authored-by: Chao Liu <chao.liu2@amd.com>