add tensor slicing API (#7)
* add tensor slicing API
* remove redundant ck namespace
* better gemm_gemm interface
* modify gemm_gemm
* add slice_tile api
* fix merge bug
* update to 3d padding, since we no longer need that much LDS size
* clean
* cleang
* clean
* clean
* clean
* clean
* clean
* clean
* clean
* clean
* clean
* clean
* clean
* clean
---------
Co-authored-by:
Chao Liu <chao.liu2@amd.com>
Showing
Please register or sign in to comment