CK Tile GEMM cshuffle & CK Tile GEMM restructure (#1535)
* Make the cshuffle compilable
* modify the reference on gpu and cpu. Correct access of cshuffle
* fix the cpu reference code
* Complete the in tile shuffle logic
* restructure the kernel template input
* change the naming pattern of ck_tile gemm pipeline
* Re-format files using remod.py
* Solve the fmha conflict with gemm
* Address comments from Carlus
---------
Co-authored-by: Po Yen, Chen <PoYen.Chen@amd.com>