"src/include/threadwise_direct_convolution.hpp" did not exist on "4957d5a399a1c3f6bcf812c9e2fa104ed0ea7742"
No raw index calculation (#31)
* Replace most raw index calculations with coordinate transformations
* Overhaul blockwise and threadwise GEMM
* Overhaul driver for gridwise GEMM kernel
Co-authored-by: Jing Zhang <jizhan@amd.com>
script/docker-rocm3.7.sh
0 → 100644