No raw index calculation (#31)
* Replace most raw index calculation to coordinate transformation
* Overhaul blockwise and threadwise GEMM
* Overhaul driver for gridwies GEMM kernel
Co-authored-by:
Jing Zhang <jizhan@amd.com>
Showing
script/docker-rocm3.7.sh
0 → 100644
Please register or sign in to comment