Improve kernel code generation (#1285)
* Only run __syncthreads when there is data to preload
* Improve loops
* Add const attribute to improve optimizations