"...composable_kernel_rocm.git" did not exist on "cac014f17355d6504b618f5945c6326a285db7e9"
added lds double buffer (on C dimension) for implicit gemm v1r3, as a result,...
added lds double buffer (on C dimension) for implicit gemm v1r3, as a result, it should achieve 90% of peak for all filter sizes, on CHWN format
Showing
Please register or sign in to comment