"...composable_kernel-1.git" did not exist on "8a4b59785b4f5ba48468d53618ca270c5da599a7"
Hybrid direct + implicit GEMM forward convolution NCHWc v5r1 (#25)
* Hybrid direct + implicit GEMM forward convolution NCHWc v5r1. Input tensor bypass LDS. Support fp32/fp16/int8
Showing
Please register or sign in to comment