"src/include/threadwise_2d_tensor_op.hip.hpp" did not exist on "84d9802d30de16795e63a8625098634527c80ae4"
DL GEMM fp32/fp16/int8 (#41)
* add threadwise copy the copy a tensor in one copy, added kpack to DL GEMM * add kpack into fwd v4r5 nchw fp32
Showing
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
Please register or sign in to comment