Reduce CUDA driver calls when choosing transpose kernels (#1002)
Reduce CUDA driver API calls when choosing transpose kernels
Signed-off-by:
Tim Moon <tmoon@nvidia.com>
Showing
Please register or sign in to comment
Reduce CUDA driver API calls when choosing transpose kernels
Signed-off-by:
Tim Moon <tmoon@nvidia.com>