Fix cublasLt context create/destroy overhead in MLP extension (#1083)
* don't create cublasLt handle, fix zero block size case
* cleanup