[JAX] Async issuing D2H memcpy for grouped_gemm group_sizes array (#2213)
* Try async copy of grouped GEMM group_sizes data

Signed-off-by: Hua Huang <huah@nvidia.com>

---------

Signed-off-by: Hua Huang <huah@nvidia.com>
Co-authored-by: Phuong Nguyen <phuonguyen@nvidia.com>