[Kernel] Fixup for CUTLASS kernels in CUDA graphs (#4954)
Pass the CUDA stream into the CUTLASS GEMMs, to avoid future issues with CUDA graphs