benchmark_grouped_gemm_cutlass.py 12.2 KB