benchmark_grouped_gemm_cutlass.py 11.9 KB