benchmark_grouped_gemm_cutlass.py 12.5 KB