[Bugfix] Fix benchmark_moe.py for blockwise fp8. (#23823)

Signed-off-by: crischeng <420985011@qq.com> Co-authored-by: cris <grace@guisenbindeMacBook-Pro.local>

[Bugfix] Fix benchmark_moe.py for blockwise fp8. (#23823)
Signed-off-by: crischeng <420985011@qq.com> Co-authored-by: cris <grace@guisenbindeMacBook-Pro.local>
66548f66 · YUQI.CHENG · GitHub · d3da2eea · 66548f66
Unverified Commit 66548f66 authored Aug 28, 2025 by YUQI.CHENG Committed by GitHub Aug 28, 2025
Show whitespace changes
Inline Side-by-side

Showing with 4 additions and 1 deletion

benchmarks/kernels/benchmark_moe.py benchmarks/kernels/benchmark_moe.py +4 -1

No files found.
--- a/benchmarks/kernels/benchmark_moe.py
+++ b/benchmarks/kernels/benchmark_moe.py
@@ -419,8 +419,10 @@ class BenchmarkWorker:
        )
        # NOTE(woosuk): The current naming convention uses w2.shape[2], which
        # is the intermediate size after silu_and_mul.
+        block_n = block_quant_shape[0] if block_quant_shape else None
+        block_k = block_quant_shape[1] if block_quant_shape else None
        op_config = get_moe_configs(
-            num_experts, shard_intermediate_size // 2, dtype_str
+            num_experts, shard_intermediate_size // 2, dtype_str, block_n, block_k
        )
        if op_config is None:
            config = get_default_config(
@@ -430,6 +432,7 @@ class BenchmarkWorker:
                hidden_size,
                topk,
                dtype_str,
+                block_quant_shape,
            )
        else:
            config = op_config[min(op_config.keys(), key=lambda x: abs(x - num_tokens))]