Unverified Commit fc56f4a0 authored by Fadi Arafeh's avatar Fadi Arafeh Committed by GitHub
Browse files

[BugFix] Fix invalid flashinfer_fused_moe_blockscale_fp8 op registration (#32855)


Signed-off-by: default avatarFadi Arafeh <fadi.arafeh@arm.com>
parent d08b356e
......@@ -131,7 +131,7 @@ def flashinfer_fused_moe_blockscale_fp8(
expert_offset: int,
local_num_experts: int,
block_shape: list[int],
routing_method_type: int = RoutingMethodType.DeepSeekV3,
routing_method_type: int = int(RoutingMethodType.DeepSeekV3),
routed_scaling: float | None = 1.0,
) -> torch.Tensor:
from vllm.utils.flashinfer import flashinfer_trtllm_fp8_block_scale_moe
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment