Unverified Commit fc56f4a0 authored by Fadi Arafeh's avatar Fadi Arafeh Committed by GitHub
Browse files

[BugFix] Fix invalid flashinfer_fused_moe_blockscale_fp8 op registration (#32855)


Signed-off-by: default avatarFadi Arafeh <fadi.arafeh@arm.com>
parent d08b356e
...@@ -131,7 +131,7 @@ def flashinfer_fused_moe_blockscale_fp8( ...@@ -131,7 +131,7 @@ def flashinfer_fused_moe_blockscale_fp8(
expert_offset: int, expert_offset: int,
local_num_experts: int, local_num_experts: int,
block_shape: list[int], block_shape: list[int],
routing_method_type: int = RoutingMethodType.DeepSeekV3, routing_method_type: int = int(RoutingMethodType.DeepSeekV3),
routed_scaling: float | None = 1.0, routed_scaling: float | None = 1.0,
) -> torch.Tensor: ) -> torch.Tensor:
from vllm.utils.flashinfer import flashinfer_trtllm_fp8_block_scale_moe from vllm.utils.flashinfer import flashinfer_trtllm_fp8_block_scale_moe
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment