"vscode:/vscode.git/clone" did not exist on "deeb9cb822f50b6dc7921fbf47bfb8688fd10657"
Unverified Commit 92540529 authored by TJian's avatar TJian Committed by GitHub
Browse files

[Bugfix] [ROCm]: Remove assertion logic when using AITER fused moe in...


[Bugfix] [ROCm]: Remove assertion logic when using AITER fused moe in unquantizedMethod to reenable LLama4 BF16 (#18205)
Signed-off-by: default avatartjtanaa <tunjian.tan@embeddedllm.com>
parent fadb8d5c
......@@ -503,7 +503,6 @@ class UnquantizedFusedMoEMethod(FusedMoEMethodBase, CustomOp):
indices_type=torch.uint32 if self.moe.use_pplx_kernels else None)
if self.rocm_aiter_moe_enabled:
assert not apply_router_weight_on_input
assert expert_map is None
return self.rocm_aiter_fused_experts(
hidden_states=x,
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment