Unverified commit 8f36850f, authored by Wentao Ye, committed by GitHub

[Bug] Fix Shape Validation for Fallback while Enabling E8M0 for DeepGEMM (#26322)


Signed-off-by: yewentao256 <zhyanwentao@126.com>
parent 29fd2662
@@ -143,7 +143,7 @@ class TritonOrDeepGemmExperts(mk.FusedMoEPermuteExpertsUnpermute):
         apply_router_weight_on_input: bool,
     ):
         use_deep_gemm = self.allow_deep_gemm and (
-            _valid_deep_gemm(hidden_states, w1, w2) or is_deep_gemm_e8m0_used()
+            is_deep_gemm_e8m0_used() or _valid_deep_gemm(hidden_states, w1, w2)
         )
         experts = self.deep_gemm_expert if use_deep_gemm else self.triton_expert
......
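
Note on the change: swapping the operands of `or` leaves the boolean value of `use_deep_gemm` unchanged, but Python's `or` short-circuits, so the swap changes which check runs. With the E8M0 check first, the shape validation is skipped whenever E8M0 is in use. The sketch below illustrates only that evaluation-order effect. The functions `make_checks`, `select_old`, and `select_new` are hypothetical stand-ins written for this note, not vLLM's `_valid_deep_gemm` or `is_deep_gemm_e8m0_used`, and it assumes nothing about what those helpers do internally.

```python
from typing import Callable, List, Tuple


def make_checks(
    e8m0: bool, shapes_ok: bool
) -> Tuple[Callable[[], bool], Callable[[], bool], List[str]]:
    """Build two stand-in checks that record the order in which they run."""
    calls: List[str] = []

    def e8m0_used() -> bool:  # hypothetical stand-in for the E8M0 check
        calls.append("e8m0")
        return e8m0

    def shapes_valid() -> bool:  # hypothetical stand-in for shape validation
        calls.append("shapes")
        return shapes_ok

    return e8m0_used, shapes_valid, calls


def select_old(allow: bool, e8m0: bool, shapes_ok: bool) -> Tuple[bool, List[str]]:
    """Operand order before the commit: shape validation runs first."""
    e8m0_used, shapes_valid, calls = make_checks(e8m0, shapes_ok)
    use = allow and (shapes_valid() or e8m0_used())
    return use, calls


def select_new(allow: bool, e8m0: bool, shapes_ok: bool) -> Tuple[bool, List[str]]:
    """Operand order after the commit: the E8M0 check runs first."""
    e8m0_used, shapes_valid, calls = make_checks(e8m0, shapes_ok)
    use = allow and (e8m0_used() or shapes_valid())
    return use, calls


if __name__ == "__main__":
    # Same decision in both orders, but a different set of checks is executed.
    print(select_old(True, True, False))
    print(select_new(True, True, False))
```

In this sketch both orderings select the same backend for every input combination; only the recorded call list differs, which is the part of the behavior the one-line diff affects.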