Unverified Commit 74898a70 authored by Danielle Robinson's avatar Danielle Robinson Committed by GitHub
Browse files

[BugFix][LoRA] TritonExperts is ModularMoEPath for FP8 models (#33393)


Signed-off-by: default avatarDanielle Robinson <dmmaddix@amazon.com>
Co-authored-by: default avatarDanielle Robinson <dmmaddix@amazon.com>
parent 8f5d5120
......@@ -143,9 +143,7 @@ class FusedMoEWithLoRA(BaseLayerWithLoRA):
m_fused_moe_fn.fused_experts, (MarlinExperts, UnfusedOAITritonExperts)
)
else:
assert isinstance(
m_fused_moe_fn.fused_experts, (MarlinExperts, TritonExperts)
)
assert isinstance(m_fused_moe_fn.fused_experts, TritonExperts)
def fwd_decorator(layer, func):
def wrapper(*args, **kwargs):
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment