"vscode:/vscode.git/clone" did not exist on "72c2b68dc9d4fb20eb135c22ee8c86caca48d28b"
Unverified Commit fa028207 authored by Shaun Kotek's avatar Shaun Kotek Committed by GitHub
Browse files

Fix/resupport nongated fused moe triton (#36412)


Signed-off-by: default avatarShaun Kotek - Nvidia <skotek@nvidia.com>
Signed-off-by: default avatarNatan Bagrov <nbagrov@nvidia.com>
Signed-off-by: default avatarDaniel Serebrenik <daserebrenik@nvidia.com>
Signed-off-by: default avatarzjy0516 <riverclouds.zhu@qq.com>
Signed-off-by: default avatarHarry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: default avataryewentao256 <zhyanwentao@126.com>
Signed-off-by: default avatarWentao Ye <44945378+yewentao256@users.noreply.github.com>
Signed-off-by: default avatarliweiguang <codingpunk@gmail.com>
Signed-off-by: default avatarwang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: default avatarwang.yuqi <noooop@126.com>
Signed-off-by: default avatarAlex Brooks <albrooks@redhat.com>
Signed-off-by: default avatarDarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: default avatarcong-or <conchubhar.gannon@gmail.com>
Signed-off-by: default avatarTushar Shetty <tushar.shetty@abbyy.com>
Signed-off-by: default avatarTushar Shetty <54362365+tusharshetty61@users.noreply.github.com>
Signed-off-by: default avatarjiang1.li <jiang1.li@intel.com>
Signed-off-by: default avatarzhenwei-intel <zhenwei.liu@intel.com>
Signed-off-by: default avatarXin Yang <xyangx@amazon.com>
Signed-off-by: default avatarKevin H. Luu <khluu000@gmail.com>
Signed-off-by: default avatarIsotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: default avatarnvnbagrov <nbagrov@nvidia.com>
Co-authored-by: default avatarSage <80211083+sagearc@users.noreply.github.com>
Co-authored-by: default avatardanisereb <daserebrenik@nvidia.com>
Co-authored-by: default avatarJiangyun Zhu <riverclouds.zhu@qq.com>
Co-authored-by: default avatarKunshang Ji <kunshang.ji@intel.com>
Co-authored-by: default avatarcopilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: default avatarHarry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: default avatarWentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: default avatargemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: default avatarWeiguang Li <codingpunk@gmail.com>
Co-authored-by: default avatarClaude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: default avatarLi, Jiang <jiang1.li@intel.com>
Co-authored-by: default avatarwang.yuqi <yuqi.wang@daocloud.io>
Co-authored-by: default avatarAlex Brooks <albrooks@redhat.com>
Co-authored-by: default avatarCyrus Leung <tlleungac@connect.ust.hk>
Co-authored-by: default avatarcong-or <conchubhar.gannon@gmail.com>
Co-authored-by: default avatarTushar Shetty <54362365+tusharshetty61@users.noreply.github.com>
Co-authored-by: default avatarliuzhenwei <zhenwei.liu@intel.com>
Co-authored-by: default avatarXin Yang <105740670+xyang16@users.noreply.github.com>
Co-authored-by: default avatarKevin H. Luu <khluu000@gmail.com>
Co-authored-by: default avatarIsotr0py <mozf@mail2.sysu.edu.cn>
parent d460a18f
...@@ -912,7 +912,7 @@ class BatchedTritonExperts(mk.FusedMoEExpertsModular): ...@@ -912,7 +912,7 @@ class BatchedTritonExperts(mk.FusedMoEExpertsModular):
@staticmethod @staticmethod
def _supports_no_act_and_mul() -> bool: def _supports_no_act_and_mul() -> bool:
return False return True
@staticmethod @staticmethod
def _supports_quant_scheme( def _supports_quant_scheme(
......
...@@ -1944,7 +1944,7 @@ class TritonExperts(mk.FusedMoEExpertsModular): ...@@ -1944,7 +1944,7 @@ class TritonExperts(mk.FusedMoEExpertsModular):
@staticmethod @staticmethod
def _supports_no_act_and_mul() -> bool: def _supports_no_act_and_mul() -> bool:
return False return True
@staticmethod @staticmethod
def _supports_quant_scheme( def _supports_quant_scheme(
...@@ -1983,6 +1983,9 @@ class TritonExperts(mk.FusedMoEExpertsModular): ...@@ -1983,6 +1983,9 @@ class TritonExperts(mk.FusedMoEExpertsModular):
MoEActivation.GELU, MoEActivation.GELU,
MoEActivation.SWIGLUOAI, MoEActivation.SWIGLUOAI,
MoEActivation.SWIGLUSTEP, MoEActivation.SWIGLUSTEP,
MoEActivation.SILU_NO_MUL,
MoEActivation.GELU_NO_MUL,
MoEActivation.RELU2_NO_MUL,
] ]
@staticmethod @staticmethod
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment