- 07 Jun, 2025 13 commits
-
-
JieXin Liang authored
-
Xiaoyu Zhang authored
-
Xiaoyu Zhang authored
Add triton version as a fused_moe_triton config search key to avoid performace decrease in different Triton version (#5955)
-
fzyzcjy authored
-
Swipe4057 authored
-
fzyzcjy authored
-
JieXin Liang authored
-
Baizhou Zhang authored
-
miter authored
Signed-off-by:miter <miterv@outlook.com>
-
Xinyuan Tong authored
Signed-off-by:Xinyuan Tong <justinning0323@outlook.com>
-
shangmingc authored
Signed-off-by:Shangming Cai <caishangming@linux.alibaba.com>
-
Lianmin Zheng authored
-
fzyzcjy authored
-
- 06 Jun, 2025 3 commits
-
-
Lianmin Zheng authored
-
Jianan Ji authored
-
HAI authored
Co-authored-by:
wunhuang <wunhuang@amd.com> Co-authored-by:
Hubert Lu <Hubert.Lu@amd.com>
-
- 05 Jun, 2025 13 commits
-
-
Zaili Wang authored
Co-authored-by:
diwei sun <diwei.sun@intel.com> Co-authored-by:
Yineng Zhang <me@zhyncs.com>
-
Chang Su authored
-
Pavani Majety authored
[CUTLASS-FP4-MOE] Introduce CutlassMoEParams class for easy initialization of Cutlass Grouped Gems Metadata (#6887) Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
fzyzcjy authored
-
shangmingc authored
Signed-off-by:Shangming Cai <caishangming@linux.alibaba.com>
-
Ravi Theja authored
Co-authored-by:Ravi Theja Desetty <ravitheja@Ravis-MacBook-Pro.local>
-
fzyzcjy authored
-
fzyzcjy authored
-
fzyzcjy authored
-
fzyzcjy authored
-
zyksir authored
-
Lifu Huang authored
-
Cheng Wan authored
-
- 04 Jun, 2025 7 commits
-
-
Cheng Wan authored
Set `num_fused_shared_experts` as `num_shared_experts` when shared_experts fusion is not disabled (#6736)
-
ishandhanani authored
-
Chanh Nguyen authored
Co-authored-by:Chanh Nguyen <cnguyen@linkedin.com>
-
Xinyuan Tong authored
Signed-off-by:Xinyuan Tong <justinning0323@outlook.com>
-
JieXin Liang authored
-
Marc Sun authored
-
Cheng Wan authored
-
- 03 Jun, 2025 3 commits
-
-
fzyzcjy authored
-
fzyzcjy authored
-
pansicheng authored
-
- 02 Jun, 2025 1 commit
-
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-