- 25 Jun, 2025 2 commits
-
-
Chunyuan WU authored
[CPU] [BF16] Call fused_experts_cpu, weight_packed_linear and bmm_cpu kernel in DeepSeek model (#6641) Co-authored-by:Thien Tran <gau.nernst@yahoo.com.sg>
-
Ke Bao authored
-
- 24 Jun, 2025 1 commit
-
-
Yineng Zhang authored
-
- 23 Jun, 2025 4 commits
-
-
Zhiqiang Xie authored
-
Lianmin Zheng authored
-
kk authored
Co-authored-by:
wunhuang <wunhuang@amd.com> Co-authored-by:
Sai Enduri <saimanas.enduri@amd.com> Co-authored-by:
HAI <hixiao@gmail.com>
-
xutizhou authored
Co-authored-by:
tianqilin.99 <tianqilin.99@bytedance.com> Co-authored-by:
TianQiLin666666 <1834987979@qq.com> Co-authored-by:
Cheng Wan <54331508+ch-wan@users.noreply.github.com>
-
- 18 Jun, 2025 1 commit
-
-
linzhuo authored
-
- 17 Jun, 2025 2 commits
-
-
Yineng Zhang authored
-
AniZpZ authored
-
- 16 Jun, 2025 2 commits
-
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
- 15 Jun, 2025 1 commit
-
-
Yineng Zhang authored
-
- 14 Jun, 2025 1 commit
-
-
JieXin Liang authored
-
- 13 Jun, 2025 3 commits
-
-
Yineng Zhang authored
-
fzyzcjy authored
-
fzyzcjy authored
-
- 12 Jun, 2025 3 commits
-
-
sogalin authored
Co-authored-by:
HAI <hixiao@gmail.com> Co-authored-by:
Yineng Zhang <me@zhyncs.com>
-
Yineng Zhang authored
-
Yuan Luo authored
Co-authored-by:luoyuan.luo <luoyuan.luo@antgroup.com>
-
- 10 Jun, 2025 3 commits
-
-
Yineng Zhang authored
-
fzyzcjy authored
-
YanbingJiang authored
Co-authored-by:mingfeima <mingfei.ma@intel.com>
-
- 09 Jun, 2025 1 commit
-
-
JieXin Liang authored
-
- 08 Jun, 2025 1 commit
-
-
Yineng Zhang authored
-
- 07 Jun, 2025 5 commits
-
-
Yineng Zhang authored
-
Elfie Guo authored
Co-authored-by:Elfie Guo <elfiegxf@gmail.com>
-
Xiaoyu Zhang authored
-
Yineng Zhang authored
-
JieXin Liang authored
-
- 05 Jun, 2025 3 commits
-
-
Pavani Majety authored
[CUTLASS-FP4-MOE] Introduce CutlassMoEParams class for easy initialization of Cutlass Grouped Gems Metadata (#6887) Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
Yuan Luo authored
Co-authored-by:luoyuan.luo <luoyuan.luo@antgroup.com>
-
zyksir authored
-
- 04 Jun, 2025 3 commits
-
-
Cheng Wan authored
Set `num_fused_shared_experts` as `num_shared_experts` when shared_experts fusion is not disabled (#6736)
-
Xiaoyu Zhang authored
Co-authored-by:JieXin Liang <Alcanderian@users.noreply.github.com>
-
Cheng Wan authored
-
- 03 Jun, 2025 1 commit
-
-
jianan-gu authored
-
- 02 Jun, 2025 2 commits
-
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
Yuan Luo authored
Co-authored-by:luoyuan.luo <luoyuan.luo@antgroup.com>
-
- 01 Jun, 2025 1 commit
-
-
Wenxuan Tan authored
-