- 09 Jul, 2025 1 commit
-
-
Chunyuan WU authored
[CPU] convert topk_weights to fp32 for INT8 and FP8 paths (for llama4) and fix LmHead weight pack (#7818)
-
- 07 Jul, 2025 1 commit
-
-
Ke Bao authored
-
- 05 Jul, 2025 4 commits
-
-
Lianmin Zheng authored
Co-authored-by: Pranjal Shankhdhar <pranjal.ssh@gmail.com>
-
Mick authored
-
Qi Yuhang authored
-
SijiaYang authored
Signed-off-by: yangsijia.614 <yangsijia.614@bytedance.com>
Co-authored-by: yicwang <yichen.wang@bytedance.com>
-
- 03 Jul, 2025 4 commits
-
-
Yi Zhang authored
Co-authored-by: ispobock <ispobaoke@gmail.com>
-
ayrnb authored
Co-authored-by: HydraQYH <QYH820@Outlook.com>
Co-authored-by: TianQiLin666666 <1834987979@qq.com>
-
Chunyuan WU authored
-
YanbingJiang authored
-
- 02 Jul, 2025 1 commit
-
-
AniZpZ authored
Co-authored-by: 晟海 <huangtingwei.htw@antgroup.com>
Co-authored-by: yych0745 <1398089567@qq.com>
Co-authored-by: HandH1998 <1335248067@qq.com>
Co-authored-by: 弋云 <yiyun.wyt@antgroup.com>
Co-authored-by: walker-ai <2398833647@qq.com>
-
- 01 Jul, 2025 1 commit
-
-
Chunyuan WU authored
-
- 30 Jun, 2025 2 commits
-
-
Baizhou Zhang authored
-
Chunyuan WU authored
-
- 29 Jun, 2025 1 commit
-
-
Ke Bao authored
-
- 25 Jun, 2025 2 commits
-
-
Chunyuan WU authored
[CPU] [BF16] Call fused_experts_cpu, weight_packed_linear and bmm_cpu kernel in DeepSeek model (#6641)
Co-authored-by: Thien Tran <gau.nernst@yahoo.com.sg>
-
Ke Bao authored
-
- 23 Jun, 2025 1 commit
-
-
Zhiqiang Xie authored
-
- 17 Jun, 2025 1 commit
-
-
AniZpZ authored
-
- 16 Jun, 2025 1 commit
-
-
Lianmin Zheng authored
-
- 14 Jun, 2025 1 commit
-
-
JieXin Liang authored
-
- 13 Jun, 2025 2 commits
- 12 Jun, 2025 1 commit
-
-
Yuan Luo authored
Co-authored-by: luoyuan.luo <luoyuan.luo@antgroup.com>
-
- 10 Jun, 2025 2 commits
-
-
fzyzcjy authored
-
YanbingJiang authored
Co-authored-by: mingfeima <mingfei.ma@intel.com>
-
- 09 Jun, 2025 1 commit
-
-
JieXin Liang authored
-
- 07 Jun, 2025 2 commits
-
-
Elfie Guo authored
Co-authored-by: Elfie Guo <elfiegxf@gmail.com>
-
Xiaoyu Zhang authored
-
- 05 Jun, 2025 2 commits
-
-
Yuan Luo authored
Co-authored-by: luoyuan.luo <luoyuan.luo@antgroup.com>
-
zyksir authored
-
- 04 Jun, 2025 3 commits
-
-
Cheng Wan authored
Set `num_fused_shared_experts` as `num_shared_experts` when shared_experts fusion is not disabled (#6736)
-
Xiaoyu Zhang authored
Co-authored-by: JieXin Liang <Alcanderian@users.noreply.github.com>
-
Cheng Wan authored
-
- 03 Jun, 2025 1 commit
-
-
jianan-gu authored
-
- 02 Jun, 2025 2 commits
-
-
Pavani Majety authored
Signed-off-by: Pavani Majety <pmajety@nvidia.com>
-
Yuan Luo authored
Co-authored-by: luoyuan.luo <luoyuan.luo@antgroup.com>
-
- 23 May, 2025 2 commits
-
-
Chunyuan WU authored
-
blzheng authored
-
- 22 May, 2025 1 commit
-
-
HandH1998 authored
Co-authored-by: yych0745 <1398089567@qq.com>
Co-authored-by: sleepcoo <sleepcoo@gmail.com>
-