- 20 Jul, 2025 1 commit
-
-
Baizhou Zhang authored
-
- 19 Jul, 2025 1 commit
-
-
Yineng Zhang authored
-
- 18 Jul, 2025 2 commits
-
-
Peng Zhang authored
-
Qi Yuhang authored
-
- 17 Jul, 2025 1 commit
-
-
Yuan Luo authored
-
- 16 Jul, 2025 2 commits
-
-
Peng Zhang authored
-
Peng Zhang authored
-
- 15 Jul, 2025 2 commits
-
-
Yineng Zhang authored
-
Qi Yuhang authored
[feat]Support fusion kernel for constructing quant input and scale factor for fp8_blockwise_scaled_grouped_mm (#8023)
-
- 14 Jul, 2025 1 commit
-
-
ykcombat authored
-
- 12 Jul, 2025 1 commit
-
-
Yineng Zhang authored
-
- 11 Jul, 2025 1 commit
-
-
Qi Yuhang authored
-
- 10 Jul, 2025 1 commit
-
-
likesen-alibaba authored
-
- 09 Jul, 2025 2 commits
-
-
Chunyuan WU authored
-
Chunyuan WU authored
[CPU]convert topk_weights to fp32 for INT8 and FP8 paths (for llama4) and fix LmHead weight pack (#7818)
-
- 07 Jul, 2025 1 commit
-
-
Ke Bao authored
-
- 05 Jul, 2025 6 commits
-
-
Yineng Zhang authored
-
Lianmin Zheng authored
Co-authored-by:Pranjal Shankhdhar <pranjal.ssh@gmail.com>
-
Yineng Zhang authored
-
Mick authored
-
Qi Yuhang authored
-
SijiaYang authored
Signed-off-by:
yangsijia.614 <yangsijia.614@bytedance.com> Co-authored-by:
yicwang <yichen.wang@bytedance.com>
-
- 03 Jul, 2025 5 commits
-
-
Yineng Zhang authored
-
Yi Zhang authored
Co-authored-by:ispobock <ispobaoke@gmail.com>
-
ayrnb authored
Co-authored-by:
HydraQYH <QYH820@Outlook.com> Co-authored-by:
TianQiLin666666 <1834987979@qq.com>
-
Chunyuan WU authored
-
YanbingJiang authored
-
- 02 Jul, 2025 1 commit
-
-
AniZpZ authored
Co-authored-by:
晟海 <huangtingwei.htw@antgroup.com> Co-authored-by:
yych0745 <1398089567@qq.com> Co-authored-by:
HandH1998 <1335248067@qq.com> Co-authored-by:
弋云 <yiyun.wyt@antgroup.com> Co-authored-by:
walker-ai <2398833647@qq.com>
-
- 01 Jul, 2025 2 commits
-
-
Yineng Zhang authored
-
Chunyuan WU authored
-
- 30 Jun, 2025 2 commits
-
-
Baizhou Zhang authored
-
Chunyuan WU authored
-
- 29 Jun, 2025 1 commit
-
-
Ke Bao authored
-
- 26 Jun, 2025 1 commit
-
-
Ruihang Lai authored
-
- 25 Jun, 2025 2 commits
-
-
Chunyuan WU authored
[CPU] [BF16] Call fused_experts_cpu, weight_packed_linear and bmm_cpu kernel in DeepSeek model (#6641) Co-authored-by:Thien Tran <gau.nernst@yahoo.com.sg>
-
Ke Bao authored
-
- 24 Jun, 2025 1 commit
-
-
Yineng Zhang authored
-
- 23 Jun, 2025 3 commits
-
-
Zhiqiang Xie authored
-
Lianmin Zheng authored
-
kk authored
Co-authored-by:
wunhuang <wunhuang@amd.com> Co-authored-by:
Sai Enduri <saimanas.enduri@amd.com> Co-authored-by:
HAI <hixiao@gmail.com>
-