- 05 Aug, 2025 1 commit
-
-
Chunyuan WU authored
Co-authored-by: jianan-gu <jianan.gu@intel.com>
Co-authored-by: YanbingJiang <yanbing.jiang@intel.com>
-
- 04 Aug, 2025 1 commit
-
-
Xiaoyu Zhang authored
-
- 03 Aug, 2025 1 commit
-
-
Qi Yuhang authored
-
- 02 Aug, 2025 3 commits
-
-
Liangsheng Yin authored
-
Liangsheng Yin authored
-
Trevor Morris authored
-
- 01 Aug, 2025 3 commits
-
-
YanbingJiang authored
-
Stefan He authored
-
Peter Pan authored
-
- 31 Jul, 2025 3 commits
-
-
Tao He authored
[bugifx] QWen-1M context support[2/3] using current cuda stream in the DCA's kernel for bugfix. (#8611)
Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com>
Co-authored-by: sa-buc <linzhu.ht@w32d09270.cloud.sqa.na131>
-
Cheng Wan authored
Co-authored-by: Xiaoyu Zhang <35585791+BBuf@users.noreply.github.com>
Co-authored-by: Ke Bao <ispobaoke@gmail.com>
-
Qi Yuhang authored
-
- 30 Jul, 2025 1 commit
-
-
Yuan Luo authored
Co-authored-by: luoyuan.luo <luoyuan.luo@antgroup.com>
Co-authored-by: Xiaoyu Zhang <35585791+BBuf@users.noreply.github.com>
Co-authored-by: JieXin Liang <Alcanderian@users.noreply.github.com>
-
- 29 Jul, 2025 1 commit
-
-
Xiaoyu Zhang authored
[sgl-kernel performace] fix fp8 quant kernels dispatch __nv_fp8_e4m3 bug to improve performance 10%-20% (#8499)
Co-authored-by: Ke Bao <ispobaoke@gmail.com>
-
- 28 Jul, 2025 2 commits
-
-
Xiaoyu Zhang authored
-
strgrb authored
Co-authored-by: Zhang Kaihong <zhangkaihong.zkh@alibaba-inc.com>
-
- 27 Jul, 2025 1 commit
-
-
Baizhou Zhang authored
-
- 25 Jul, 2025 2 commits
-
-
Hubert Lu authored
[AMD] Add silu_and_mul, gelu_and_mul, gelu_tanh_and_mul, and gelu_quick kernels for AMD GPUs (#7135)
Co-authored-by: yiakwy-xpu-ml-framework-team <961186938@qq.com>
Co-authored-by: HAI <hixiao@gmail.com>
-
li haoyang authored
Signed-off-by: Haoyang Li <Haoyang.Li@amd.com>
Co-authored-by: ilmarkov <imarkov@redhat.com>
-
- 23 Jul, 2025 2 commits
-
-
Yuan Luo authored
Co-authored-by: luoyuan.luo <luoyuan.luo@antgroup.com>
-
Zhiqiang Xie authored
-
- 20 Jul, 2025 1 commit
-
-
Baizhou Zhang authored
-
- 18 Jul, 2025 2 commits
-
-
Peng Zhang authored
-
Qi Yuhang authored
-
- 17 Jul, 2025 1 commit
-
-
Yuan Luo authored
-
- 16 Jul, 2025 1 commit
-
-
Peng Zhang authored
-
- 14 Jul, 2025 1 commit
-
-
ykcombat authored
-
- 10 Jul, 2025 1 commit
-
-
likesen-alibaba authored
-
- 09 Jul, 2025 2 commits
-
-
Chunyuan WU authored
-
Chunyuan WU authored
[CPU]convert topk_weights to fp32 for INT8 and FP8 paths (for llama4) and fix LmHead weight pack (#7818)
-
- 07 Jul, 2025 1 commit
-
-
Ke Bao authored
-
- 05 Jul, 2025 4 commits
-
-
Lianmin Zheng authored
Co-authored-by: Pranjal Shankhdhar <pranjal.ssh@gmail.com>
-
Mick authored
-
Qi Yuhang authored
-
SijiaYang authored
Signed-off-by: yangsijia.614 <yangsijia.614@bytedance.com>
Co-authored-by: yicwang <yichen.wang@bytedance.com>
-
- 03 Jul, 2025 4 commits
-
-
Yi Zhang authored
Co-authored-by: ispobock <ispobaoke@gmail.com>
-
ayrnb authored
Co-authored-by: HydraQYH <QYH820@Outlook.com>
Co-authored-by: TianQiLin666666 <1834987979@qq.com>
-
Chunyuan WU authored
-
YanbingJiang authored
-
- 02 Jul, 2025 1 commit
-
-
AniZpZ authored
Co-authored-by: 晟海 <huangtingwei.htw@antgroup.com>
Co-authored-by: yych0745 <1398089567@qq.com>
Co-authored-by: HandH1998 <1335248067@qq.com>
Co-authored-by: 弋云 <yiyun.wyt@antgroup.com>
Co-authored-by: walker-ai <2398833647@qq.com>
-