- 15 Apr, 2026 2 commits
-
-
Yufeng He authored
Signed-off-by:
Yufeng He <40085740+he-yufeng@users.noreply.github.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Vibhav Agarwal authored
Signed-off-by:
vibhavagarwal5 <vibhavagarwal5@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Xinyu Chen <xinyu1.chen@intel.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 14 Apr, 2026 14 commits
-
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Claude <noreply@anthropic.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
Jackmin801 authored
Signed-off-by:
Robert Shaw <robertgshaw2@gmail.com> Signed-off-by:
Jackmin801 <ongjackm@gmail.com> Co-authored-by:
Robert Shaw <robertgshaw2@gmail.com>
-
roikoren755 authored
Signed-off-by:Roi Koren <roik@nvidia.com>
-
danielafrimi authored
Signed-off-by:
root <root@lyris0017.lyris.clusters.nvidia.com> Signed-off-by:
Daniel Afrimi <dafrimi@nvidia.com> Co-authored-by:
root <root@lyris0017.lyris.clusters.nvidia.com>
-
Albert Cheng authored
Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Albert Cheng (Engrg-Hardware 1) <albecheng@login-lyris02.lyris.clusters.nvidia.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Hexiang Wang authored
Signed-off-by:whx-sjtu <2952154980@qq.com>
-
bnellnm authored
Signed-off-by:
Bill Nell <bnell@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
bhargav-patel-29 authored
[Bugfix] Fix mismatch between global and local attention heads in tensor-parallel mode for param2moe model (#39707) Signed-off-by:
bhargav-patel-29 <bhargav.patel@tihiitb.org> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Thomas authored
Signed-off-by:
thomasmaindron <thomasmaindron@users.noreply.github.com> Co-authored-by:
thomasmaindron <thomasmaindron@users.noreply.github.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
fxmarty-amd authored
[fix][MOE] Fix MOE experts `intermediate_size` dimension not being narrowed before weight loading (#39688) Signed-off-by:Felix Marty <Felix.Marty@amd.com>
-
Shanshan Shen authored
Signed-off-by:
shen-shanshan <467638484@qq.com> Signed-off-by:
Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
lalit10 authored
Signed-off-by:
Lalit Laxminarayan Bangad <lalitbangad@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 13 Apr, 2026 11 commits
-
-
Netanel Haber authored
Signed-off-by:Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
Monishver authored
Signed-off-by:
Monishver Chandrasekaran <monishverchandrasekaran@gmail.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
Pedram Razavi authored
Signed-off-by:Pedram Razavi <pedram.razavi@gmail.com>
-
Tyler Michael Smith authored
[Bugfix] Reject non-nvfp4 dtypes when using the flashinfer_nvlink_one_sided all2all backend (#39717) Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
JartX authored
Signed-off-by:JartX <sagformas@epdcenter.es>
-
Yongye Zhu authored
Signed-off-by:Yongye Zhu <zyy1102000@gmail.com>
-
Santino Ramos authored
Signed-off-by:Santino Ramos <santinor@inferact.ai>
-
Yi Liu authored
Signed-off-by:yiliu30 <yi4.liu@intel.com>
-
Tihomir Elek authored
Signed-off-by:Tihomir Elek <tiho.elek@gmail.com>
-
zofia authored
Signed-off-by:Zhu, Zufang <zufang.zhu@intel.com>
-
Jesus Federico authored
Signed-off-by:
Jesus Federico <jefp@amazon.com> Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by:
wang.yuqi <yuqi.wang@daocloud.io>
-
- 12 Apr, 2026 3 commits
-
-
Le Yang authored
-
r266-tech authored
Signed-off-by:
r266-tech <r266.tech@gmail.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
Yan Ma authored
Signed-off-by:Yan Ma <yan.ma@intel.com>
-
- 11 Apr, 2026 4 commits
-
-
EdalatiAli authored
Signed-off-by:EdalatiAli <aliedalati@cohere.com>
-
ShubyM authored
Signed-off-by:ShubyM <shubymishra20@gmail.com>
-
Vibhav Agarwal authored
Signed-off-by:
Vibhav Agarwal <vibhavagarwal5@gmail.com> Co-authored-by:
vibhav-agarwal <vibhav.agarwal@glance.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
Lee Yongjun authored
Signed-off-by:leeyongjun <jqueen.astro@gmail.com>
-
- 10 Apr, 2026 6 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Fynn Schmitt-Ulms authored
Signed-off-by:
Rahul-Tuli <rtuli@redhat.com> Signed-off-by:
Fynn Schmitt-Ulms <fschmitt@redhat.com> Co-authored-by:
Rahul-Tuli <rtuli@redhat.com> Co-authored-by:
Claude <noreply@anthropic.com>
-
Manu authored
Signed-off-by:manu <fortin.emmanuel@gmail.com>
-
Jesus Federico authored
Signed-off-by:
Jesus Federico <jefp@amazon.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-