- 01 Apr, 2026 19 commits
-
-
Elvir Crnčević authored
[Bugfix] Revert "Zero-init MLA attention output buffers to prevent NaN from CUDA graph padding" (#38359) Signed-off-by:
Elvir Crncevic <elvircrn@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
손세정 authored
Signed-off-by:
AAISSJ <maze0717@g.skku.edu> Signed-off-by: <> Co-authored-by:
세덩 <saison@sedeong-ui-MacBookAir.local>
-
yjz authored
Signed-off-by:
JianDan0212 <zhangyj0212@gmail.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
Juan Pérez de Algaba authored
Signed-off-by:jperezde <jperezde@redhat.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Zhanda Zhu authored
Signed-off-by:Zhanda Zhu <zhandazhu@gmail.com>
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Jeffrey Wang authored
Signed-off-by:Jeffrey Wang <jeffreywang@anyscale.com>
-
Augusto Yao authored
Signed-off-by:
augusto.yjh <augusto.yjh@antgroup.com> Signed-off-by:
wang.yuqi <noooop@126.com> Co-authored-by:
wang.yuqi <yuqi.wang@daocloud.io> Co-authored-by:
wang.yuqi <noooop@126.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
HarshRathva authored
Signed-off-by:
HarshRathva <harshrathvaai@gmail.com> Co-authored-by:
Or Ozeri <oro@il.ibm.com>
-
Samu Tamminen authored
Signed-off-by:
Samu Tamminen <stammine@amd.com> Co-authored-by:
Tuukka Sarvi <tuukka.sarvi@amd.com>
-
Ben Browning authored
Signed-off-by:
Ben Browning <bbrownin@redhat.com> Co-authored-by:
Chauncey <chaunceyjiang@gmail.com>
-
Luka Govedič authored
Signed-off-by:
Luka Govedič <lgovedic@redhat.com> Signed-off-by:
Xinyu Chen <xinyu1.chen@intel.com> Signed-off-by:
chzhang <chaojun.zhang@intel.com> Signed-off-by:
Luka Govedic <luka.govedic@gmail.com> Co-authored-by:
Xinyu Chen <xinyu1.chen@intel.com> Co-authored-by:
Chaojun Zhang <chaojun.zhang@intel.com> Co-authored-by:
Luka Govedič <ProExpertProg@h100-01.nemg-001.lab.rdu2.dc.redhat.com>
-
Elvir Crnčević authored
Signed-off-by:
Elvir Crncevic <elvircrn@gmail.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
Yifan Qiao authored
Signed-off-by:
Yifan Qiao <yifanqiao@berkeley.edu> Signed-off-by:
Yifan Qiao <yifanqiao@inferact.ai>
-
- 31 Mar, 2026 21 commits
-
-
Stig-Arne Grönroos authored
Signed-off-by:
Stig-Arne Grönroos <stig-arne.gronroos@amd.com> Signed-off-by:
Stig-Arne Grönroos <sgronroo@amd.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>
-
Vedant V Jhaveri authored
Signed-off-by:
Vedant Jhaveri <vjhaveri@linkedin.com> Co-authored-by:
Vedant Jhaveri <vjhaveri@linkedin.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Chang Su authored
Signed-off-by:Chang Su <chang.s.su@oracle.com>
-
Asaf Gardin authored
Signed-off-by:Josephasafg <ajgard7@gmail.com>
-
Yanan Cao authored
Signed-off-by:
Yanan Cao <gmagogsfm@gmail.com> Co-authored-by:
Claude Sonnet 4 <noreply@anthropic.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
czhu-cohere authored
Signed-off-by:root <conway.zhu@cohere.com>
-
yzong-rh authored
Signed-off-by:
Yifan Zong <yzong@redhat.com> Signed-off-by:
Robert Shaw <robshaw@redhat.com> Signed-off-by:
yzong-rh <yzong@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Olya Kozlova authored
Signed-off-by:Olya Kozlova <okozlova@nvidia.com>
-
Xu Jinyang authored
[Model] Sync upstream BT=chunk_size fix for GDN chunk_fwd_kernel_o, simplify warmup to single pass (#38343) Signed-off-by:
AuYang <459461160@qq.com> Co-authored-by:
Jiangyun Zhu <riverclouds.zhu@qq.com>
-
BadrBasowid authored
Signed-off-by:
BadrBasowid <badr.basowid@gmail.com> Co-authored-by:
vllmellm <vllm.ellm@embeddedllm.com>
-
Run Yu authored
Signed-off-by:Run Yu <yurun00@gmail.com>
-
mikaylagawarecki authored
Signed-off-by:Mikayla Gawarecki <mikaylagawarecki@gmail.com>
-
Yi Liu authored
Signed-off-by:yiliu30 <yi4.liu@intel.com>
-
SandishKumarHN authored
Signed-off-by:SandishKumarHN <sandish@fb.com>
-
zhang-prog authored
Signed-off-by:zhangyue66 <zhangyue66@baidu.com>
-
Jingu Kang authored
Signed-off-by:Jingu Kang <jg.k@navercorp.com>
-
Matthew Bonanni authored
Signed-off-by:
SandishKumarHN <sandishkumarhn@gmail.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
SandishKumarHN <sandishkumarhn@gmail.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
wliao2 authored
Signed-off-by:
Liao, Wei <wei.liao@intel.com> Signed-off-by:
wliao2 <wei.liao@intel.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-