- 07 Apr, 2026 3 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Chendi.Xue authored
Signed-off-by:
Chendi Xue <chendi.xue@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
Andrew Barnes authored
Signed-off-by:Bortlesboat <bortstheboat@gmail.com>
-
- 06 Apr, 2026 2 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Lukas Geiger authored
[Models][GDN] Remove GPU/CPU syncs in `GDNAttentionMetadata.build` during speculative decoding (#38047) Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
- 03 Apr, 2026 5 commits
-
-
Artem Perevedentsev authored
Signed-off-by:
Artem Perevedentsev <aperevedents@nvidia.com> Signed-off-by:
Vadim Gimpelson <156319763+vadiklyutiy@users.noreply.github.com>
-
wufann authored
Signed-off-by:
wufann <36477220+wufann@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
shunting314 authored
Signed-off-by:shunting314 <shunting@meta.com>
-
Carl Y authored
Signed-off-by:Carl You <4531192+carlyou@users.noreply.github.com>
-
Carl Y authored
Signed-off-by:
Carl You <4531192+carlyou@users.noreply.github.com> Signed-off-by:
Carl Y <4531192+carlyou@users.noreply.github.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
- 02 Apr, 2026 3 commits
-
-
Koushik Dutta authored
Signed-off-by:
Koushik Dutta <koushd@gmail.com> Co-authored-by:
root <root@ubuntu-nvidia.localdomain> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
JartX authored
Signed-off-by:
JartX <sagformas@epdcenter.es> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
yangyang4991 <yangyang4991@gmail.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
Isotr0py <2037008807@qq.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
- 01 Apr, 2026 6 commits
-
-
yzong-rh authored
Signed-off-by:
Yifan <yzong@redhat.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
Elvir Crnčević authored
[Bugfix] Revert "Zero-init MLA attention output buffers to prevent NaN from CUDA graph padding" (#38359) Signed-off-by:
Elvir Crncevic <elvircrn@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Samu Tamminen authored
Signed-off-by:
Samu Tamminen <stammine@amd.com> Co-authored-by:
Tuukka Sarvi <tuukka.sarvi@amd.com>
-
- 31 Mar, 2026 3 commits
-
-
Olya Kozlova authored
Signed-off-by:Olya Kozlova <okozlova@nvidia.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
sungsoo ha authored
Signed-off-by:
Sungsoo Ha <sungsooh@nvidia.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
- 30 Mar, 2026 4 commits
-
-
Prathmesh Bhatt authored
Signed-off-by:Prathmesh Bhatt <71340361+Prathmesh234@users.noreply.github.com>
-
SandishKumarHN authored
Signed-off-by:
SandishKumarHN <sandish@fb.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
Chendi.Xue authored
[HMA]Fix corner case when hybrid page_size can not be evenly divided issue (blk_size=64,tp=4) (#37467) Signed-off-by:
Chendi Xue <chendi.xue@intel.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Chendi.Xue <chendi.xue@intel.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
- 29 Mar, 2026 1 commit
-
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 27 Mar, 2026 1 commit
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 26 Mar, 2026 5 commits
-
-
Stig-Arne Grönroos authored
Signed-off-by:Stig-Arne Grönroos <stig-arne.gronroos@amd.com>
-
jennyyyyzhen authored
Signed-off-by:
jennyyyyzhen <yzhen@hmc.edu> Co-authored-by:
yZhen <yZhen@fb.com>
-
haosdent authored
Signed-off-by:
haosdent <haosdent@gmail.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>
-
Chuan (Richard) Li authored
Signed-off-by:Li <chuali@amd.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 25 Mar, 2026 3 commits
-
-
Sathish Sanjeevi authored
Signed-off-by:Sathish Sanjeevi <sathish.krishnan.p.s@gmail.com>
-
Gregory Shtrasberg authored
Signed-off-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Signed-off-by:
Micah Williamson <micah.williamson@amd.com> Co-authored-by:
Micah Williamson <micah.williamson@amd.com>
-
Chauncey authored
[Revert] Remove CUDA torch fallbacks for fp8_mqa_logits/fp8_paged_mqa_logits_torch function (#37968) Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 24 Mar, 2026 2 commits
-
-
liangel-02 authored
Signed-off-by:Angel Li <liangel@meta.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 23 Mar, 2026 2 commits
-
-
Ranran authored
Signed-off-by:
Ranran <1012869439@qq.com> Signed-off-by:
Ranran <hzz5361@psu.edu> Signed-off-by:
ran <hzz5361@psu.edu> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Chuan (Richard) Li authored
Signed-off-by:Li <chuali@amd.com>
-