- 16 Apr, 2026 1 commit
-
-
Fynn Schmitt-Ulms authored
Signed-off-by:
Rahul-Tuli <rtuli@redhat.com> Signed-off-by:
Fynn Schmitt-Ulms <fschmitt@redhat.com> Co-authored-by:
Rahul-Tuli <rtuli@redhat.com> Co-authored-by:
Claude <noreply@anthropic.com> (cherry picked from commit e7cfd7c5)
-
- 01 Apr, 2026 4 commits
-
-
Chauncey authored
Signed-off-by:
chaunceyjiang <chaunceyjiang@gmail.com> (cherry picked from commit cbe7d180)
-
Yifan Qiao authored
Signed-off-by:
Yifan Qiao <yifanqiao@berkeley.edu> Signed-off-by:
Yifan Qiao <yifanqiao@inferact.ai> (cherry picked from commit 91e4521f)
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> (cherry picked from commit eb474549)
-
Matthew Bonanni authored
Signed-off-by:
SandishKumarHN <sandishkumarhn@gmail.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
SandishKumarHN <sandishkumarhn@gmail.com> (cherry picked from commit 757068dc)
-
- 31 Mar, 2026 1 commit
-
-
Li, Jiang authored
Signed-off-by:
jiang1.li <jiang1.li@intel.com> (cherry picked from commit 6557f493)
-
- 29 Mar, 2026 3 commits
-
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
Wentao Ye authored
[Perf] Remove redundant device copies for CPU-only pooling token IDs, 48.9% E2E throughput improvement (#38139) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 27 Mar, 2026 3 commits
-
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@inferact.ai>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
Bvicii authored
Signed-off-by:
Bvicii <yizhanhuang2002@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 26 Mar, 2026 8 commits
-
-
yzong-rh authored
Signed-off-by:Yifan Zong <yzong@redhat.com>
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@inferact.ai>
-
Stig-Arne Grönroos authored
Signed-off-by:Stig-Arne Grönroos <stig-arne.gronroos@amd.com>
-
jennyyyyzhen authored
Signed-off-by:
jennyyyyzhen <yzhen@hmc.edu> Co-authored-by:
yZhen <yZhen@fb.com>
-
haosdent authored
Signed-off-by:
haosdent <haosdent@gmail.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>
-
Chuan (Richard) Li authored
Signed-off-by:Li <chuali@amd.com>
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk@inferact.ai> Signed-off-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 25 Mar, 2026 10 commits
-
-
Sathish Sanjeevi authored
Signed-off-by:Sathish Sanjeevi <sathish.krishnan.p.s@gmail.com>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Woosuk Kwon <woosuk@inferact.ai> Co-authored-by:
Woosuk Kwon <woosuk@inferact.ai>
-
Andrii Skliar authored
Signed-off-by:
Andrii Skliar <askliar@nvidia.com> Signed-off-by:
[Andrii Skliar] <askliar@nvidia.com> Co-authored-by:
Andrii Skliar <askliar@nvidia.com>
-
Matthias Gehre authored
Signed-off-by:Matthias Gehre <matthias.gehre@amd.com>
-
Gregory Shtrasberg authored
Signed-off-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Signed-off-by:
Micah Williamson <micah.williamson@amd.com> Co-authored-by:
Micah Williamson <micah.williamson@amd.com>
-
Chauncey authored
[Revert] Remove CUDA torch fallbacks for fp8_mqa_logits/fp8_paged_mqa_logits_torch function (#37968) Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 24 Mar, 2026 10 commits
-
-
Junhao authored
Signed-off-by:Junhao Li <junhao@ubicloud.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk@inferact.ai>
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
liangel-02 authored
Signed-off-by:Angel Li <liangel@meta.com>
-
Willy Hardy authored
Signed-off-by:
Willy Hardy <whardy@redhat.com> Signed-off-by:
Will Hardy <whardy@redhat.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
Ming Yang authored
-
Dan Blanaru authored
Signed-off-by:
Dan Blanaru <48605845+DanBlanaru@users.noreply.github.com> Co-authored-by:
Claude <noreply@anthropic.com>
-
Sungjae Lee authored
Signed-off-by:
Sungjae Lee <33976427+llsj14@users.noreply.github.com> Signed-off-by:
Sungjae Lee <sung-jae.lee@navercorp.com> Signed-off-by:
Chauncey <chaunceyjiang@gmail.com> Co-authored-by:
Chauncey <chaunceyjiang@gmail.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Ronen Schaffer authored
[KV Offload] Refactor CPU offloading: pluggable CachePolicy, remove Backend abstraction, restructure into `cpu/` package (#37874) Signed-off-by:Ronen Schaffer <ronen.schaffer@ibm.com>
-