- 31 Mar, 2026 9 commits
-
-
Kfir Toledo authored
[kv_offload+HMA] Fix num_blocks with different per-layer page sizes and improve assert message (#38554) Signed-off-by:
Kfir Toledo <kfir.toledo@ibm.com> Co-authored-by:
Or Ozeri <oro@il.ibm.com>
-
Flora Feng authored
Signed-off-by:sfeng33 <4florafeng@gmail.com>
-
Louie Tsai authored
Signed-off-by:louie-tsai <louie.tsai@intel.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
Flora Feng authored
Signed-off-by:sfeng33 <4florafeng@gmail.com>
-
Martin Hickey authored
Signed-off-by:Martin Hickey <martin.hickey@ie.ibm.com>
-
Lucas Kabela authored
Signed-off-by:Lucas Kabela <lucaskabela@meta.com>
-
Flora Feng authored
Signed-off-by:sfeng33 <4florafeng@gmail.com>
-
sungsoo ha authored
Signed-off-by:
Sungsoo Ha <sungsooh@nvidia.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
- 30 Mar, 2026 31 commits
-
-
Vadim Gimpelson authored
Signed-off-by:Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
Prathmesh Bhatt authored
Signed-off-by:Prathmesh Bhatt <71340361+Prathmesh234@users.noreply.github.com>
-
Netanel Haber authored
Restore non-hf processor path for Nano-Nemotron-VL (bypass `call_hf_processor_mm_only`) - fixes #38018 (#38567) Signed-off-by:
Netanel Haber <58652339+netanel-haber@users.noreply.github.com> Co-authored-by:
tomeras91 <57313761+tomeras91@users.noreply.github.com>
-
SandishKumarHN authored
Signed-off-by:
SandishKumarHN <sandish@fb.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>
-
Asaf Gardin authored
Signed-off-by:Josephasafg <ajgard7@gmail.com>
-
Micah Williamson authored
Signed-off-by:Micah Williamson <micah.williamson@amd.com>
-
Ilya Markov authored
Signed-off-by:ilmarkov <markovilya197@gmail.com>
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
mikaylagawarecki authored
Signed-off-by:Mikayla Gawarecki <mikaylagawarecki@gmail.com>
-
fangyuchu authored
Signed-off-by:
fangyuchu <fangyuchu@qq.com> Signed-off-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Chendi.Xue authored
[HMA]Fix corner case when hybrid page_size can not be evenly divided issue (blk_size=64,tp=4) (#37467) Signed-off-by:
Chendi Xue <chendi.xue@intel.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Chendi.Xue <chendi.xue@intel.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
Johnny authored
Signed-off-by:
johnnynunez <johnnynuca14@gmail.com> Signed-off-by:
Johnny <johnnynuca14@gmail.com>
-
roikoren755 authored
Signed-off-by:Roi Koren <roik@nvidia.com>
-
Hongxia Yang authored
Signed-off-by:
Hongxia Yang <hongxiay.yang@amd.com> Co-authored-by:
Hongxia Yang <hongxiay.yang@amd.com>
-
Matthias Gehre authored
Signed-off-by:
Matthias Gehre <matthias.gehre@amd.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
tomeras91 authored
Signed-off-by:Tomer Asida <57313761+tomeras91@users.noreply.github.com>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
Collin McCarthy authored
Signed-off-by:
Collin McCarthy <cmccarthy@nvidia.com> Signed-off-by:
Netanel Haber <58652339+netanel-haber@users.noreply.github.com> Co-authored-by:
Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
aliialsaeedii authored
Signed-off-by:aliialsaeedii <ali.al-saeedi@nscale.com>
-
yzong-rh authored
Signed-off-by:
Yifan Zong <yzong@redhat.com> Signed-off-by:
Yifan <yzong@redhat.com> Co-authored-by:
Chauncey <chaunceyjiang@gmail.com>
-
Nicolò Lucchesi authored
[Mamba][Bugfix] Raise on insufficient cache blocks instead of silently capping cudagraph sizes (#38270) Signed-off-by:NickLucche <nlucches@redhat.com>
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
haosdent authored
Signed-off-by:haosdent <haosdent@gmail.com>
-
Tan Pin Siang authored
Signed-off-by:Tan Pin Siang <pinsiang.tan@amd.com>
-
Juan Pérez de Algaba authored
Signed-off-by:jperezde <jperezde@redhat.com>
-
Jee Jee Li authored
Signed-off-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
PikaPikachu authored
Signed-off-by:kangletian <Letian.Kang@amd.com>
-
Kevin H. Luu authored
Signed-off-by:Kevin H. Luu <khluu000@gmail.com>
-