- 02 Apr, 2026 5 commits
-
-
Luciano Martins authored
feat(models): implement Google Gemma 4 architecture support (MoE, Multimodal, Reasoning, Tool-Use) (#38826) Signed-off-by:
Luciano Martins <lucianommartins@users.noreply.github.com> Signed-off-by:
Luciano Martins <lucianomartins@google.com> Co-authored-by:
Luciano Martins <lucianommartins@users.noreply.github.com> Co-authored-by:
Isotr0py <2037008807@qq.com>
-
bsliu authored
Signed-off-by:
bsliu <1187291748@qq.com> Signed-off-by:
吴炳贤 <wubingxian24@mails.ucas.ac.cn>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
- 01 Apr, 2026 4 commits
-
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
Zhanda Zhu authored
Signed-off-by:Zhanda Zhu <zhandazhu@gmail.com>
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 31 Mar, 2026 1 commit
-
-
zhang-prog authored
Signed-off-by:zhangyue66 <zhangyue66@baidu.com>
-
- 30 Mar, 2026 6 commits
-
-
Netanel Haber authored
Restore non-hf processor path for Nano-Nemotron-VL (bypass `call_hf_processor_mm_only`) - fixes #38018 (#38567) Signed-off-by:
Netanel Haber <58652339+netanel-haber@users.noreply.github.com> Co-authored-by:
tomeras91 <57313761+tomeras91@users.noreply.github.com>
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
Chendi.Xue authored
[HMA]Fix corner case when hybrid page_size can not be evenly divided issue (blk_size=64,tp=4) (#37467) Signed-off-by:
Chendi Xue <chendi.xue@intel.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Chendi.Xue <chendi.xue@intel.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
roikoren755 authored
Signed-off-by:Roi Koren <roik@nvidia.com>
-
Jee Jee Li authored
Signed-off-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
PikaPikachu authored
Signed-off-by:kangletian <Letian.Kang@amd.com>
-
- 29 Mar, 2026 3 commits
-
-
Wentao Ye authored
[Perf] Remove redundant device copies for CPU-only pooling token IDs, 48.9% E2E throughput improvement (#38139) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
allgather authored
Signed-off-by:allgather <all2allops@gmail.com>
-
haosdent authored
Signed-off-by:haosdent <haosdent@gmail.com>
-
- 27 Mar, 2026 1 commit
-
-
Xiaoshuang Wang authored
Signed-off-by:
wxsIcey <1790571317@qq.com> Signed-off-by:
Icey <1790571317@qq.com>
-
- 26 Mar, 2026 8 commits
-
-
Chuan (Richard) Li authored
Signed-off-by:Li <chuali@amd.com>
-
zhang-prog authored
[Fix] Remove unused packing_position_embedding from PaddleOCRVL for better checkpoint compatibility (#38232) Signed-off-by:zhangyue66 <zhangyue66@baidu.com>
-
Jared Wen authored
Signed-off-by:JaredforReal <w13431838023@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Terry Gao authored
Signed-off-by:tianrengao <terrygao87@gmail.com>
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
Wei Zhao authored
Signed-off-by:wzhao18 <wzhao18.sz@gmail.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 25 Mar, 2026 5 commits
-
-
Ekagra Ranjan authored
Signed-off-by:Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
grYe99 authored
Signed-off-by:
grYe99 <guorongye99@gmail.com> Co-authored-by:
grYe99 <guorongye99@gmail.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Matthias Gehre authored
Signed-off-by:Matthias Gehre <matthias.gehre@amd.com>
-
- 24 Mar, 2026 2 commits
-
-
Nick Cao authored
Signed-off-by:Nick Cao <ncao@redhat.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 23 Mar, 2026 5 commits
-
-
Yufeng He authored
-
Artem Perevedentsev authored
Signed-off-by:Artem Perevedentsev <aperevedents@nvidia.com>
-
Hojin Yang authored
Signed-off-by:
effortprogrammer <yhjhoward7@gmail.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
r266-tech authored
Co-authored-by:r266-tech <r266-tech@users.noreply.github.com>
-
Baorun (Lauren) Mu authored
Signed-off-by:Baorun Mu <bmu@nvidia.com>
-