- 16 Apr, 2026 14 commits
-
-
Nikita Shapovalov authored
[Bugfix] Fix Ray compiled-DAG SHM channel stalls by detaching zero-copy `np.ndarray` logprobs buffers (#35736) Signed-off-by:Nikita Shapovalov <nikita@poolside.ai>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
daiyu1111 authored
Signed-off-by:daiyu1111 <2356690121@qq.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
grYe99 authored
Signed-off-by:
grYe99 <guorongye99@gmail.com> Co-authored-by:
grYe99 <guorongye99@gmail.com>
-
Yanan Cao authored
Signed-off-by:
Yanan Cao <gmagogsfm@gmail.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
lalit10 authored
Signed-off-by:Lalit Laxminarayan Bangad <lalitbangad@gmail.com>
-
Abhijit Roy authored
Signed-off-by:
Abhijit <abroy@redhat.com> Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Co-authored-by:
wang.yuqi <yuqi.wang@daocloud.io>
-
realliujiaxu authored
[Bugfix] add support for 'num_attention_groups' in ModelArchConfigConvertorBase for Step3p5 (#39796) Signed-off-by:realliujiaxu <realliujiaxu@163.com>
-
Fadi Arafeh authored
Signed-off-by:
Fadi Arafeh <fadi.arafeh@arm.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
jigangz authored
Signed-off-by:
Jigang Zhou <zjg0907008@gmail.com> Co-authored-by:
Claude <noreply@anthropic.com>
-
Julien Denize authored
Signed-off-by:juliendenize <julien.denize@mistral.ai>
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Asaf Gardin authored
Signed-off-by:Josephasafg <ajgard7@gmail.com>
-
- 15 Apr, 2026 15 commits
-
-
Harry Mellor authored
Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
khluu <khluu000@gmail.com> Signed-off-by:
Kevin H. Luu <khluu000@gmail.com> Signed-off-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
khluu <khluu000@gmail.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
jiang1.li <jiang1.li@intel.com>
-
Collin McCarthy authored
Signed-off-by:Collin McCarthy <cmccarthy@nvidia.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
zhanqiuhu authored
Signed-off-by:Zhanqiu Hu <zhu@redhat.com>
-
zhanqiuhu authored
Signed-off-by:Zhanqiu Hu <zhu@redhat.com>
-
Zhewen Li authored
Signed-off-by:
Zhewen Li <zhewenli@inferact.ai> Co-authored-by:
Zhewen Li <zhewenli@inferact.ai> Co-authored-by:
OpenAI Codex <codex@openai.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
Csrayz authored
[Metrics] Add request_id to FinishedRequestStats to enable correlation between metrics and requests (#39710) Enables external `StatLogger` plugins to correlate per-request metrics with request-level context. Also, this is a pre-requisite for Prometheus exemplars in #30972. Signed-off-by:Csrayz <33659823+Csrayz@users.noreply.github.com>
-
Zhenzhong Xu authored
Signed-off-by:
Zhenzhong1 <zhenzhong.xu@intel.com> Signed-off-by:
Zhenzhong Xu <zhenzhong.xu@intel.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
Wojciech Wais authored
Signed-off-by:
Wojciech Wais <wojciech.wais@gmail.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Xinyu Chen <xinyu1.chen@intel.com> Signed-off-by:
Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com> Signed-off-by:
Rishi Puri <riship@nvidia.com> Signed-off-by:
Jaebok Lee <jaebok9541@naver.com> Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by:
yuwei <yuwei@dev.local> Signed-off-by:
Artem Perevedentsev <aperevedents@nvidia.com> Signed-off-by:
Ibrahim Arshad <38925737+ibrahim1023@users.noreply.github.com> Signed-off-by:
Li <chuali@amd.com> Signed-off-by:
chaunceyjiang <chaunceyjiang@gmail.com> Signed-off-by:
Kunshang Ji <kunshang.ji@intel.com> Signed-off-by:
Kunshang Ji <jikunshang95@gmail.com> Signed-off-by:
R <Ganesh.R@amd.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
lkm2835 <lkm2835@gmail.com> Signed-off-by:
Ronen Schaffer <ronen.schaffer@ibm.com> Signed-off-by:
vnadathur <glvikramn@gmail.com> Signed-off-by:
WorldExplored <srreyansh.sethi@gmail.com> Signed-off-by:
Srreyansh Sethi <107075589+WorldExplored@users.noreply.github.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by:
Elham Harirpoush <elham.harirpoush@arm.com> Signed-off-by:
Yan Ma <yan.ma@intel.com> Signed-off-by:
Nick Hill <nickhill123@gmail.com> Signed-off-by:
jackcfwang <jackcfwang@tencent.com> Signed-off-by:
Chendi Xue <chendi.xue@intel.com> Signed-off-by:
Injae Ryou <injaeryou@gmail.com> Signed-off-by:
Richard Zou <zou3519@gmail.com> Signed-off-by:
milesial <milesial@users.noreply.github.com> Signed-off-by:
Elvir Crncevic <elvircrn@gmail.com> Signed-off-by:
whx-sjtu <2952154980@qq.com> Signed-off-by:
Lalithnarayan C <Lalithnarayan.C@amd.com> Signed-off-by:
PatchouliTaisa <patchychen@tencent.com> Signed-off-by:
jatseng-ai <jatseng@amd.com> Signed-off-by:
jatseng-ai <janet.tseng@amd.com> Signed-off-by:
Matthias Gehre <matthias.gehre@amd.com> Signed-off-by:
xaguilar-amd <xaguilar@amd.com> Signed-off-by:
rdondeti <ravitez.dondeti@gmail.com> Signed-off-by:
Ravitez Dondeti <ravitez.dondeti@gmail.com> Signed-off-by:
NickLucche <nlucches@redhat.com> Signed-off-by:
Peter Nguyen <petern0408@gmail.com> Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by:
zhuhaoran <zhuhaoran.zhr@alibaba-inc.com> Signed-off-by:
Jee Jee Li <pandaleefree@gmail.com> Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Signed-off-by:
Jesus Federico <jefp@amazon.com> Signed-off-by:
manu <fortin.emmanuel@gmail.com> Signed-off-by:
ZhanqiuHu <zhu@redhat.com> Signed-off-by:
Yifan Zong <yzong@redhat.com> Signed-off-by:
Rahul-Tuli <rtuli@redhat.com> Signed-off-by:
Fynn Schmitt-Ulms <fschmitt@redhat.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Signed-off-by:
Benjamin Chislett <bchislett@nvidia.com> Signed-off-by:
Tianyu Guo <guoty9@mail2.sysu.edu.cn> Signed-off-by:
leeyongjun <jqueen.astro@gmail.com> Signed-off-by:
Ziying Tao <tzzying@outlook.com> Signed-off-by:
jiang1.li <jiang1.li@intel.com> Signed-off-by:
Vibhav Agarwal <vibhavagarwal5@gmail.com> Signed-off-by:
ShubyM <shubymishra20@gmail.com> Signed-off-by:
wzhao18 <wzhao18.sz@gmail.com> Signed-off-by:
Itay Etelis <itay.etelis@ibm.com> Signed-off-by:
EdalatiAli <aliedalati@cohere.com> Signed-off-by:
Andreas Karatzas <akaratza@amd.com> Signed-off-by:
r266-tech <r266.tech@gmail.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Signed-off-by:
Martin Hickey <martin.hickey@ie.ibm.com> Signed-off-by:
Mark McLoughlin <markmc@redhat.com> Signed-off-by:
Animesh Jain <anijain@umich.edu> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Signed-off-by:
zhxchen17 <zhxchen17@fb.com> Signed-off-by:
EricccYang <yangyang4991@gmail.com> Signed-off-by:
Kaicheng Yang <53411596+EricccYang@users.noreply.github.com> Signed-off-by:
baoloongmao <baoloongmao@tencent.com> Signed-off-by:
sihao.li <sihao.li@intel.com> Signed-off-by:
sfeng33 <4florafeng@gmail.com> Signed-off-by:
Yufeng He <40085740+he-yufeng@users.noreply.github.com> Signed-off-by:
Zhu, Zufang <zufang.zhu@intel.com> Signed-off-by:
Tihomir Elek <tiho.elek@gmail.com> Signed-off-by:
yiliu30 <yi4.liu@intel.com> Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Santino Ramos <santinor@inferact.ai> Signed-off-by:
haosdent <haosdent@gmail.com> Signed-off-by:
JartX <sagformas@epdcenter.es> Signed-off-by:
George-ao <yuyiao772@gmail.com> Signed-off-by:
Yuyi Ao <yuyiao772@gmail.com> Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Signed-off-by:
Mukesh Baphna <mukesh@hippocraticai.com> Signed-off-by:
Pedram Razavi <pedram.razavi@gmail.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Xinyu Chen <xinyu1.chen@intel.com> Co-authored-by:
Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com> Co-authored-by:
Rishi Puri <riship@nvidia.com> Co-authored-by:
zzaebok <44357534+zzaebok@users.noreply.github.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk> Co-authored-by:
Yuwei An <ayw.sirius19@gmail.com> Co-authored-by:
yuwei <yuwei@dev.local> Co-authored-by:
Artem Perevedentsev <aperevedents@nvidia.com> Co-authored-by:
Ibrahim Arshad <38925737+ibrahim1023@users.noreply.github.com> Co-authored-by:
Chuan (Richard) Li <chuali@amd.com> Co-authored-by:
Chauncey <chaunceyjiang@gmail.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com> Co-authored-by:
Ganesh R <ganesh.r@amd.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Kyungmin Lee <30465912+lkm2835@users.noreply.github.com> Co-authored-by:
Ronen Schaffer <ronen.schaffer@ibm.com> Co-authored-by:
Srreyansh Sethi <107075589+WorldExplored@users.noreply.github.com> Co-authored-by:
vnadathur <glvikramn@gmail.com> Co-authored-by:
vnadathur <236933696+vnadathur@users.noreply.github.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Elham <elham.harirpoush@arm.com> Co-authored-by:
Yan Ma <yan.ma@intel.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
Chaofan Wang <jackcfwang@tencent.com> Co-authored-by:
Chendi.Xue <chendi.xue@intel.com> Co-authored-by:
Injae Ryou <injaeryou@gmail.com> Co-authored-by:
Richard Zou <zou3519@users.noreply.github.com> Co-authored-by:
milesial <milesial@users.noreply.github.com> Co-authored-by:
Elvir Crnčević <elvircrn@gmail.com> Co-authored-by:
Claude Sonnet 4 <noreply@anthropic.com> Co-authored-by:
Hexiang Wang <56632993+whx-sjtu@users.noreply.github.com> Co-authored-by:
Lalithnarayan C <Lalithnarayan.C@amd.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
PatchyTIS <58251192+PatchouliTIS@users.noreply.github.com> Co-authored-by:
PatchouliTaisa <patchychen@tencent.com> Co-authored-by:
jatseng-ai <janet.tseng@amd.com> Co-authored-by:
Matthias Gehre <matthias.gehre@amd.com> Co-authored-by:
xaguilar-amd <xavier.aguilarfruto@amd.com> Co-authored-by:
Ravitez Dondeti <dondetir@users.noreply.github.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com> Co-authored-by:
Peter Nguyen <petern0408@gmail.com> Co-authored-by:
wang.yuqi <yuqi.wang@daocloud.io> Co-authored-by:
zhrrr <43847754+izhuhaoran@users.noreply.github.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com> Co-authored-by:
Jesus Federico <14651+jefp@users.noreply.github.com> Co-authored-by:
Manu <efortin@users.noreply.github.com> Co-authored-by:
zhanqiuhu <49648934+ZhanqiuHu@users.noreply.github.com> Co-authored-by:
yzong-rh <yzong@redhat.com> Co-authored-by:
Fynn Schmitt-Ulms <fschmitt@redhat.com> Co-authored-by:
Rahul-Tuli <rtuli@redhat.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Benjamin Chislett <bchislett@nvidia.com> Co-authored-by:
Tianyu Guo <guoty9@mail2.sysu.edu.cn> Co-authored-by:
Lee Yongjun <35302114+elwhyjay@users.noreply.github.com> Co-authored-by:
z1ying <55220715+z1ying@users.noreply.github.com> Co-authored-by:
Li, Jiang <jiang1.li@intel.com> Co-authored-by:
Vibhav Agarwal <vibhavagarwal5@gmail.com> Co-authored-by:
vibhav-agarwal <vibhav.agarwal@glance.com> Co-authored-by:
ShubyM <shubymishra20@gmail.com> Co-authored-by:
Wei Zhao <51183510+wzhao18@users.noreply.github.com> Co-authored-by:
Itay Etelis <92247226+Etelis@users.noreply.github.com> Co-authored-by:
Itay Etelis <itay.etelis@ibm.com> Co-authored-by:
EdalatiAli <aliedalati@cohere.com> Co-authored-by:
Andreas Karatzas <akaratza@amd.com> Co-authored-by:
r266-tech <r2668940489@gmail.com> Co-authored-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Martin Hickey <martin.hickey@ie.ibm.com> Co-authored-by:
Or Ozeri <or@ozery.com> Co-authored-by:
Mark McLoughlin <markmc@redhat.com> Co-authored-by:
Le Yang <562593859@qq.com> Co-authored-by:
Animesh Jain <anijain@umich.edu> Co-authored-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
Zhengxu Chen <zhxchen17@fb.com> Co-authored-by:
Kaicheng Yang <53411596+EricccYang@users.noreply.github.com> Co-authored-by:
maobaolong <baoloongmao@tencent.com> Co-authored-by:
sihao_li <165983188+1643661061leo@users.noreply.github.com> Co-authored-by:
Flora Feng <4florafeng@gmail.com> Co-authored-by:
Yufeng He <40085740+he-yufeng@users.noreply.github.com> Co-authored-by:
zofia <110436990+zufangzhu@users.noreply.github.com> Co-authored-by:
Tihomir Elek <tiho.elek@gmail.com> Co-authored-by:
Yi Liu <yi4.liu@intel.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
Santino Ramos <51103228+santiramos27@users.noreply.github.com> Co-authored-by:
haosdent <haosdent@gmail.com> Co-authored-by:
JartX <sagformas@epdcenter.es> Co-authored-by:
Yuyi Ao <yuyiao772@gmail.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
mukesh-hai <mukesh@hippocraticai.com> Co-authored-by:
Pedram Razavi <pedram@sierra.ai>
-
wliao2 authored
[Test] Refactor hard coded device string in test files under compile/quantization/models/model_executor folders (#38901) Signed-off-by:Liao, Wei <wei.liao@intel.com>
-
Vibhav Agarwal authored
Signed-off-by:
vibhavagarwal5 <vibhavagarwal5@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Xinyu Chen <xinyu1.chen@intel.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 14 Apr, 2026 11 commits
-
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
Jackmin801 authored
Signed-off-by:
Robert Shaw <robertgshaw2@gmail.com> Signed-off-by:
Jackmin801 <ongjackm@gmail.com> Co-authored-by:
Robert Shaw <robertgshaw2@gmail.com>
-
roikoren755 authored
Signed-off-by:Roi Koren <roik@nvidia.com>
-
zhanqiuhu authored
[CI][KVConnector][Metrics] Update multi KV connector edge case according to prefill stats changes (#39808) Signed-off-by:Zhanqiu Hu <zhu@redhat.com>
-
danielafrimi authored
Signed-off-by:
root <root@lyris0017.lyris.clusters.nvidia.com> Signed-off-by:
Daniel Afrimi <dafrimi@nvidia.com> Co-authored-by:
root <root@lyris0017.lyris.clusters.nvidia.com>
-
omerpaz95 authored
Signed-off-by:
omerpaz95 <omerpaz95@gmail.com> Co-authored-by:
Or Ozeri <oro@il.ibm.com>
-
Micah Williamson authored
Signed-off-by:Micah Williamson <micah.williamson@amd.com>
-
bnellnm authored
Signed-off-by:
Bill Nell <bnell@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Thomas authored
Signed-off-by:
thomasmaindron <thomasmaindron@users.noreply.github.com> Co-authored-by:
thomasmaindron <thomasmaindron@users.noreply.github.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
fxmarty-amd authored
[fix][MOE] Fix MOE experts `intermediate_size` dimension not being narrowed before weight loading (#39688) Signed-off-by:Felix Marty <Felix.Marty@amd.com>
-
Julien Debache authored
Signed-off-by:jdebache <jdebache@nvidia.com>
-