- 16 Apr, 2026 17 commits
-
-
Yanan Cao authored
Signed-off-by:
Yanan Cao <gmagogsfm@gmail.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
lalit10 authored
Signed-off-by:Lalit Laxminarayan Bangad <lalitbangad@gmail.com>
-
Netanel Haber authored
Signed-off-by:Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
Simon Mo authored
Signed-off-by:Simon Mo <simon.mo@hey.com>
-
Tim Messerschmidt authored
Signed-off-by:Tim Messerschmidt <timmesserschmidt@gmail.com>
-
Xinyu Chen authored
Signed-off-by:
Xinyu Chen <xinyu1.chen@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
Abhijit Roy authored
Signed-off-by:
Abhijit <abroy@redhat.com> Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Co-authored-by:
wang.yuqi <yuqi.wang@daocloud.io>
-
realliujiaxu authored
[Bugfix] add support for 'num_attention_groups' in ModelArchConfigConvertorBase for Step3p5 (#39796) Signed-off-by:realliujiaxu <realliujiaxu@163.com>
-
R3hankhan authored
Signed-off-by:Rehan Khan <Rehan.Khan7@ibm.com>
-
Fadi Arafeh authored
Signed-off-by:
Fadi Arafeh <fadi.arafeh@arm.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
jigangz authored
Signed-off-by:
Jigang Zhou <zjg0907008@gmail.com> Co-authored-by:
Claude <noreply@anthropic.com>
-
Julien Denize authored
Signed-off-by:juliendenize <julien.denize@mistral.ai>
-
Zhengxu Chen authored
Signed-off-by:zhxchen17 <zhxchen17@fb.com>
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@inferact.ai>
-
Asaf Gardin authored
Signed-off-by:Josephasafg <ajgard7@gmail.com>
-
- 15 Apr, 2026 23 commits
-
-
Harry Mellor authored
Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
khluu <khluu000@gmail.com> Signed-off-by:
Kevin H. Luu <khluu000@gmail.com> Signed-off-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
khluu <khluu000@gmail.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
jiang1.li <jiang1.li@intel.com>
-
Luciano Martins authored
Signed-off-by:
Luciano Martins <lucianommartins@users.noreply.github.com> Co-authored-by:
Luciano Martins <lucianommartins@users.noreply.github.com>
-
Collin McCarthy authored
Signed-off-by:Collin McCarthy <cmccarthy@nvidia.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
zhanqiuhu authored
Signed-off-by:Zhanqiu Hu <zhu@redhat.com>
-
zhanqiuhu authored
Signed-off-by:Zhanqiu Hu <zhu@redhat.com>
-
Kevin H. Luu authored
Signed-off-by:
Kevin H. Luu <khluu000@gmail.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
Zhewen Li authored
Signed-off-by:
Zhewen Li <zhewenli@inferact.ai> Co-authored-by:
Zhewen Li <zhewenli@inferact.ai> Co-authored-by:
OpenAI Codex <codex@openai.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Monishver authored
Signed-off-by:Monishver Chandrasekaran <monishverchandrasekaran@gmail.com>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
daniebrill authored
[BugFix] KeyError on scope["method"] for realtime api websocket in AuthenticationMiddleware (#36934) Signed-off-by:
daniebrill <50454544+daniebrill@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Roy Huang authored
[KVConnector][LMCache] Propagate cache_salt through MP connector for per-user cache isolation (#39837) Signed-off-by:
royyhuang <royyhuang@gmail.com> Signed-off-by:
royyhuang <roy.y.huang@gmail.com>
-
Talor Abramovich authored
Signed-off-by:
talora <talora@nvidia.com> Co-authored-by:
Benjamin Chislett <bchislett@nvidia.com>
-
Yan Ma authored
Signed-off-by:
Yan Ma <yan.ma@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
danielafrimi authored
Signed-off-by: <> Co-authored-by:root <root@lyris0144.lyris.clusters.nvidia.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
zofia authored
Signed-off-by:Zhu, Zufang <zufang.zhu@intel.com>
-
Csrayz authored
[Metrics] Add request_id to FinishedRequestStats to enable correlation between metrics and requests (#39710) Enables external `StatLogger` plugins to correlate per-request metrics with request-level context. Also, this is a pre-requisite for Prometheus exemplars in #30972. Signed-off-by:Csrayz <33659823+Csrayz@users.noreply.github.com>
-
Zhenzhong Xu authored
Signed-off-by:
Zhenzhong1 <zhenzhong.xu@intel.com> Signed-off-by:
Zhenzhong Xu <zhenzhong.xu@intel.com>
-
Yan Ma authored
Signed-off-by:Yan Ma <yan.ma@intel.com>
-