- 13 Feb, 2026 3 commits
-
-
Wentao Ye authored
Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
Martin Hickey authored
Signed-off-by:Martin Hickey <martin.hickey@ie.ibm.com>
-
Jaewon authored
Signed-off-by:
Jaewon Lee <jaewon@meta.com> Co-authored-by:
Lu Fang <30275821+houseroad@users.noreply.github.com>
-
- 10 Feb, 2026 5 commits
-
-
Ilya Markov authored
Signed-off-by:ilmarkov <markovilya197@gmail.com>
-
Ilya Markov authored
Signed-off-by:
ilmarkov <markovilya197@gmail.com> Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
Qi Wang authored
Signed-off-by:Qi Wang <qiwa@nvidia.com>
-
Zetong Li authored
Signed-off-by:Zetong Li <slippersss@126.com>
-
Yuwei An authored
Signed-off-by:Oasis-Git <ayw.sirius19@gmail.com>
-
- 09 Feb, 2026 1 commit
-
-
ZhengHongming888 authored
Signed-off-by:
Hongming Zheng <hongming.zheng@intel.com> Signed-off-by:
ZhengHongming888 <hongming.zheng@intel.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 06 Feb, 2026 1 commit
-
-
Seiji Eicher authored
Signed-off-by:Seiji Eicher <seiji@anyscale.com>
-
- 05 Feb, 2026 4 commits
-
-
zackyoray authored
Signed-off-by:Yoray Zack <yorayz@nvidia.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Aaron Hao authored
Signed-off-by:
ahao-anyscale <ahao@anyscale.com> Signed-off-by:
Aaron Hao <ahao@anyscale.com> Co-authored-by:
SumanthRH <sumanthrh99@gmail.com>
-
liranschour authored
Signed-off-by:
Liran Schour <lirans@il.ibm.com> Signed-off-by:
liranschour <liranschour@users.noreply.github.com> Co-authored-by:
Or Ozeri <or@ozery.com> Co-authored-by:
Nicolò Lucchesi <nicolo.lucchesi@gmail.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
- 04 Feb, 2026 2 commits
-
-
Sage Moore authored
Change the type signature of MixtureOfExperts.expert_weights to MutableSequence[Sequence[Tensor]] (#33573) Signed-off-by:
Sage Moore <sagmoore@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Micah Williamson authored
Signed-off-by:Micah Williamson <micah.williamson@amd.com>
-
- 03 Feb, 2026 1 commit
-
-
dtc authored
Signed-off-by:
Tianchen Ding <dtcccc@linux.alibaba.com> Co-authored-by:
Nicolò Lucchesi <nicolo.lucchesi@gmail.com>
-
- 30 Jan, 2026 1 commit
-
-
杨朱 · Kiki authored
Signed-off-by:
carlory <baofa.fan@daocloud.io> Co-authored-by:
Claude Opus 4.5 <noreply@anthropic.com>
-
- 29 Jan, 2026 2 commits
-
-
Li, Jiang authored
Signed-off-by:
jiang1.li <jiang1.li@intel.com> Signed-off-by:
Li, Jiang <bigpyj64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Ilya Markov authored
Signed-off-by:ilmarkov <markovilya197@gmail.com>
-
- 28 Jan, 2026 2 commits
-
-
Angela Yi authored
Signed-off-by:angelayi <yiangela7@gmail.com>
-
Or Ozeri authored
Signed-off-by:
Or Ozeri <oro@il.ibm.com> Co-authored-by:
Kevin H. Luu <khluu000@gmail.com>
-
- 27 Jan, 2026 4 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
omerpaz95 authored
Added queries and hits metrics for the Offloading Connector. Also added timing metrics for store and load operations, which take the average time it takes to load/store, per-token. The metrics are available from Prometheus and from the StatLogger. Signed-off-by:
omerpaz95 <omerpaz95@gmail.com> Co-authored-by:
Omer Paz <Omer.Paz@ibm.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Signed-off-by:
Amir Klein <203507526+amirkl94@users.noreply.github.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
amirkl94 <203507526+amirkl94@users.noreply.github.com>
-
- 26 Jan, 2026 2 commits
-
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 22 Jan, 2026 4 commits
-
-
Fadi Arafeh authored
[CPU Backend] [Perf] Accelerate tensor-parallel/data-parallel inference across NUMA domains on Arm (#32792) Signed-off-by:Fadi Arafeh <fadi.arafeh@arm.com>
-
liranschour authored
Signed-off-by:
Liran Schour <lirans@il.ibm.com> Signed-off-by:
liranschour <liranschour@users.noreply.github.com> Co-authored-by:
Or Ozeri <or@ozery.com>
-
Alex Sun authored
Signed-off-by:Alex Sun <alex.s@amd.com>
-
knlnguyen1802 authored
Signed-off-by:knlnguyen1802 <knlnguyen1802@gmail.com>
-
- 21 Jan, 2026 1 commit
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 20 Jan, 2026 2 commits
-
-
YiSheng5 authored
Signed-off-by:yisheng <yi.sheng@intel.com>
-
Walter Beller-Morales authored
Signed-off-by:walterbm <walter.beller.morales@gmail.com>
-
- 19 Jan, 2026 2 commits
-
-
qli88 authored
Signed-off-by:
Qiang Li <qiang.li2@amd.com> Signed-off-by:
Matthew Wong <Matthew.Wong2@amd.com>
-
Nicolò Lucchesi authored
Add a new metric to track the number of requests that had their KV blocks expire. The scenario is particularly important to surface and track as it is a vital indicator of the health of the deployment. Currently we're resorting to track these failures through unstructured log parsing (which is, among other thing, error string dependent); current main: > Releasing expired KV blocks for request cmpl-071d which were retrieved by 0 decode worker(s) within 0 seconds. Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 18 Jan, 2026 2 commits
-
-
Deming authored
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
- 16 Jan, 2026 1 commit
-
-
Ilya Markov authored
Signed-off-by:
ilmarkov <markovilya197@gmail.com> Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com>
-