- 30 Jan, 2026 1 commit
-
-
杨朱 · Kiki authored
Signed-off-by:
carlory <baofa.fan@daocloud.io> Co-authored-by:
Claude Opus 4.5 <noreply@anthropic.com>
-
- 28 Jan, 2026 1 commit
-
-
Or Ozeri authored
Signed-off-by:
Or Ozeri <oro@il.ibm.com> Co-authored-by:
Kevin H. Luu <khluu000@gmail.com>
-
- 27 Jan, 2026 3 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
omerpaz95 authored
Added queries and hits metrics for the Offloading Connector. Also added timing metrics for store and load operations, which take the average time it takes to load/store, per-token. The metrics are available from Prometheus and from the StatLogger. Signed-off-by:
omerpaz95 <omerpaz95@gmail.com> Co-authored-by:
Omer Paz <Omer.Paz@ibm.com>
-
- 26 Jan, 2026 1 commit
-
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 22 Jan, 2026 1 commit
-
-
liranschour authored
Signed-off-by:
Liran Schour <lirans@il.ibm.com> Signed-off-by:
liranschour <liranschour@users.noreply.github.com> Co-authored-by:
Or Ozeri <or@ozery.com>
-
- 21 Jan, 2026 1 commit
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 19 Jan, 2026 2 commits
-
-
qli88 authored
Signed-off-by:
Qiang Li <qiang.li2@amd.com> Signed-off-by:
Matthew Wong <Matthew.Wong2@amd.com>
-
Nicolò Lucchesi authored
Add a new metric to track the number of requests that had their KV blocks expire. The scenario is particularly important to surface and track as it is a vital indicator of the health of the deployment. Currently we're resorting to track these failures through unstructured log parsing (which is, among other thing, error string dependent); current main: > Releasing expired KV blocks for request cmpl-071d which were retrieved by 0 decode worker(s) within 0 seconds. Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 18 Jan, 2026 1 commit
-
-
Deming authored
-
- 13 Jan, 2026 3 commits
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Mathis Felardos authored
Signed-off-by:Mathis Felardos <mathis@mistral.ai>
-
Martin Hickey authored
Signed-off-by:
Martin Hickey <martin.hickey@ie.ibm.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 12 Jan, 2026 4 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
dtc authored
Signed-off-by:
Tianchen Ding <dtcccc@linux.alibaba.com> Co-authored-by:
Nicolò Lucchesi <nicolo.lucchesi@gmail.com>
-
- 11 Jan, 2026 1 commit
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 10 Jan, 2026 1 commit
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 09 Jan, 2026 4 commits
-
-
Chendi.Xue authored
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
inkcherry authored
Signed-off-by:inkcherry <mingzhi.liu@amd.com>
-
Bofeng Xue authored
Signed-off-by:
Bofeng BF1 Xue <xuebf1@Lenovo.com> Co-authored-by:
Bofeng BF1 Xue <xuebf1@Lenovo.com>
-
- 07 Jan, 2026 4 commits
-
-
Kfir Toledo authored
Signed-off-by:Kfir Toledo <kfir.toledo@ibm.com>
-
BlankR authored
Signed-off-by:
BlankR <hjyblanche@gmail.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
weiyu authored
Signed-off-by:
Wei-Yu Lin <weiyulin@google.com> Signed-off-by:
weiyu <62784299+weiyu0824@users.noreply.github.com>
-
Yihua Cheng authored
Signed-off-by:ApostaC <yihua98@uchicago.edu>
-
- 05 Jan, 2026 1 commit
-
-
Yihua Cheng authored
Signed-off-by:ApostaC <yihua98@uchicago.edu>
-
- 02 Jan, 2026 1 commit
-
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Signed-off-by:
njhill <nickhill123@gmail.com>
-
- 30 Dec, 2025 1 commit
-
-
Sage authored
[Prefix Cache] Include lora_name in BlockStored event for deterministic KV-cache reconstruction (#27577) Signed-off-by:
Sage Ahrac <sagiahrak@gmail.com> Co-authored-by:
Sage <80211083+sagiahrac@users.noreply.github.com>
-
- 29 Dec, 2025 2 commits
-
-
qli88 authored
Signed-off-by:qli88 <qiang.li2@amd.com>
-
chunxiaozheng authored
Signed-off-by:idellzheng <idellzheng@tencent.com>
-
- 24 Dec, 2025 1 commit
-
-
Chao Lei authored
Signed-off-by:LCAIZJ <leichao139636@163.com>
-
- 18 Dec, 2025 4 commits
-
-
Chendi.Xue authored
Signed-off-by:
Chendi Xue <chendi.xue@intel.com> Signed-off-by:
Chendi.Xue <chendi.xue@intel.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
wz1qqx authored
Signed-off-by:
wz1qqx <ziqi.wang@novita.ai> Co-authored-by:
wz1qqx <ziqi.wang@novita.ai>
-
Yihua Cheng authored
Signed-off-by:ApostaC <yihua98@uchicago.edu>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 14 Dec, 2025 2 commits
-
-
Chendi.Xue authored
Signed-off-by:
Chendi Xue <chendi.xue@intel.com> Signed-off-by:
Mark McLoughlin <markmc@redhat.com> Co-authored-by:
Mark McLoughlin <markmc@redhat.com>
-
Qier Li authored
Co-authored-by:Qier Li <qier@fb.com>
-