- 16 Jan, 2026 1 commit
-
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> (cherry picked from commit 1be5a735)
-
- 13 Jan, 2026 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
- 12 Jan, 2026 1 commit
-
-
Hongxin Xu authored
Signed-off-by:
xhx1022 <1737006628@qq.com> Signed-off-by:
Hongxin Xu <70438206+xhx1022@users.noreply.github.com> Signed-off-by:
arlenxu <arlenxu@tencent.com> Co-authored-by:
22quinn <33176974+22quinn@users.noreply.github.com> Co-authored-by:
arlenxu <arlenxu@tencent.com>
-
- 09 Jan, 2026 3 commits
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Max Hu authored
Signed-off-by:
Max Hu <maxhu@nvidia.com> Signed-off-by:
Max Hu <hyoung2991@gmail.com> Co-authored-by:
Max Hu <maxhu@nvidia.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Signed-off-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
- 08 Jan, 2026 2 commits
-
-
omer-dayan authored
Signed-off-by:
Omer Dayan <omdayan@nvidia.com> Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
prashanth058 authored
Signed-off-by:prashanth058 <prashanth.dannamaneni@uipath.com>
-
- 06 Jan, 2026 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 02 Jan, 2026 2 commits
-
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Signed-off-by:
njhill <nickhill123@gmail.com>
-
Reagan Lee authored
Signed-off-by:
Reagan Lee <reaganjlee@gmail.com> Signed-off-by:
Reagan <reaganjlee@gmail.com>
-
- 26 Dec, 2025 1 commit
-
-
Jee Jee Li authored
Signed-off-by:
bk-201 <joy25810@foxmail.com> Signed-off-by:
Jee Jee Li <pandaleefree@gmail.com> Signed-off-by:
prashanth058 <prashanth.dannamaneni@uipath.com> Co-authored-by:
bk-201 <joy25810@foxmail.com> Co-authored-by:
prashanth058 <prashanth.dannamaneni@uipath.com> Co-authored-by:
Anexdeus <5142168@mail.ru>
-
- 24 Dec, 2025 2 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 20 Dec, 2025 1 commit
-
-
Jeffrey Wang authored
Signed-off-by:Jeffrey Wang <jeffreywang@anyscale.com>
-
- 18 Dec, 2025 2 commits
-
-
Elizabeth Thomas authored
Signed-off-by:
Elizabeth Thomas <email2eliza@gmail.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
SungMinCho authored
Signed-off-by:
SungMinCho <tjdals4565@gmail.com> Signed-off-by:
Mark McLoughlin <markmc@redhat.com> Co-authored-by:
Mark McLoughlin <markmc@redhat.com>
-
- 17 Dec, 2025 1 commit
-
-
danielafrimi authored
Signed-off-by: <> Co-authored-by:root <root@gpu-937.slurm-workers-slurm.slurm.svc.cluster.local>
-
- 16 Dec, 2025 1 commit
-
-
jiangkuaixue123 authored
Signed-off-by:
jiangkuaixue123 <jiangxiaozhou111@163.com> Co-authored-by:
root <root@hk01dgx028.cm.cluster>
-
- 15 Dec, 2025 1 commit
-
-
Nicolò Lucchesi authored
[Misc][Hybrid allocator + kv connector] Optionally enable hybrid allocator + KV cache connector (#29805) Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 12 Dec, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 11 Dec, 2025 4 commits
-
-
Qiu authored
Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 09 Dec, 2025 1 commit
-
-
Benjamin Chislett authored
Signed-off-by:
Benjamin Chislett <bchislett@nvidia.com> Signed-off-by:
Benjamin Chislett <chislett.ben@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 07 Dec, 2025 2 commits
-
-
Cyrus Leung authored
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 06 Dec, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 05 Dec, 2025 3 commits
-
-
Tova Movshovitz authored
Signed-off-by:
tovam <tovam@pliops.com> Signed-off-by:
Tova Movshovitz <tovam@pliops.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
Max Hu authored
Signed-off-by:
Max Hu <hyoung2991@gmail.com> Signed-off-by:
Max Hu <maxhu@nvidia.com> Co-authored-by:
Max Hu <maxhu@nvidia.com>
-
- 03 Dec, 2025 1 commit
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
- 02 Dec, 2025 3 commits
-
-
Sage Moore authored
Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Zhuohan Li authored
Signed-off-by:
Zhuohan Li <zhuohan123@gmail.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
- 01 Dec, 2025 2 commits
-
-
shivampr authored
Introduces three new Prometheus histograms for fine-grained observability of KV cache residency behavior: vllm:kv_block_lifetime_seconds — total lifetime from allocation to free vllm:kv_block_idle_before_evict_seconds — idle duration before eviction vllm:kv_block_reuse_gap_seconds — time between consecutive reuses of the same block These metrics help operators analyze KV cache efficiency, reuse patterns, and eviction timing beyond simple utilization rates. Implementation uses monotonic timestamps for accuracy, 1% sampling for minimal overhead (~48 bytes/block), and is fully thread-safe with zero runtime cost when disabled. Two new runtime flags are introduced: --kv-cache-metrics – enable KV cache residency metrics --kv-cache-metrics-sample – control sampling ratio (default: 0.01) Signed-off-by:Shivam <shivamprasad91@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 29 Nov, 2025 2 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Tsukasa OI authored
Signed-off-by:Tsukasa OI <floss_llm@irq.a4lg.com>
-