- 03 Dec, 2025 1 commit
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
- 02 Dec, 2025 2 commits
-
-
Sage Moore authored
Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 01 Dec, 2025 2 commits
-
-
shivampr authored
Introduces three new Prometheus histograms for fine-grained observability of KV cache residency behavior: vllm:kv_block_lifetime_seconds — total lifetime from allocation to free vllm:kv_block_idle_before_evict_seconds — idle duration before eviction vllm:kv_block_reuse_gap_seconds — time between consecutive reuses of the same block These metrics help operators analyze KV cache efficiency, reuse patterns, and eviction timing beyond simple utilization rates. Implementation uses monotonic timestamps for accuracy, 1% sampling for minimal overhead (~48 bytes/block), and is fully thread-safe with zero runtime cost when disabled. Two new runtime flags are introduced: --kv-cache-metrics – enable KV cache residency metrics --kv-cache-metrics-sample – control sampling ratio (default: 0.01) Signed-off-by:Shivam <shivamprasad91@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 29 Nov, 2025 1 commit
-
-
Tsukasa OI authored
Signed-off-by:Tsukasa OI <floss_llm@irq.a4lg.com>
-
- 28 Nov, 2025 4 commits
-
-
Yanan Cao authored
Signed-off-by:
Yanan Cao <gmagogsfm@gmail.com> Co-authored-by:
Claude <noreply@anthropic.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
wang.yuqi authored
Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by:
wang.yuqi <noooop@126.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 27 Nov, 2025 1 commit
-
-
Morrison Turnansky authored
Signed-off-by:
morrison-turnansky <mturnans@redhat.com> Signed-off-by:
adabeyta <aabeyta@redhat.com> Signed-off-by:
Morrison Turnansky <mturnans@redhat.com> Co-authored-by:
adabeyta <aabeyta@redhat.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 25 Nov, 2025 8 commits
-
-
Harry Mellor authored
Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Yifan Qiao authored
Signed-off-by:
Yifan Qiao <yifanqiao@berkeley.edu> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
Injae Ryou authored
Signed-off-by:
Injae Ryou <injaeryou@gmail.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
wang.yuqi authored
Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by:
wang.yuqi <noooop@126.com> Signed-off-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
zhrrr authored
Signed-off-by:zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 22 Nov, 2025 2 commits
-
-
Fadi Arafeh authored
Signed-off-by:Fadi Arafeh <fadi.arafeh@arm.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 21 Nov, 2025 3 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
- 20 Nov, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 19 Nov, 2025 2 commits
-
-
Qiu authored
Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by:
FENP <yuanyongjie.yyj@antgroup.com> Signed-off-by:
LookAround <lixushi@huawei.com> Signed-off-by:
Jingchun Gao <gaojingchun1@huawei.com> Signed-off-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
FENP <yuanyongjie.yyj@antgroup.com> Co-authored-by:
LookAround <lixushi@huawei.com> Co-authored-by:
Jingchun Gao <gaojingchun1@huawei.com> Co-authored-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
Jingchun Gao <63247409+gjc0824@users.noreply.github.com>
-
Didier Durand authored
Signed-off-by:Didier Durand <durand.didier@gmail.com>
-
- 18 Nov, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 16 Nov, 2025 1 commit
-
-
Lucia Fang authored
Signed-off-by:
Lu Fang <fanglu@fb.com> Signed-off-by:
github-actions[bot] <github-actions[bot]@users.noreply.github.com> Signed-off-by:
Lucia Fang <fanglu@fb.com> Signed-off-by:
Lucia Fang <116399278+luccafong@users.noreply.github.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
- 14 Nov, 2025 2 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:
NickLucche <nlucches@redhat.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 13 Nov, 2025 2 commits
-
-
elvischenv authored
Signed-off-by:
elvischenv <219235043+elvischenv@users.noreply.github.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 12 Nov, 2025 2 commits
-
-
Chenguang Zheng authored
Signed-off-by:
n00909098 <nguyen.kha.long@huawei.com> Signed-off-by:
knlnguyen1802 <knlnguyen1802@gmail.com> Signed-off-by:
herotai214 <herotai214@gmail.com> Signed-off-by:
Khuong Le <khuong.le.manh@huawei.com> Signed-off-by:
Khuong Le <lemanhkhuong2611@gmail.com> Co-authored-by:
n00909098 <nguyen.kha.long@huawei.com> Co-authored-by:
knlnguyen1802 <knlnguyen1802@gmail.com> Co-authored-by:
herotai214 <herotai214@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Khuong Le <khuong.le.manh@huawei.com> Co-authored-by:
Khuong Le <lemanhkhuong2611@gmail.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
- 11 Nov, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 08 Nov, 2025 2 commits
-
-
zhangsicheng5 authored
Signed-off-by:
zhangsicheng5 <zhangsicheng5@huawei.com> Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by:
Qiu <qiuchunshuo@huawei.com> Co-authored-by:
QiuChunshuo <qiuchunshuo@huawei.com>
-
Xiaohong (Sean) Chen authored
Signed-off-by:
Sean Chen <xiaohong_chen1991@hotmail.com> Signed-off-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Danielle Robinson <dcmaddix@gmail.com> Co-authored-by:
Haipeng Li <li2haipeng@gmail.com> Co-authored-by:
li2haipeng <44383182+li2haipeng@users.noreply.github.com>
-
- 07 Nov, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 06 Nov, 2025 1 commit
-
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-