- 04 Dec, 2025 2 commits
-
-
TimWang authored
Signed-off-by:Tim <tim.wang03@sap.com>
-
CYJiang authored
In Prometheus Counters always expose their actual numeric value with a metric name that ends in _total. We should document the base name, as this what appears in the get_metrics() API. Signed-off-by:CYJiang <86391540+googs1025@users.noreply.github.com>
-
- 01 Dec, 2025 1 commit
-
-
shivampr authored
Introduces three new Prometheus histograms for fine-grained observability of KV cache residency behavior: vllm:kv_block_lifetime_seconds — total lifetime from allocation to free vllm:kv_block_idle_before_evict_seconds — idle duration before eviction vllm:kv_block_reuse_gap_seconds — time between consecutive reuses of the same block These metrics help operators analyze KV cache efficiency, reuse patterns, and eviction timing beyond simple utilization rates. Implementation uses monotonic timestamps for accuracy, 1% sampling for minimal overhead (~48 bytes/block), and is fully thread-safe with zero runtime cost when disabled. Two new runtime flags are introduced: --kv-cache-metrics – enable KV cache residency metrics --kv-cache-metrics-sample – control sampling ratio (default: 0.01) Signed-off-by:Shivam <shivamprasad91@gmail.com>
-
- 24 Oct, 2025 1 commit
-
-
Lifans authored
Signed-off-by:Lifan Shen <lifans@meta.com>
-
- 22 Oct, 2025 1 commit
-
-
Mark McLoughlin authored
Signed-off-by:
Simon Mo <simon.mo@hey.com> Signed-off-by:
Mark McLoughlin <markmc@redhat.com> Signed-off-by:
atalhens <sneh.lata@nutanix.com> Co-authored-by:
Simon Mo <simon.mo@hey.com> Co-authored-by:
atalhens <sneh.lata@nutanix.com>
-
- 17 Oct, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 14 Oct, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 28 Aug, 2025 1 commit
-
-
Didier Durand authored
Signed-off-by:Didier Durand <durand.didier@gmail.com>
-
- 25 Aug, 2025 1 commit
-
-
Didier Durand authored
Signed-off-by:
Didier Durand <durand.didier@gmail.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 08 Aug, 2025 1 commit
-
-
Guy Stone authored
Signed-off-by:
Guy Stone <guys@spotify.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 30 Jul, 2025 2 commits
-
-
Chen Zhang authored
Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 29 Jul, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 23 Jul, 2025 1 commit
-
-
Michael Yao authored
Signed-off-by:windsonsea <haifeng.yao@daocloud.io>
-
- 21 Jul, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 19 Jul, 2025 1 commit
-
-
Jiayi Yan authored
-
- 26 May, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 24 May, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 23 May, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 14 May, 2025 1 commit
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 29 Apr, 2025 1 commit
-
-
Russell Bryant authored
Signed-off-by:
Russell Bryant <rbryant@redhat.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 17 Apr, 2025 2 commits
-
-
Mark McLoughlin authored
-
Insu Kim authored
[Doc] Changed explanation of generation_tokens_total and prompt_tokens_total counter type metrics to avoid confusion (#16784) Signed-off-by:insukim1994 <insu.kim@moreh.io>
-
- 14 Mar, 2025 1 commit
-
-
yasu52 authored
Signed-off-by:yasu52 <tsuguro4649@gmail.com>
-
- 10 Mar, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 04 Mar, 2025 1 commit
-
-
Mark McLoughlin authored
Signed-off-by:
Mark McLoughlin <markmc@redhat.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Cody Yu <hao.yu.cody@gmail.com>
-