Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
9d2b4a70
Unverified
Commit
9d2b4a70
authored
Mar 14, 2025
by
Mark McLoughlin
Committed by
GitHub
Mar 15, 2025
Browse files
[V1][Metrics] Updated list of deprecated metrics in v0.8 (#14695)
Signed-off-by:
Mark McLoughlin
<
markmc@redhat.com
>
parent
0b0d6421
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
10 additions
and
1 deletion
+10
-1
docs/source/serving/metrics.md
docs/source/serving/metrics.md
+10
-1
No files found.
docs/source/serving/metrics.md
View file @
9d2b4a70
...
...
@@ -39,7 +39,16 @@ The following metrics are exposed:
The following metrics are deprecated and due to be removed in a future version:
-
*(No metrics are currently deprecated)*
-
`vllm:num_requests_swapped`
,
`vllm:cpu_cache_usage_perc`
, and
`vllm:cpu_prefix_cache_hit_rate`
because KV cache offloading is not
used in V1.
-
`vllm:gpu_prefix_cache_hit_rate`
is replaced by queries+hits
counters in V1.
-
`vllm:time_in_queue_requests`
because it duplicates
`vllm:request_queue_time_seconds`
.
-
`vllm:model_forward_time_milliseconds`
and
`vllm:model_execute_time_milliseconds`
because
prefill/decode/inference time metrics should be used instead.
Note: when metrics are deprecated in version
`X.Y`
, they are hidden in version
`X.Y+1`
but can be re-enabled using the
`--show-hidden-metrics-for-version=X.Y`
escape hatch,
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment