Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
846197f5
Unverified
Commit
846197f5
authored
Sep 23, 2025
by
Wentao Ye
Committed by
GitHub
Sep 23, 2025
Browse files
[Log] Optimize kv cache memory log from Bytes to GiB (#25204)
Signed-off-by:
yewentao256
<
zhyanwentao@126.com
>
parent
2357480b
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
6 additions
and
4 deletions
+6
-4
vllm/v1/worker/gpu_worker.py
vllm/v1/worker/gpu_worker.py
+6
-4
No files found.
vllm/v1/worker/gpu_worker.py
View file @
846197f5
...
...
@@ -383,11 +383,13 @@ class Worker(WorkerBase):
f
"for non-torch memory, and
{
GiB
(
cuda_graph_memory_bytes
)
}
"
f
"GiB for CUDAGraph memory. Replace gpu_memory_utilization "
f
"config with `--kv-cache-memory="
f
"
{
kv_cache_memory_bytes_to_requested_limit
}
` to fit into "
f
"requested memory, or `--kv-cache-memory="
f
"
{
kv_cache_memory_bytes_to_gpu_limit
}
` to fully "
f
"
{
kv_cache_memory_bytes_to_requested_limit
}
` "
f
"(
{
GiB
(
kv_cache_memory_bytes_to_requested_limit
)
}
GiB) to fit "
f
"into requested memory, or `--kv-cache-memory="
f
"
{
kv_cache_memory_bytes_to_gpu_limit
}
` "
f
"(
{
GiB
(
kv_cache_memory_bytes_to_gpu_limit
)
}
GiB) to fully "
f
"utilize gpu memory. Current kv cache memory in use is "
f
"
{
int
(
self
.
available_kv_cache_memory_bytes
)
}
bytes
."
)
f
"
{
GiB
(
self
.
available_kv_cache_memory_bytes
)
}
GiB
."
)
logger
.
debug
(
msg
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment