[Doc] Fix misleading log during multi-modal profiling (#14955)

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

[Doc] Fix misleading log during multi-modal profiling (#14955)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
166a168b · Cyrus Leung · GitHub · 2bb0e1a7 · 166a168b
Unverified Commit 166a168b authored Mar 17, 2025 by Cyrus Leung Committed by GitHub Mar 17, 2025
Show whitespace changes
Inline Side-by-side

Showing with 3 additions and 1 deletion

vllm/multimodal/profiling.py vllm/multimodal/profiling.py +3 -1

No files found.
--- a/vllm/multimodal/profiling.py
+++ b/vllm/multimodal/profiling.py
@@ -218,8 +218,10 @@ class MultiModalProfiler(Generic[_I]):
        # V0 does not support chunked prefill.
        if total_len > seq_len and not envs.VLLM_USE_V1:
+            # `max_num_batched_tokens` is defined by `SchedulerConfig`
            logger.warning(
-                "The context length (%d) of the model is too short "
+                "The sequence length used for profiling ("
+                "max_num_batched_tokens / max_num_seqs = %d) is too short "
                "to hold the multi-modal embeddings in the worst case "
                "(%d tokens in total, out of which %s are reserved for "
                "multi-modal embeddings). This may cause certain "