[Fix] Add Spacing when Requesting Output Token > max_model_len (#40324)

Signed-off-by: San-Nguyen <san.nguyen@ibm.com>

[Fix] Add Spacing when Requesting Output Token > max_model_len (#40324)
Signed-off-by: San-Nguyen <san.nguyen@ibm.com>
e729cc82 · San-Nguyen · GitHub · ec7aafc0 · e729cc82
Unverified Commit e729cc82 authored Apr 20, 2026 by San-Nguyen Committed by GitHub Apr 20, 2026
Hide whitespace changes
Inline Side-by-side

Showing with 1 addition and 1 deletion

vllm/renderers/params.py vllm/renderers/params.py +1 -1

No files found.
--- a/vllm/renderers/params.py
+++ b/vllm/renderers/params.py
@@ -204,7 +204,7 @@ class TokenizeParams:
            and max_output_tokens > max_total_tokens
        ):
            raise VLLMValidationError(
-                f"{self.max_output_tokens_param}={max_output_tokens}"
+                f"{self.max_output_tokens_param}={max_output_tokens} "
                f"cannot be greater than "
                f"{self.max_total_tokens_param}={max_total_tokens=}. "
                f"Please request fewer output tokens.",