Unverified Commit e729cc82 authored by San-Nguyen's avatar San-Nguyen Committed by GitHub
Browse files

[Fix] Add Spacing when Requesting Output Token > max_model_len (#40324)


Signed-off-by: default avatarSan-Nguyen <san.nguyen@ibm.com>
parent ec7aafc0
......@@ -204,7 +204,7 @@ class TokenizeParams:
and max_output_tokens > max_total_tokens
):
raise VLLMValidationError(
f"{self.max_output_tokens_param}={max_output_tokens}"
f"{self.max_output_tokens_param}={max_output_tokens} "
f"cannot be greater than "
f"{self.max_total_tokens_param}={max_total_tokens=}. "
f"Please request fewer output tokens.",
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment