fix: reject prompts exceeding max_seq_len with HTTP 400 (#6635)
Signed-off-by:Yuewei Na <nv-yna@users.noreply.github.com> Co-authored-by:
Yuewei Na <nv-yna@users.noreply.github.com>
Showing
Please register or sign in to comment
Signed-off-by:Yuewei Na <nv-yna@users.noreply.github.com> Co-authored-by:
Yuewei Na <nv-yna@users.noreply.github.com>