feat: Support a dynamic default max_tokens for TensorRT-LLM backend (#5152)
Signed-off-by:Steven Murr <stevemurr@gmail.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Showing
Please register or sign in to comment
Signed-off-by:Steven Murr <stevemurr@gmail.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>