fix: Use min of max tokens or context length (#1322)
This PR modifies the mistralrs engine to ensure that the maximum output token length never exceeds the context length provided.
Showing
Please register or sign in to comment
This PR modifies the mistralrs engine to ensure that the maximum output token length never exceeds the context length provided.