feat(mistralrs): Let the engine enforce max tokens (#134)
Previously we tokenized the output and counted tokens so we could stop when max tokens was reached. Now we let the mistral.rs engine enforce the limit, which saves the extra tokenization step. Also, dynamo-run now prints which engines are compiled in as part of its help message, and there are some minor lint fixes.
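To illustrate the difference between the two approaches, here is a minimal sketch. Every name in it (`count_tokens`, `generate_caller_enforced`, `Request`, `ToyEngine`) is hypothetical and is not taken from mistral.rs or dynamo-run, and it assumes for simplicity that each streamed chunk is exactly one token.

```rust
// Hypothetical sketch: none of these names come from mistral.rs or dynamo-run.

/// Stand-in tokenizer: treats each whitespace-separated word as one token.
fn count_tokens(text: &str) -> usize {
    text.split_whitespace().count()
}

/// Old approach: the caller re-tokenizes every streamed chunk and stops
/// on its own once the running count exceeds `max_tokens`.
fn generate_caller_enforced(chunks: &[&str], max_tokens: usize) -> Vec<String> {
    let mut used = 0;
    let mut out = Vec::new();
    for chunk in chunks {
        used += count_tokens(chunk); // extra tokenization on every chunk
        if used > max_tokens {
            break;
        }
        out.push(chunk.to_string());
    }
    out
}

/// New approach: the limit travels with the request and the engine stops itself.
struct Request {
    max_tokens: usize,
}

/// Toy engine that "generates" from a fixed list of one-token chunks.
struct ToyEngine<'a> {
    chunks: &'a [&'a str],
}

impl<'a> ToyEngine<'a> {
    /// The engine already knows how many tokens it has produced,
    /// so the caller does not need to tokenize anything again.
    fn generate(&self, req: &Request) -> Vec<String> {
        self.chunks
            .iter()
            .take(req.max_tokens)
            .map(|c| c.to_string())
            .collect()
    }
}
```

With the same input and limit, both paths yield the same output; the second simply avoids tokenizing the generated text a second time on the caller side.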