vllm: handle max_length better and substitute Collator (#1241)
* copies max_length from huggingface
* handle max_length properly
* get tokens from inputs
* substitute Collator for Reorderer
* `batch=auto` if using data_parallel
* nit
* cleanup
* update code comments
* `ray.shutdown()` after calling method if data_parallel_size > 1
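As a rough illustration of the reordering idea behind the Collator/Reorderer item above, the sketch below sorts requests by token length, batches them, and then restores the original order of the results. The names `batched_by_length` and `restore_order` are hypothetical and are not the harness's actual Collator API; the "model call" is a stand-in.

```python
from typing import Callable, Iterator, List, Sequence, Tuple, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def batched_by_length(
    items: Sequence[T], key: Callable[[T], int], batch_size: int
) -> Iterator[List[Tuple[int, T]]]:
    """Yield batches of (original_index, item), longest items first."""
    order = sorted(range(len(items)), key=lambda i: -key(items[i]))
    for start in range(0, len(order), batch_size):
        yield [(i, items[i]) for i in order[start : start + batch_size]]


def restore_order(indexed_results: List[Tuple[int, R]]) -> List[R]:
    """Put results back in the order the requests originally arrived in."""
    return [r for _, r in sorted(indexed_results, key=lambda pair: pair[0])]


# Hypothetical token-id lists standing in for tokenized requests.
requests = [[1, 2], [3, 4, 5, 6], [7], [8, 9, 10]]
results = []
for batch in batched_by_length(requests, key=len, batch_size=2):
    # Stand-in for a model call: the "output" is just each request's length.
    results.extend((i, len(tokens)) for i, tokens in batch)

ordered = restore_order(results)
```

Processing the longest requests first is a common choice because any out-of-memory failure then tends to show up in the first batch rather than late in a run.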
---------
Co-authored-by: Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>