API: fix maxlen; vllm: prefix_token_id bug (#2262)
* max_length - 1 (generation always >= 1) * vllm: fix rolling prefix_token * nit: add comment * fixup! max_length should be handled for logliklihoods
Showing
Please register or sign in to comment