[Bug fix][Core] assert num_new_tokens == 1 fails when SamplingParams.n is not 1 and max_tokens is large & Add tests for preemption (#4451)