feat: migrate requests when planner shutdown decode engine (vllm) (#2280)
Signed-off-by:Hongkuan Zhou <tedzhouhk@gmail.com> Co-authored-by:
Jacky <18255193+kthui@users.noreply.github.com> Co-authored-by:
hhzhang16 <54051230+hhzhang16@users.noreply.github.com>
Showing
Please register or sign in to comment