[Bugfix][Async][Connector] avoid vllm-side double free during async scheduling...
[Bugfix][Async][Connector] avoid vllm-side double free during async scheduling + request abort + async KV cache transfer (#33377)
Signed-off-by:
KuntaiDu <kuntai@uchicago.edu>
Showing
Please register or sign in to comment