"vllm/executor/ray_xpu_executor.py" did not exist on "479d69fad0538f04cb22bf13e76ff91cfeb8a4e5"
Fix KV sharing fast prefill with cudagraph enabled (#28537)
Signed-off-by:Yong Hoon Shin <yhshin@meta.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
Showing
Please register or sign in to comment