Kuntai Du authored
This PR provides initial support for single-node disaggregated prefill in the 1P1D (one prefill instance, one decode instance) scenario.

Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
Co-authored-by: ApostaC <yihua98@uchicago.edu>
Co-authored-by: YaoJiayi <120040070@link.cuhk.edu.cn>
0590ec3f