[Refactor] [6/N] to simplify the vLLM openai chat_completion serving architecture (#32240)
Signed-off-by:
chaunceyjiang <chaunceyjiang@gmail.com>
Showing
Please register or sign in to comment
Signed-off-by:
chaunceyjiang <chaunceyjiang@gmail.com>