[Refactor] [6/N] to simplify the vLLM openai chat_completion serving architecture (#32240)
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>