chore: Remove embedded Python vllm and sglang engines (#966)
vllm and sglang are now the sub-process engines from #954. Also updated the docs on running vllm and sglang multi-GPU (tensor parallel) and multi-node (pipeline parallel).