Unverified commit 4050d646, authored by youkaichao, committed by GitHub

[doc][misc] remove deprecated api server in doc (#6037)

parent d76084c1
@@ -19,7 +19,7 @@ To run multi-GPU serving, pass in the :code:`--tensor-parallel-size` argument wh
 .. code-block:: console
-    $ python -m vllm.entrypoints.api_server \
+    $ python -m vllm.entrypoints.openai.api_server \
     $     --model facebook/opt-13b \
     $     --tensor-parallel-size 4
...
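
The entry point the docs now reference, ``vllm.entrypoints.openai.api_server``, serves an OpenAI-compatible HTTP API. As a rough illustration of what a client call against it could look like, here is a minimal sketch. It assumes the server is reachable at ``http://localhost:8000`` and exposes a ``/v1/completions`` route; neither detail appears in this commit, and ``build_completion_request`` is a hypothetical helper introduced only for this example.

```python
import json
import urllib.request

# Assumption: the server started by the command in the diff listens here.
BASE_URL = "http://localhost:8000"


def build_completion_request(prompt, model="facebook/opt-13b", max_tokens=16):
    """Hypothetical helper: build (but do not send) a completion request."""
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        f"{BASE_URL}/v1/completions",  # assumed OpenAI-style route
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Building the request needs no running server; sending it does.
req = build_completion_request("San Francisco is a")
print(req.get_method(), req.full_url)
```

With a server actually running, the request could be sent with ``urllib.request.urlopen(req)`` and the JSON response decoded with ``json.load``. The model name in the payload should match the ``--model`` value passed on the command line.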