Unverified Commit 37c1e3c2 authored by Simon Mo's avatar Simon Mo Committed by GitHub
Browse files

Documentation about official docker image (#1709)

parent 06e9ebeb
...@@ -3,11 +3,25 @@ ...@@ -3,11 +3,25 @@
Deploying with Docker Deploying with Docker
============================ ============================
vLLM offers official docker image for deployment.
The image can be used to run OpenAI compatible server.
The image is available on Docker Hub as `vllm/vllm-openai <https://hub.docker.com/r/vllm/vllm-openai/tags>`_.
... code-block:: console
$ docker run --runtime nvidia --gpus all \
-v ~/.cache/huggingface:/root/.cache/huggingface \
-p 8000:8000 \
--env "HUGGING_FACE_HUB_TOKEN=<secret>" \
vllm/vllm-openai:latest \
--model mistralai/Mistral-7B-v0.1
You can build and run vLLM from source via the provided dockerfile. To build vLLM: You can build and run vLLM from source via the provided dockerfile. To build vLLM:
.. code-block:: console .. code-block:: console
$ DOCKER_BUILDKIT=1 docker build . --target vllm --tag vllm --build-arg max_jobs=8 $ DOCKER_BUILDKIT=1 docker build . --target vllm-openai --tag vllm/vllm-openai --build-arg max_jobs=8
To run vLLM: To run vLLM:
...@@ -17,5 +31,5 @@ To run vLLM: ...@@ -17,5 +31,5 @@ To run vLLM:
-v ~/.cache/huggingface:/root/.cache/huggingface \ -v ~/.cache/huggingface:/root/.cache/huggingface \
-p 8000:8000 \ -p 8000:8000 \
--env "HUGGING_FACE_HUB_TOKEN=<secret>" \ --env "HUGGING_FACE_HUB_TOKEN=<secret>" \
vllm <args...> vllm/vllm-openai <args...>
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment