Explain where the engine args go when using Docker (#12041)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

Explain where the engine args go when using Docker (#12041)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
c9d6ff53 · Harry Mellor · GitHub · a2d2acb4 · c9d6ff53
Unverified Commit c9d6ff53 authored Jan 14, 2025 by Harry Mellor Committed by GitHub Jan 14, 2025
Show whitespace changes
Inline Side-by-side

Showing with 2 additions and 0 deletions

docs/source/deployment/docker.md docs/source/deployment/docker.md +2 -0

No files found.
--- a/docs/source/deployment/docker.md
+++ b/docs/source/deployment/docker.md
@@ -19,6 +19,8 @@ $ docker run --runtime nvidia --gpus all \
    --model mistralai/Mistral-7B-v0.1
 ```

+You can add any other <project:#engine-args> you need after the image tag (`vllm/vllm-openai:latest`).
+
 ```{note}
 You can either use the `ipc=host` flag or `--shm-size` flag to allow the
 container to access the host's shared memory. vLLM uses PyTorch, which uses shared