[CI/Build][Bugfix][Doc][ROCm] CI fix and doc update after ROCm 6.2 upgrade (#8777)

1c046447 · Hongxia Yang · GitHub · 8fae5ed7 · 1c046447 · 1c046447
Unverified Commit 1c046447 authored Sep 25, 2024 by Hongxia Yang Committed by GitHub Sep 25, 2024
Showing with 16 additions and 3 deletions

.buildkite/test-pipeline.yaml .buildkite/test-pipeline.yaml +4 -1

Dockerfile.rocm Dockerfile.rocm +1 -1

docs/source/getting_started/amd-installation.rst docs/source/getting_started/amd-installation.rst +11 -1

No files found.
--- a/.buildkite/test-pipeline.yaml
+++ b/.buildkite/test-pipeline.yaml
@@ -90,8 +90,11 @@ steps:
  commands:
  - pip install -e ./plugins/vllm_add_dummy_model
  - pip install git+https://github.com/EleutherAI/lm-evaluation-harness.git@a4987bba6e9e9b3f22bd3a6c1ecf0abd04fd5622#egg=lm_eval[api]
-  - pytest -v -s entrypoints/llm --ignore=entrypoints/llm/test_lazy_outlines.py
+  - pytest -v -s entrypoints/llm --ignore=entrypoints/llm/test_lazy_outlines.py --ignore=entrypoints/llm/test_generate.py --ignore=entrypoints/llm/test_generate_multiple_loras.py --ignore=entrypoints/llm/test_guided_generate.py
  - pytest -v -s entrypoints/llm/test_lazy_outlines.py # it needs a clean process
+  - pytest -v -s entrypoints/llm/test_generate.py # it needs a clean process
+  - pytest -v -s entrypoints/llm/test_generate_multiple_loras.py # it needs a clean process
+  - pytest -v -s entrypoints/llm/test_guided_generate.py # it needs a clean process
  - pytest -v -s entrypoints/openai
  - pytest -v -s entrypoints/test_chat_utils.py
  - pytest -v -s entrypoints/offline_mode # Needs to avoid interference with other tests

--- a/Dockerfile.rocm
+++ b/Dockerfile.rocm
@@ -120,7 +120,7 @@ COPY . .
 # Package upgrades for useful functionality or to avoid dependency issues
 RUN --mount=type=cache,target=/root/.cache/pip \
-    python3 -m pip install --upgrade numba scipy huggingface-hub[cli]
+    python3 -m pip install --upgrade numba scipy huggingface-hub[cli] pytest-shard
 # Workaround for ray >= 2.10.0

--- a/docs/source/getting_started/amd-installation.rst
+++ b/docs/source/getting_started/amd-installation.rst
@@ -28,6 +28,16 @@ Option 1: Build from source with docker (recommended)
 You can build and install vLLM from source.
 First, build a docker image from `Dockerfile.rocm <https://github.com/vllm-project/vllm/blob/main/Dockerfile.rocm>`_ and launch a docker container from the image.
+It is important that the user kicks off the docker build using buildkit. Either the user put DOCKER_BUILDKIT=1 as environment variable when calling docker build command, or the user needs to setup buildkit in the docker daemon configuration /etc/docker/daemon.json as follows and restart the daemon:
+.. code-block:: console
+    {
+        "features": {
+            "buildkit": true
+        }
+    }
 `Dockerfile.rocm <https://github.com/vllm-project/vllm/blob/main/Dockerfile.rocm>`_ uses ROCm 6.2 by default, but also supports ROCm 5.7, 6.0 and 6.1 in older vLLM branches.
 It provides flexibility to customize the build of docker image using the following arguments:
@@ -152,7 +162,7 @@ Note to get your gfx architecture, run `rocminfo |grep gfx`.
        $ python3 setup.py develop
-    This may take 5-10 minutes. Currently, `pip install .`` does not work for ROCm installation
+    This may take 5-10 minutes. Currently, :code:`pip install .` does not work for ROCm installation.
 .. tip::