Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
943e72ca
Unverified
Commit
943e72ca
authored
May 20, 2024
by
Alexei-V-Ivanov-AMD
Committed by
GitHub
May 20, 2024
Browse files
[Build/CI] Enabling AMD Entrypoints Test (#4834)
Co-authored-by:
Alexey Kondratiev
<
alexey.kondratiev@amd.com
>
parent
546a97ef
Changes
4
Show whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
16 additions
and
6 deletions
+16
-6
.buildkite/test-pipeline.yaml
.buildkite/test-pipeline.yaml
+2
-1
Dockerfile.rocm
Dockerfile.rocm
+6
-2
requirements-rocm.txt
requirements-rocm.txt
+2
-1
tests/spec_decode/e2e/conftest.py
tests/spec_decode/e2e/conftest.py
+6
-2
No files found.
.buildkite/test-pipeline.yaml
View file @
943e72ca
...
...
@@ -60,7 +60,8 @@ steps:
command
:
pytest -v -s engine tokenization test_sequence.py test_config.py test_logger.py
-
label
:
Entrypoints Test
#mirror_hardwares: [amd]
mirror_hardwares
:
[
amd
]
commands
:
# these tests have to be separated, because each one will allocate all posible GPU memory
-
pytest -v -s entrypoints --ignore=entrypoints/test_server_oot_registration.py
...
...
Dockerfile.rocm
View file @
943e72ca
...
...
@@ -92,19 +92,23 @@ RUN if [ "$BUILD_TRITON" = "1" ]; then \
WORKDIR /vllm-workspace
COPY . .
#RUN python3 -m pip install pynvml # to be removed eventually
RUN python3 -m pip install --upgrade pip numba
# make sure punica kernels are built (for LoRA)
ENV VLLM_INSTALL_PUNICA_KERNELS=1
# Workaround for ray >= 2.10.0
ENV RAY_EXPERIMENTAL_NOSET_ROCR_VISIBLE_DEVICES=1
ENV VLLM_NCCL_SO_PATH=/opt/rocm/lib/librccl.so
RUN --mount=type=cache,target=/root/.cache/pip \
pip install -U -r requirements-rocm.txt \
&& patch /opt/rocm/include/hip/amd_detail/amd_hip_bf16.h ./rocm_patch/rocm_bf16.patch \
&& python3 setup.py install \
&& cp build/lib.linux-x86_64-cpython-39/vllm/_C.cpython-39-x86_64-linux-gnu.so vllm/ \
&& cp build/lib.linux-x86_64-cpython-39/vllm/_punica_C.cpython-39-x86_64-linux-gnu.so vllm/ \
&& cd ..
RUN python3 -m pip install --upgrade pip
RUN python3 -m pip install --no-cache-dir ray[all]==2.9.3
CMD ["/bin/bash"]
requirements-rocm.txt
View file @
943e72ca
...
...
@@ -2,4 +2,5 @@
-r requirements-common.txt
# Dependencies for AMD GPUs
ray == 2.9.3
ray >= 2.10.0
pytest-asyncio
tests/spec_decode/e2e/conftest.py
View file @
943e72ca
...
...
@@ -6,7 +6,11 @@ from typing import Dict, List, Optional, Tuple, Union
import
pytest
import
ray
import
torch
from
pynvml
import
(
nvmlDeviceGetHandleByIndex
,
nvmlDeviceGetMemoryInfo
,
from
vllm.utils
import
is_hip
if
(
not
is_hip
()):
from
pynvml
import
(
nvmlDeviceGetHandleByIndex
,
nvmlDeviceGetMemoryInfo
,
nvmlInit
)
from
vllm
import
LLM
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment