Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
76144adf
Unverified
Commit
76144adf
authored
Aug 15, 2025
by
Eli Uriegas
Committed by
GitHub
Aug 15, 2025
Browse files
ci: Add CUDA + arm64 release builds (#21201)
Signed-off-by:
Eli Uriegas
<
eliuriegas@meta.com
>
parent
f5d412ba
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
18 additions
and
15 deletions
+18
-15
.buildkite/release-pipeline.yaml
.buildkite/release-pipeline.yaml
+16
-0
docker/Dockerfile
docker/Dockerfile
+2
-15
No files found.
.buildkite/release-pipeline.yaml
View file @
76144adf
steps
:
steps
:
# aarch64 + CUDA builds
-
label
:
"
Build
arm64
wheel
-
CUDA
12.8"
id
:
build-wheel-arm64-cuda-12-8
agents
:
queue
:
arm64_cpu_queue_postmerge
commands
:
# #NOTE: torch_cuda_arch_list is derived from upstream PyTorch build files here:
# https://github.com/pytorch/pytorch/blob/main/.ci/aarch64_linux/aarch64_ci_build.sh#L7
-
"
DOCKER_BUILDKIT=1
docker
build
--build-arg
max_jobs=16
--build-arg
USE_SCCACHE=1
--build-arg
GIT_REPO_CHECK=1
--build-arg
CUDA_VERSION=12.8.1
--build-arg
torch_cuda_arch_list='8.7
9.0
10.0+PTX'
--tag
vllm-ci:build-image
--target
build
--progress
plain
-f
docker/Dockerfile
."
-
"
mkdir
artifacts"
-
"
docker
run
--rm
-v
$(pwd)/artifacts:/artifacts_host
vllm-ci:build-image
bash
-c
'cp
-r
dist
/artifacts_host
&&
chmod
-R
a+rw
/artifacts_host'"
-
"
bash
.buildkite/scripts/upload-wheels.sh"
env
:
DOCKER_BUILDKIT
:
"
1"
# x86 + CUDA builds
-
label
:
"
Build
wheel
-
CUDA
12.8"
-
label
:
"
Build
wheel
-
CUDA
12.8"
id
:
build-wheel-cuda-12-8
id
:
build-wheel-cuda-12-8
agents
:
agents
:
...
...
docker/Dockerfile
View file @
76144adf
...
@@ -139,21 +139,6 @@ RUN ldconfig /usr/local/cuda-$(echo $CUDA_VERSION | cut -d. -f1,2)/compat/
...
@@ -139,21 +139,6 @@ RUN ldconfig /usr/local/cuda-$(echo $CUDA_VERSION | cut -d. -f1,2)/compat/
WORKDIR
/workspace
WORKDIR
/workspace
# install build and runtime dependencies
# install build and runtime dependencies
# arm64 (GH200) build follows the practice of "use existing pytorch" build,
# we need to install torch and torchvision from the nightly builds first,
# pytorch will not appear as a vLLM dependency in all of the following steps
# after this step
RUN
--mount
=
type
=
cache,target
=
/root/.cache/uv
\
if
[
"
$TARGETPLATFORM
"
=
"linux/arm64"
]
;
then
\
uv pip
install
--system
\
--index-url
${
PYTORCH_CUDA_NIGHTLY_INDEX_BASE_URL
}
/cu
$(
echo
$CUDA_VERSION
|
cut
-d
.
-f1
,2 |
tr
-d
'.'
)
\
"torch==2.8.0.dev20250318+cu128"
"torchvision==0.22.0.dev20250319"
;
\
uv pip
install
--system
\
--index-url
${
PYTORCH_CUDA_NIGHTLY_INDEX_BASE_URL
}
/cu
$(
echo
$CUDA_VERSION
|
cut
-d
.
-f1
,2 |
tr
-d
'.'
)
\
--pre
pytorch_triton
==
3.3.0+gitab727c40
;
\
fi
COPY
requirements/common.txt requirements/common.txt
COPY
requirements/common.txt requirements/common.txt
COPY
requirements/cuda.txt requirements/cuda.txt
COPY
requirements/cuda.txt requirements/cuda.txt
RUN
--mount
=
type
=
cache,target
=
/root/.cache/uv
\
RUN
--mount
=
type
=
cache,target
=
/root/.cache/uv
\
...
@@ -234,6 +219,8 @@ RUN --mount=type=cache,target=/root/.cache/uv \
...
@@ -234,6 +219,8 @@ RUN --mount=type=cache,target=/root/.cache/uv \
&&
sccache
--show-stats
;
\
&&
sccache
--show-stats
;
\
fi
fi
ARG
vllm_target_device="cuda"
ENV
VLLM_TARGET_DEVICE=${vllm_target_device}
ENV
CCACHE_DIR=/root/.cache/ccache
ENV
CCACHE_DIR=/root/.cache/ccache
RUN
--mount
=
type
=
cache,target
=
/root/.cache/ccache
\
RUN
--mount
=
type
=
cache,target
=
/root/.cache/ccache
\
--mount
=
type
=
cache,target
=
/root/.cache/uv
\
--mount
=
type
=
cache,target
=
/root/.cache/uv
\
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment