Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
dd8a29da
Unverified
Commit
dd8a29da
authored
Mar 26, 2025
by
Alexei-V-Ivanov-AMD
Committed by
GitHub
Mar 26, 2025
Browse files
Applying some fixes for K8s agents in CI (#15493)
Signed-off-by:
Alexei V. Ivanov
<
alexei.ivanov@amd.com
>
parent
27df5199
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
8 additions
and
5 deletions
+8
-5
.buildkite/run-amd-test.sh
.buildkite/run-amd-test.sh
+6
-4
Dockerfile.rocm
Dockerfile.rocm
+2
-1
No files found.
.buildkite/run-amd-test.sh
View file @
dd8a29da
...
@@ -134,9 +134,10 @@ if [[ $commands == *"--shard-id="* ]]; then
...
@@ -134,9 +134,10 @@ if [[ $commands == *"--shard-id="* ]]; then
# assign shard-id for each shard
# assign shard-id for each shard
commands_gpu
=
${
commands
//
"--shard-id= "
/
"--shard-id=
${
GPU
}
"
}
commands_gpu
=
${
commands
//
"--shard-id= "
/
"--shard-id=
${
GPU
}
"
}
echo
"Shard
${
GPU
}
commands:
$commands_gpu
"
echo
"Shard
${
GPU
}
commands:
$commands_gpu
"
echo
"Render devices:
$BUILDKITE_AGENT_META_DATA_RENDER_DEVICES
"
docker run
\
docker run
\
--device
/dev/kfd
--device
/dev/dri
\
--device
/dev/kfd
$BUILDKITE_AGENT_META_DATA_RENDER_DEVICES
\
--network
host
\
--network
=
host
\
--shm-size
=
16gb
\
--shm-size
=
16gb
\
--rm
\
--rm
\
-e
HIP_VISIBLE_DEVICES
=
"
${
GPU
}
"
\
-e
HIP_VISIBLE_DEVICES
=
"
${
GPU
}
"
\
...
@@ -163,9 +164,10 @@ if [[ $commands == *"--shard-id="* ]]; then
...
@@ -163,9 +164,10 @@ if [[ $commands == *"--shard-id="* ]]; then
fi
fi
done
done
else
else
echo
"Render devices:
$BUILDKITE_AGENT_META_DATA_RENDER_DEVICES
"
docker run
\
docker run
\
--device
/dev/kfd
--device
/dev/dri
\
--device
/dev/kfd
$BUILDKITE_AGENT_META_DATA_RENDER_DEVICES
\
--network
host
\
--network
=
host
\
--shm-size
=
16gb
\
--shm-size
=
16gb
\
--rm
\
--rm
\
-e
HIP_VISIBLE_DEVICES
=
0
\
-e
HIP_VISIBLE_DEVICES
=
0
\
...
...
Dockerfile.rocm
View file @
dd8a29da
...
@@ -12,7 +12,8 @@ ENV PYTORCH_ROCM_ARCH=${ARG_PYTORCH_ROCM_ARCH:-${PYTORCH_ROCM_ARCH}}
...
@@ -12,7 +12,8 @@ ENV PYTORCH_ROCM_ARCH=${ARG_PYTORCH_ROCM_ARCH:-${PYTORCH_ROCM_ARCH}}
# Install some basic utilities
# Install some basic utilities
RUN apt-get update -q -y && apt-get install -q -y \
RUN apt-get update -q -y && apt-get install -q -y \
sqlite3 libsqlite3-dev libfmt-dev libmsgpack-dev libsuitesparse-dev
sqlite3 libsqlite3-dev libfmt-dev libmsgpack-dev libsuitesparse-dev \
apt-transport-https ca-certificates wget curl
# Remove sccache
# Remove sccache
RUN python3 -m pip install --upgrade pip && pip install setuptools_scm
RUN python3 -m pip install --upgrade pip && pip install setuptools_scm
RUN apt-get purge -y sccache; python3 -m pip uninstall -y sccache; rm -f "$(which sccache)"
RUN apt-get purge -y sccache; python3 -m pip uninstall -y sccache; rm -f "$(which sccache)"
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment