Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
1298a400
Unverified
Commit
1298a400
authored
Feb 03, 2025
by
youkaichao
Committed by
GitHub
Feb 03, 2025
Browse files
[ci/build] fix gh200 test (#12681)
Signed-off-by:
youkaichao
<
youkaichao@gmail.com
>
parent
ad4a9dc8
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
5 additions
and
5 deletions
+5
-5
.buildkite/check-wheel-size.py
.buildkite/check-wheel-size.py
+2
-2
.buildkite/run-gh200-test.sh
.buildkite/run-gh200-test.sh
+2
-2
Dockerfile
Dockerfile
+1
-1
No files found.
.buildkite/check-wheel-size.py
View file @
1298a400
...
@@ -4,11 +4,11 @@ import os
...
@@ -4,11 +4,11 @@ import os
import
sys
import
sys
import
zipfile
import
zipfile
# Read the VLLM_MAX_SIZE_MB environment variable, defaulting to
3
00 MiB
# Read the VLLM_MAX_SIZE_MB environment variable, defaulting to
4
00 MiB
# Note that we have 400 MiB quota, please use it wisely.
# Note that we have 400 MiB quota, please use it wisely.
# See https://github.com/pypi/support/issues/3792 .
# See https://github.com/pypi/support/issues/3792 .
# Please also sync the value with the one in Dockerfile.
# Please also sync the value with the one in Dockerfile.
VLLM_MAX_SIZE_MB
=
int
(
os
.
environ
.
get
(
'VLLM_MAX_SIZE_MB'
,
3
00
))
VLLM_MAX_SIZE_MB
=
int
(
os
.
environ
.
get
(
'VLLM_MAX_SIZE_MB'
,
4
00
))
def
print_top_10_largest_files
(
zip_file
):
def
print_top_10_largest_files
(
zip_file
):
...
...
.buildkite/run-gh200-test.sh
View file @
1298a400
...
@@ -23,6 +23,6 @@ trap remove_docker_container EXIT
...
@@ -23,6 +23,6 @@ trap remove_docker_container EXIT
remove_docker_container
remove_docker_container
# Run the image and test offline inference
# Run the image and test offline inference
docker run
--name
gh200-test
--gpus
=
all
--entrypoint
=
""
gh200-test bash
-c
'
docker run
-e
HF_TOKEN
-v
/root/.cache/huggingface:/root/.cache/huggingface
--name
gh200-test
--gpus
=
all
--entrypoint
=
""
gh200-test bash
-c
'
python3 examples/offline_inference/
basic.py
python3 examples/offline_inference/
cli.py --model meta-llama/Llama-3.2-1B
'
'
Dockerfile
View file @
1298a400
...
@@ -127,7 +127,7 @@ RUN --mount=type=cache,target=/root/.cache/ccache \
...
@@ -127,7 +127,7 @@ RUN --mount=type=cache,target=/root/.cache/ccache \
# Check the size of the wheel if RUN_WHEEL_CHECK is true
# Check the size of the wheel if RUN_WHEEL_CHECK is true
COPY
.buildkite/check-wheel-size.py check-wheel-size.py
COPY
.buildkite/check-wheel-size.py check-wheel-size.py
# sync the default value with .buildkite/check-wheel-size.py
# sync the default value with .buildkite/check-wheel-size.py
ARG
VLLM_MAX_SIZE_MB=
3
00
ARG
VLLM_MAX_SIZE_MB=
4
00
ENV
VLLM_MAX_SIZE_MB=$VLLM_MAX_SIZE_MB
ENV
VLLM_MAX_SIZE_MB=$VLLM_MAX_SIZE_MB
ARG
RUN_WHEEL_CHECK=true
ARG
RUN_WHEEL_CHECK=true
RUN if
[
"
$RUN_WHEEL_CHECK
"
=
"true"
]
;
then
\
RUN if
[
"
$RUN_WHEEL_CHECK
"
=
"true"
]
;
then
\
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment