OpenDAS / vllm_cscc
Commit 03025c02
Authored Nov 14, 2024 by Isotr0py; committed by GitHub, Nov 14, 2024
[CI/Build] Fix CPU CI online inference timeout (#10314)
Signed-off-by: Isotr0py <2037008807@qq.com>
Parent: 29f3ef26
Showing 1 changed file with 2 additions and 2 deletions
.buildkite/run-cpu-test.sh
@@ -61,7 +61,7 @@ function cpu_tests() {
   docker exec cpu-test bash -c "
     set -e
     export VLLM_CPU_KVCACHE_SPACE=10
-    export VLLM_CPU_OMP_THREADS_BIND=$CORE_RANGE
+    export VLLM_CPU_OMP_THREADS_BIND=$1
     python3 -m vllm.entrypoints.openai.api_server --model facebook/opt-125m --dtype half &
     timeout 600 bash -c 'until curl localhost:8000/v1/models; do sleep 1; done' || exit 1
     python3 benchmarks/benchmark_serving.py \
@@ -75,4 +75,4 @@ function cpu_tests() {
 # All of CPU tests are expected to be finished less than 25 mins.
 export -f cpu_tests
-timeout 25m bash -c "cpu_tests"
+timeout 25m bash -c "cpu_tests $CORE_RANGE"
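
The commit does not say why the value is now passed as an argument. One likely reading, assuming CORE_RANGE is set but not exported in the surrounding script (that part of the file is not shown here), is that `export -f` makes only the function visible to the child `bash -c`, so the variable expands to nothing there. The sketch below illustrates that behavior with hypothetical names (`show_binding`, `CORE_RANGE_DEMO`); it is not taken from the repository.

```shell
#!/usr/bin/env bash
# Hypothetical stand-in for cpu_tests: it only reports what it can see.
show_binding() {
  echo "var=[${CORE_RANGE_DEMO:-}] arg=[${1:-}]"
}
export -f show_binding

# Assumption: the variable is assigned but NOT exported,
# so child processes do not inherit it.
CORE_RANGE_DEMO="0-3"

# Old pattern: the child bash gets the exported function,
# but neither the variable nor any argument.
without_arg=$(bash -c "show_binding")

# New pattern: the parent expands the variable inside the double
# quotes, so the child receives the value as positional argument $1.
with_arg=$(bash -c "show_binding $CORE_RANGE_DEMO")

echo "$without_arg"
echo "$with_arg"
```

Run under bash, the first line should show both fields empty and the second should show the range only in `arg`, which matches the switch from `$CORE_RANGE` to `$1` inside the function.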