Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
72a8639b
Unverified
Commit
72a8639b
authored
Mar 18, 2025
by
Alexander Matveev
Committed by
GitHub
Mar 18, 2025
Browse files
[V1] TPU - CI/CD use smaller model (#15054)
Signed-off-by:
Alexander Matveev
<
amatveev@redhat.com
>
parent
99abb8b6
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
10 additions
and
9 deletions
+10
-9
.buildkite/run-tpu-v1-test.sh
.buildkite/run-tpu-v1-test.sh
+7
-7
tests/v1/tpu/test_basic.py
tests/v1/tpu/test_basic.py
+3
-2
No files found.
.buildkite/run-tpu-v1-test.sh
View file @
72a8639b
...
@@ -20,17 +20,17 @@ docker run --privileged --net host --shm-size=16G -it \
...
@@ -20,17 +20,17 @@ docker run --privileged --net host --shm-size=16G -it \
&& python3 -m pip install pytest
\
&& python3 -m pip install pytest
\
&& python3 -m pip install lm_eval[api]==0.4.4
\
&& python3 -m pip install lm_eval[api]==0.4.4
\
&& echo TEST_1
\
&& echo TEST_1
\
&& VLLM_USE_V1=1 pyt
est -v -s
/workspace/vllm/tests/
v1/
tpu/test_
basic
.py
\
&& VLLM_USE_V1=1 pyt
hon3
/workspace/vllm/tests/tpu/test_
compilation
.py
\
&& echo TEST_2
\
&& echo TEST_2
\
&& VLLM_USE_V1=1 pytest -v -s /workspace/vllm/tests/
entrypoints/llm/test_accuracy.py::test_lm_eval_accuracy_v1_engine
\
&& VLLM_USE_V1=1 pytest -v -s /workspace/vllm/tests/
v1/tpu/test_basic.py
\
&& echo TEST_3
\
&& echo TEST_3
\
&& VLLM_USE_V1=1 pytest -
s
-
v
/workspace/vllm/tests/
tpu/test_quantization_accuracy.py
\
&& VLLM_USE_V1=1 pytest -
v
-
s
/workspace/vllm/tests/
entrypoints/llm/test_accuracy.py::test_lm_eval_accuracy_v1_engine
\
&& echo TEST_4
\
&& echo TEST_4
\
&& VLLM_USE_V1=1 pyt
hon3
/workspace/vllm/
examples/offline_inference/tpu
.py
"
\
&& VLLM_USE_V1=1 pyt
est -s -v
/workspace/vllm/
tests/tpu/test_quantization_accuracy
.py
\
&& echo TEST_5
\
&& echo TEST_5
\
&&
VLLM_USE_V1
=
1 python3 /workspace/vllm/
tests/tpu/test_compilation
.py
\
&& VLLM_USE_V1=1 python3 /workspace/vllm/
examples/offline_inference/tpu
.py
"
\
# TODO:
Fix these tests
# TODO:
This test fails because it uses RANDOM_SEED sampling
# && VLLM_USE_V1=1 pytest -v -s /workspace/vllm/tests/tpu/test_custom_dispatcher.py \
# && VLLM_USE_V1=1 pytest -v -s /workspace/vllm/tests/tpu/test_custom_dispatcher.py \
tests/v1/tpu/test_basic.py
View file @
72a8639b
...
@@ -15,9 +15,10 @@ if TYPE_CHECKING:
...
@@ -15,9 +15,10 @@ if TYPE_CHECKING:
from
tests.conftest
import
VllmRunner
from
tests.conftest
import
VllmRunner
MODELS
=
[
MODELS
=
[
"Qwen/Qwen2.5-1.5B-Instruct"
,
# TODO: Enable this models with v6e
# "Qwen/Qwen2-7B-Instruct",
# "Qwen/Qwen2-7B-Instruct",
"meta-llama/Llama-3.1-8B"
,
# "meta-llama/Llama-3.1-8B",
# TODO: Add models here as necessary
]
]
TENSOR_PARALLEL_SIZES
=
[
1
]
TENSOR_PARALLEL_SIZES
=
[
1
]
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment