Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
40d86ee4
Unverified
Commit
40d86ee4
authored
Jul 24, 2025
by
Chengji Yao
Committed by
GitHub
Jul 24, 2025
Browse files
[TPU][Bugfix] fix OOM issue in CI test (#21550)
Signed-off-by:
Chengji Yao
<
chengjiyao@google.com
>
parent
85d051f0
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
1 deletion
+1
-1
tests/v1/tpu/test_basic.py
tests/v1/tpu/test_basic.py
+1
-1
No files found.
tests/v1/tpu/test_basic.py
View file @
40d86ee4
...
@@ -59,7 +59,7 @@ def test_basic(
...
@@ -59,7 +59,7 @@ def test_basic(
# actually test chunked prompt
# actually test chunked prompt
max_num_batched_tokens
=
1024
,
max_num_batched_tokens
=
1024
,
max_model_len
=
8192
,
max_model_len
=
8192
,
gpu_memory_utilization
=
0.
7
,
gpu_memory_utilization
=
0.
95
,
max_num_seqs
=
max_num_seqs
,
max_num_seqs
=
max_num_seqs
,
tensor_parallel_size
=
tensor_parallel_size
)
as
vllm_model
:
tensor_parallel_size
=
tensor_parallel_size
)
as
vllm_model
:
vllm_outputs
=
vllm_model
.
generate_greedy
(
example_prompts
,
vllm_outputs
=
vllm_model
.
generate_greedy
(
example_prompts
,
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment