Commit 7cfea0df (unverified), authored by QiliangCui, committed by GitHub

[TPU][Test] Rollback PR-21550. (#21619)


Signed-off-by: Qiliang Cui <derrhein@gmail.com>
parent 5ac3168e
@@ -59,7 +59,7 @@ def test_basic(
 # actually test chunked prompt
 max_num_batched_tokens=1024,
 max_model_len=8192,
-gpu_memory_utilization=0.95,
+gpu_memory_utilization=0.7,
 max_num_seqs=max_num_seqs,
 tensor_parallel_size=tensor_parallel_size) as vllm_model:
 vllm_outputs = vllm_model.generate_greedy(example_prompts,
...