Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
a738dbb2
Unverified
Commit
a738dbb2
authored
Jun 23, 2025
by
QiliangCui
Committed by
GitHub
Jun 24, 2025
Browse files
Update test case parameter to have the throughput above 8.0 (#19994)
Signed-off-by:
Qiliang Cui
<
derrhein@gmail.com
>
parent
33d5e29b
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
2 deletions
+2
-2
.buildkite/scripts/tpu/config_v6e_1.env
.buildkite/scripts/tpu/config_v6e_1.env
+2
-2
No files found.
.buildkite/scripts/tpu/config_v6e_1.env
View file @
a738dbb2
...
...
@@ -4,8 +4,8 @@ CONTAINER_NAME=vllm-tpu
# vllm config
MODEL=meta-llama/Llama-3.1-8B-Instruct
MAX_NUM_SEQS=
512
MAX_NUM_BATCHED_TOKENS=
512
MAX_NUM_SEQS=
256
MAX_NUM_BATCHED_TOKENS=
1024
TENSOR_PARALLEL_SIZE=1
MAX_MODEL_LEN=2048
DOWNLOAD_DIR=/mnt/disks/persist
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment