OpenDAS / vllm_cscc
Commit 59012df9 (Unverified)
Authored Oct 07, 2025 by Johnny Yang
Committed by GitHub, Oct 07, 2025
[TPU] update TPU benchmark threshold (#25713)
Signed-off-by: Johnny Yang <johnnyyang@google.com>
Parent: 3d1f6761
Showing 1 changed file with 1 addition and 1 deletion (+1, -1)
.buildkite/scripts/tpu/quantized_v6e_1.env
@@ -9,6 +9,6 @@ MAX_NUM_BATCHED_TOKENS=1024
TENSOR_PARALLEL_SIZE=1
MAX_MODEL_LEN=2048
DOWNLOAD_DIR=/mnt/disks/persist
-EXPECTED_THROUGHPUT=10.0
+EXPECTED_THROUGHPUT=8.7
INPUT_LEN=1800
OUTPUT_LEN=128
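
The diff does not show how the CI job consumes this file. The following is a minimal sketch, assuming the benchmark run compares a measured throughput against `EXPECTED_THROUGHPUT` and fails when the measurement falls below it. The helper names `parse_env` and `meets_threshold` and the measured value `9.2` are hypothetical and not taken from the repository.

```python
# Sketch only: parse_env and meets_threshold are hypothetical helpers,
# not functions from the vLLM repository.

ENV_TEXT = """\
TENSOR_PARALLEL_SIZE=1
MAX_MODEL_LEN=2048
DOWNLOAD_DIR=/mnt/disks/persist
EXPECTED_THROUGHPUT=8.7
INPUT_LEN=1800
OUTPUT_LEN=128
"""


def parse_env(text):
    """Parse KEY=VALUE lines into a dict, skipping blanks and comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


def meets_threshold(measured, env):
    """Return True when the measured throughput reaches the configured floor."""
    return measured >= float(env["EXPECTED_THROUGHPUT"])


env = parse_env(ENV_TEXT)
measured = 9.2  # hypothetical measurement, for illustration only
print("pass" if meets_threshold(measured, env) else "fail")
```

Under this assumption, a hypothetical run measuring 9.2 would fail against the previous threshold of 10.0 and pass against the new threshold of 8.7.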