Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
norm
vllm
Commits
00efdc84
"src/graph/sampling/randomwalks/frequency_hashmap.hip" did not exist on "78e0dae69343ba47077f575584bc4f68a58d8a0c"
Unverified
Commit
00efdc84
authored
Jan 19, 2024
by
Simon Mo
Committed by
GitHub
Jan 19, 2024
Browse files
Add benchmark serving to CI (#2505)
parent
91a61da9
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
32 additions
and
4 deletions
+32
-4
.buildkite/run-benchmarks.sh
.buildkite/run-benchmarks.sh
+32
-4
No files found.
.buildkite/run-benchmarks.sh
View file @
00efdc84
# This script is run by buildkite to run the benchmarks and upload the results to buildkite
# This script is run by buildkite to run the benchmarks and upload the results to buildkite
set
-ex
set
-ex
set
-o
pipefail
# cd into parent directory of this file
# cd into parent directory of this file
cd
"
$(
dirname
"
${
BASH_SOURCE
[0]
}
"
)
/.."
cd
"
$(
dirname
"
${
BASH_SOURCE
[0]
}
"
)
/.."
(
wget
&&
curl
)
||
(
apt-get update
&&
apt-get
install
-y
wget curl
)
# run benchmarks and upload the result to buildkite
# run benchmarks and upload the result to buildkite
python3 benchmarks/benchmark_latency.py 2>&1 |
tee
benchmark_latency.txt
python3 benchmarks/benchmark_latency.py 2>&1 |
tee
benchmark_latency.txt
bench_latency_exit_code
=
$?
bench_latency_exit_code
=
$?
...
@@ -12,15 +15,36 @@ bench_latency_exit_code=$?
...
@@ -12,15 +15,36 @@ bench_latency_exit_code=$?
python3 benchmarks/benchmark_throughput.py
--input-len
256
--output-len
256 2>&1 |
tee
benchmark_throughput.txt
python3 benchmarks/benchmark_throughput.py
--input-len
256
--output-len
256 2>&1 |
tee
benchmark_throughput.txt
bench_throughput_exit_code
=
$?
bench_throughput_exit_code
=
$?
python3
-m
vllm.entrypoints.openai.api_server
--model
meta-llama/Llama-2-7b-chat-hf &
server_pid
=
$!
wget https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/resolve/main/ShareGPT_V3_unfiltered_cleaned_split.json
# wait for server to start, timeout after 600 seconds
timeout
600 bash
-c
'until curl localhost:8000/v1/models; do sleep 1; done'
||
exit
1
python3 benchmarks/benchmark_serving.py
\
--dataset
./ShareGPT_V3_unfiltered_cleaned_split.json
\
--model
meta-llama/Llama-2-7b-chat-hf
\
--num-prompts
20
\
--endpoint
/v1/completions
\
--tokenizer
meta-llama/Llama-2-7b-chat-hf 2>&1 |
tee
benchmark_serving.txt
bench_serving_exit_code
=
$?
kill
$server_pid
# write the results into a markdown file
# write the results into a markdown file
echo
"### Latency Benchmarks"
>>
benchmark_results.md
echo
"### Latency Benchmarks"
>>
benchmark_results.md
sed
-n
'1p'
benchmark_latency.txt
>>
benchmark_results.md
sed
-n
'1p'
benchmark_latency.txt
>>
benchmark_results.md
# first line
echo
""
>>
benchmark_results.md
echo
""
>>
benchmark_results.md
sed
-n
'$p'
benchmark_latency.txt
>>
benchmark_results.md
sed
-n
'$p'
benchmark_latency.txt
>>
benchmark_results.md
# last line
echo
"### Throughput Benchmarks"
>>
benchmark_results.md
echo
"### Throughput Benchmarks"
>>
benchmark_results.md
sed
-n
'1p'
benchmark_throughput.txt
>>
benchmark_results.md
sed
-n
'1p'
benchmark_throughput.txt
>>
benchmark_results.md
# first line
echo
""
>>
benchmark_results.md
echo
""
>>
benchmark_results.md
sed
-n
'$p'
benchmark_throughput.txt
>>
benchmark_results.md
sed
-n
'$p'
benchmark_throughput.txt
>>
benchmark_results.md
# last line
echo
"### Serving Benchmarks"
>>
benchmark_results.md
sed
-n
'1p'
benchmark_serving.txt
>>
benchmark_results.md
# first line
echo
""
>>
benchmark_results.md
tail
-n
5 benchmark_serving.txt
>>
benchmark_results.md
# last 5 lines
# upload the results to buildkite
# upload the results to buildkite
/workspace/buildkite-agent annotate
--style
"info"
--context
"benchmark-results"
< benchmark_results.md
/workspace/buildkite-agent annotate
--style
"info"
--context
"benchmark-results"
< benchmark_results.md
...
@@ -33,3 +57,7 @@ fi
...
@@ -33,3 +57,7 @@ fi
if
[
$bench_throughput_exit_code
-ne
0
]
;
then
if
[
$bench_throughput_exit_code
-ne
0
]
;
then
exit
$bench_throughput_exit_code
exit
$bench_throughput_exit_code
fi
fi
if
[
$bench_serving_exit_code
-ne
0
]
;
then
exit
$bench_serving_exit_code
fi
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment