change / sglang
Commit f52eda35 (Unverified)
Authored Jul 30, 2024 by Yineng Zhang
Committed by GitHub on Jul 30, 2024
misc: update e2e test benchmark config (#825)
Parent: b579ecf0
Showing 2 changed files with 8 additions and 7 deletions (+8 -7)
.github/workflows/pr-e2e-test.yml  +7 -7
python/sglang/bench_serving.py  +1 -0
.github/workflows/pr-e2e-test.yml @ f52eda35
name: PR E2E Test
on:
  push:
    branches: [ main ]
  pull_request:
    branches: [ main ]
    paths:
      - "python/sglang/*"
  workflow_dispatch:
jobs:
@@ -26,22 +26,22 @@ jobs:
       - name: Launch server and run benchmark
         run: |
-          python3 -m sglang.launch_server --model /home/lmzheng/zhyncs/Meta-Llama-3.1-8B-Instruct --port 8413 &
+          python3 -m sglang.launch_server --model /home/lmzheng/zhyncs/Meta-Llama-3.1-8B-Instruct --port 8413 --disable-radix-cache &
           echo "Waiting for server to start..."
-          for i in {1..60}; do
+          for i in {1..120}; do
             if curl -s http://127.0.0.1:8413/health; then
               echo "Server is up!"
               break
             fi
-            if [ $i -eq 60 ]; then
-              echo "Server failed to start within 60 seconds"
+            if [ $i -eq 120 ]; then
+              echo "Server failed to start within 120 seconds"
               exit 1
             fi
             sleep 1
           done
-          python3 -m sglang.bench_serving --backend sglang --port 8413
+          cd /home/lmzheng/zhyncs && python3 -m sglang.bench_serving --backend sglang --port 8413 --dataset-name random --num-prompts 3000 --random-input 256 --random-output 512
           echo "Stopping server..."
           kill -9 $(ps aux | grep sglang | grep Meta-Llama-3.1-8B-Instruct | grep -v grep | awk '{print $2}')
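The readiness loop in the workflow step above (poll `/health` once per second, give up after 120 tries) can also be sketched in Python. This is only an illustration and not part of the commit: `http_ok` and `wait_until_ready` are hypothetical helper names, and only the health URL and the 120-attempt, 1-second cadence come from the workflow. One difference to be aware of: `curl -s` exits successfully on any HTTP response, while this sketch treats only 2xx responses as ready.

```python
import time
import urllib.error
import urllib.request
from typing import Callable


def http_ok(url: str, timeout_s: float = 1.0) -> bool:
    """Return True if a GET to `url` answers with a 2xx status (hypothetical helper)."""
    try:
        with urllib.request.urlopen(url, timeout=timeout_s) as resp:
            return 200 <= resp.status < 300
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or a 4xx/5xx response: not ready yet.
        return False


def wait_until_ready(
    check: Callable[[], bool],
    attempts: int = 120,
    interval_s: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Call `check` up to `attempts` times, sleeping between failed tries.

    Mirrors the shell loop: the final failed attempt returns immediately
    instead of sleeping once more.
    """
    for attempt in range(1, attempts + 1):
        if check():
            return True
        if attempt < attempts:
            sleep(interval_s)
    return False


# Possible usage, assuming the server from the workflow step is starting up:
#   ready = wait_until_ready(lambda: http_ok("http://127.0.0.1:8413/health"))
#   raise SystemExit(0 if ready else 1)
```

Passing `check` and `sleep` in as callables keeps the retry logic testable without a running server.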
python/sglang/bench_serving.py @ f52eda35
# Adapted from https://github.com/vllm-project/vllm/blob/6366efc67b0aedd2c1721c14385370e50b297fb3/benchmarks/backend_request_func.py
# Adapted from https://github.com/vllm-project/vllm/blob/6366efc67b0aedd2c1721c14385370e50b297fb3/benchmarks/benchmark_serving.py
"""
Benchmark online serving.
...