sglang
Commit c8423ca3 (Unverified)

ci: update timeout and retry (#1086)

Authored Aug 14, 2024 by Yineng Zhang
Committed by GitHub, Aug 14, 2024
Co-authored-by: Liangsheng Yin <hnyls2002@gmail.com>
Parent: e205527c
Changes: 5
Showing 5 changed files with 24 additions and 6 deletions (+24 -6)

.github/workflows/accuracy-test.yml        +3  -1
.github/workflows/e2e-test.yml             +5  -0
.github/workflows/moe-test.yml             +11 -4
.github/workflows/unit-test.yml            +4  -0
test/srt/test_moe_serving_throughput.py    +1  -1

.github/workflows/accuracy-test.yml
...
@@ -6,11 +6,13 @@ on:
     paths:
       - "python/sglang/**"
       - "test/**"
+      - ".github/workflows/accuracy-test.yml"
   pull_request:
     branches: [ main ]
     paths:
       - "python/sglang/**"
       - "test/**"
+      - ".github/workflows/accuracy-test.yml"
   workflow_dispatch:

 concurrency:
...
@@ -43,4 +45,4 @@ jobs:
         run: |
           cd test/srt
           python3 test_eval_accuracy_large.py
-        timeout-minutes: 20
+        timeout-minutes: 10

.github/workflows/e2e-test.yml
...
@@ -6,11 +6,13 @@ on:
     paths:
       - "python/sglang/**"
       - "test/**"
+      - ".github/workflows/e2e-test.yml"
   pull_request:
     branches: [ main ]
     paths:
       - "python/sglang/**"
       - "test/**"
+      - ".github/workflows/e2e-test.yml"
   workflow_dispatch:

 concurrency:
...
@@ -39,13 +41,16 @@ jobs:
         run: |
           cd test/srt
           python3 -m unittest test_serving_throughput.TestServingThroughput.test_default
+        timeout-minutes: 10

       - name: Benchmark Serving Throughput (w/o RadixAttention)
         run: |
           cd test/srt
           python3 -m unittest test_serving_throughput.TestServingThroughput.test_default_without_radix_cache
+        timeout-minutes: 10

       - name: Benchmark Serving Throughput (w/ ChunkedPrefill)
         run: |
           cd test/srt
           python3 -m unittest test_serving_throughput.TestServingThroughput.test_default_with_chunked_prefill
+        timeout-minutes: 10

.github/workflows/moe-test.yml
...
@@ -6,11 +6,13 @@ on:
     paths:
       - "python/sglang/**"
       - "test/**"
+      - ".github/workflows/moe-test.yml"
   pull_request:
     branches: [ main ]
     paths:
       - "python/sglang/**"
       - "test/**"
+      - ".github/workflows/moe-test.yml"
   workflow_dispatch:

 concurrency:
...
@@ -36,7 +38,12 @@ jobs:
           pip install flashinfer -i https://flashinfer.ai/whl/cu121/torch2.4/ --force-reinstall

       - name: Benchmark MOE Serving Throughput
-        run: |
-          cd test/srt
-          python3 -m unittest test_moe_serving_throughput.TestServingThroughput.test_default
-          python3 -m unittest test_moe_serving_throughput.TestServingThroughput.test_default_without_radix_cache
+        uses: nick-fields/retry@v3
+        with:
+          timeout_minutes: 15
+          max_attempts: 2
+          retry_on: error
+          command: |
+            cd test/srt
+            python3 -m unittest test_moe_serving_throughput.TestServingThroughput.test_default
+            python3 -m unittest test_moe_serving_throughput.TestServingThroughput.test_default_without_radix_cache
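
For readers unfamiliar with the retry step in moe-test.yml, the behaviour it asks for can be sketched roughly in Python. This is an illustration under stated assumptions, not the implementation of nick-fields/retry: `run_with_retry` is a hypothetical helper, the timeout is assumed to apply per attempt, and `retry_on: error` is assumed to mean that only non-zero exits are retried while a timeout ends the step.

```python
import subprocess
import sys


def run_with_retry(command, timeout_minutes, max_attempts):
    """Run `command` until it exits 0; return the 1-based attempt that succeeded.

    Hypothetical helper for illustration only; not part of sglang or the action.
    """
    for attempt in range(1, max_attempts + 1):
        # subprocess.run raises subprocess.TimeoutExpired when the limit is hit.
        # It is deliberately not caught: in this sketch only non-zero exits are
        # retried (an assumed reading of `retry_on: error`).
        result = subprocess.run(command, timeout=timeout_minutes * 60)
        if result.returncode == 0:
            return attempt
        print(f"attempt {attempt}/{max_attempts} failed with exit code {result.returncode}")
    raise RuntimeError(f"command failed after {max_attempts} attempts")


if __name__ == "__main__":
    # A trivially succeeding command stands in for the unittest invocations.
    print(run_with_retry([sys.executable, "-c", "pass"], timeout_minutes=15, max_attempts=2))
```

With `max_attempts: 2`, a single flaky benchmark run gets one more chance before the job is marked failed, which is the apparent intent of the change.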

.github/workflows/unit-test.yml
...
@@ -6,11 +6,13 @@ on:
     paths:
       - "python/sglang/**"
       - "test/**"
+      - ".github/workflows/unit-test.yml"
   pull_request:
     branches: [ main ]
     paths:
       - "python/sglang/**"
       - "test/**"
+      - ".github/workflows/unit-test.yml"
   workflow_dispatch:

 concurrency:
...
@@ -41,8 +43,10 @@ jobs:
         run: |
           cd test/srt
           python3 run_suite.py --suite minimal
+        timeout-minutes: 15

       - name: Test Frontend Language
         run: |
           cd test/lang
           python3 run_suite.py --suite minimal
+        timeout-minutes: 10

test/srt/test_moe_serving_throughput.py
...
@@ -73,7 +73,7 @@ class TestServingThroughput(unittest.TestCase):
         if os.getenv("SGLANG_IS_IN_CI", "false") == "true":
             # A100 (PCIE) performance
-            assert res["output_throughput"] > 950
+            assert res["output_throughput"] > 930

     def test_default_without_radix_cache(self):
         res = self.run_test(
...
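
The threshold change above sits behind an environment check, so it only takes effect on the CI runner. A minimal standalone sketch of that pattern follows; `check_output_throughput` and its `minimum` parameter are hypothetical names introduced here, and the result dictionary is a stand-in for whatever `run_test` returns in the real test.

```python
import os


def check_output_throughput(res, minimum=930):
    """Enforce the throughput floor only when SGLANG_IS_IN_CI is "true".

    Hypothetical helper for illustration; the real test asserts inline.
    """
    if os.getenv("SGLANG_IS_IN_CI", "false") == "true":
        # The floor is tied to specific CI hardware (A100 PCIe per the source
        # comment), so it is skipped on other machines.
        if not res["output_throughput"] > minimum:
            raise AssertionError(
                f"output_throughput {res['output_throughput']} is not above {minimum}"
            )


if __name__ == "__main__":
    # Made-up number, only to exercise the check.
    check_output_throughput({"output_throughput": 1000.0})
```

Gating on the environment variable keeps the same test usable on developer machines, where absolute throughput differs from the CI hardware.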