Commit 26fc32d1
Authored Apr 28, 2025 by Lianmin Zheng; committed by GitHub on Apr 28, 2025
[CI] tune the test order to warmup the server (#5860)
Parent: 1cc32603
Showing 2 changed files with 7 additions and 3 deletions:

python/sglang/srt/layers/quantization/deep_gemm.py  +1 -1
test/srt/test_full_deepseek_v3.py  +6 -2
python/sglang/srt/layers/quantization/deep_gemm.py

@@ -293,7 +293,7 @@ def _maybe_compile_deep_gemm_one_type_all(
     logger.info(
         f"Try DeepGEMM JIT Compiling for "
         f"<{kernel_helper.name}> N={n}, K={k}, num_groups={num_groups} with all Ms."
-        f"{' It only takes a litte time(Typically 1 sec) if you have run `sglang.compile_deep_gemm`. ' if not _IN_PRECOMPILE_STAGE else ''}"
+        f"{' It only takes a litte time (typically 1 sec) if you have run `python3 -m sglang.compile_deep_gemm`. ' if not _IN_PRECOMPILE_STAGE else ''}"
     )
 
     # NOTE(alcanderian): get_num_sms should be change when 2-batch-overlap is introduced
test/srt/test_full_deepseek_v3.py

@@ -35,7 +35,9 @@ class TestDeepseekV3(CustomTestCase):
     def tearDownClass(cls):
         kill_process_tree(cls.process.pid)
 
-    def test_gsm8k(self):
+    def test_a_gsm8k(
+        self,
+    ):  # Append an "a" to make this test run first (alphabetically) to warm up the server
         args = SimpleNamespace(
             num_shots=8,
             data_path=None,

@@ -100,7 +102,9 @@ class TestDeepseekV3MTP(CustomTestCase):
     def tearDownClass(cls):
         kill_process_tree(cls.process.pid)
 
-    def test_gsm8k(self):
+    def test_a_gsm8k(
+        self,
+    ):  # Append an "a" to make this test run first (alphabetically) to warm up the server
         requests.get(self.base_url + "/flush_cache")
 
         args = SimpleNamespace(
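
The rename from `test_gsm8k` to `test_a_gsm8k` relies on how Python's `unittest` orders test methods: the default `TestLoader` sorts method names as strings, so a name beginning with `test_a_` sorts ahead of names beginning with later letters. A minimal sketch of that behavior follows, assuming the suite is collected by the default loader (whether `CustomTestCase` changes collection is not shown in this diff). `OrderingDemo`, `test_b_second`, `test_c_third`, and `run_demo` are hypothetical names used only for illustration.

```python
import unittest

calls = []  # records the order in which the test methods actually run


class OrderingDemo(unittest.TestCase):
    # Hypothetical test case; methods are deliberately defined out of order.
    def test_b_second(self):
        calls.append("test_b_second")

    def test_a_gsm8k(self):
        # The "a" makes this name sort first among the three methods.
        calls.append("test_a_gsm8k")

    def test_c_third(self):
        calls.append("test_c_third")


def run_demo():
    """Run OrderingDemo with the default loader and return the execution order."""
    calls.clear()
    suite = unittest.TestLoader().loadTestsFromTestCase(OrderingDemo)
    unittest.TextTestRunner(verbosity=0).run(suite)
    return list(calls)


if __name__ == "__main__":
    print(run_demo())
```

Definition order inside the class does not matter here; only the sorted method names do, which is why a one-letter rename is enough to move the warm-up test to the front.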