sglang
Commit 56405076 (Unverified)
Authored Sep 17, 2025 by Yineng Zhang
Committed by GitHub, Sep 17, 2025

fix: update dsv3 fp4 ut (#10584)

parent b73ac629

Showing 1 changed file with 4 additions and 4 deletions

test/srt/test_deepseek_v3_fp4_4gpu.py (+4 -4)
@@ -141,8 +141,8 @@ class TestDeepseekV3FP4MTP(CustomTestCase):
                 f'{metrics["accuracy"]=:.3f}\n'
                 f"{avg_spec_accept_length=:.2f}\n"
             )
-            self.assertGreater(metrics["accuracy"], 0.935)
-            self.assertGreater(avg_spec_accept_length, 2.9)
+            self.assertGreater(metrics["accuracy"], 0.94)
+            self.assertGreater(avg_spec_accept_length, 2.04)
 
     def test_bs_1_speed(self):
         args = BenchArgs(port=int(self.base_url.split(":")[-1]), max_new_tokens=2048)
@@ -156,8 +156,8 @@ class TestDeepseekV3FP4MTP(CustomTestCase):
                 f"{acc_length=:.2f}\n"
                 f"{speed=:.2f} token/s\n"
             )
-            self.assertGreater(acc_length, 2.9)
-            self.assertGreater(speed, 130)
+            self.assertGreater(acc_length, 2.04)
+            self.assertGreater(speed, 150)
 
 
 class TestDeepseekV3FP4CutlassMoE(CustomTestCase):
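
For readers unfamiliar with the two idioms this diff touches, here is a minimal, standalone sketch. It shows what the `=` f-string specifier prints and that `assertGreater` is a strict comparison, so a measurement exactly equal to a threshold fails. The helper names `format_report` and `passes_thresholds` and the sample values are hypothetical and are not part of the sglang test file. Only the format strings and the new thresholds (2.04 and 150) are taken from the diff.

```python
import unittest


def format_report(acc_length: float, speed: float) -> str:
    # Hypothetical helper: reuses the format strings shown in the diff.
    # "{name=:.2f}" renders as "name=<value with 2 decimals>".
    return f"{acc_length=:.2f}\n" f"{speed=:.2f} token/s\n"


def passes_thresholds(
    acc_length: float,
    speed: float,
    min_acc_length: float = 2.04,  # new threshold from the diff
    min_speed: float = 150,        # new threshold from the diff
) -> bool:
    # Hypothetical helper: runs the same assertGreater checks outside a
    # test method and reports the outcome as a bool.
    tc = unittest.TestCase()
    try:
        tc.assertGreater(acc_length, min_acc_length)  # strict: a > b
        tc.assertGreater(speed, min_speed)
    except AssertionError:
        return False
    return True


if __name__ == "__main__":
    # Sample values are made up for illustration, not measured results.
    print(format_report(2.35, 163.4), end="")
    print(passes_thresholds(2.35, 163.4))
    print(passes_thresholds(2.35, 150))
```

Because the comparison is strict, a run that lands exactly on a threshold (for example a speed of exactly 150) is reported as a failure.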