change / sglang
Commit 7282ab74 (Unverified)
fix: update bench_speculative (#5649)
Authored by Yineng Zhang on Apr 22, 2025
Committed by GitHub on Apr 22, 2025
Parent: b0feda09
Showing 2 changed files with 3 additions and 1 deletion (+3 -1)
python/sglang/bench_serving.py (+2 -0)
scripts/playground/bench_speculative.py (+1 -1)
python/sglang/bench_serving.py
@@ -993,6 +993,8 @@ async def benchmark(
         async with semaphore:
             return await request_func(request_func_input=request_func_input, pbar=pbar)
+    if not hasattr(args, "warmup_requests"):
+        args.warmup_requests = 1
     # Warmup
     print(f"Starting warmup with {args.warmup_requests} sequences...")
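The two added lines give `args.warmup_requests` a default of 1 when the caller's argument object lacks that attribute. A minimal sketch of the pattern follows, assuming `args` is a plain namespace-like object; the helper name `ensure_warmup_requests` and the `num_prompts` field are hypothetical and used only for illustration, not taken from the commit.

```python
from types import SimpleNamespace


def ensure_warmup_requests(args, default=1):
    """Set args.warmup_requests only when the caller did not provide it."""
    if not hasattr(args, "warmup_requests"):
        args.warmup_requests = default
    return args


# A hand-built namespace (hypothetical caller) that omits warmup_requests.
script_args = ensure_warmup_requests(SimpleNamespace(num_prompts=8))

# A namespace that already carries the attribute keeps its own value.
cli_args = ensure_warmup_requests(SimpleNamespace(num_prompts=8, warmup_requests=4))

print(script_args.warmup_requests, cli_args.warmup_requests)
```

Checking with `hasattr` leaves an explicitly supplied value untouched, so callers that already set the attribute should see no change in behavior.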
scripts/playground/bench_speculative.py
@@ -85,7 +85,7 @@ def send_one_batch(base_url, num_prompts, batch_size):
         request_rate=float("inf"),
         max_concurrency=batch_size,
         disable_tqdm=False,
-        lora_name=None,
+        lora_names=None,
         extra_request_body={},
         profile=None,
     )
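The hunk renames the keyword `lora_name` to `lora_names`. The object these keywords populate is not visible in the hunk, so the sketch below assumes a namespace-like container whose attributes are read by name. The consumer `read_lora_names` is hypothetical and only illustrates why the singular spelling would no longer be found.

```python
from types import SimpleNamespace


def read_lora_names(args):
    """Hypothetical consumer that expects the plural attribute name."""
    return args.lora_names


old_style = SimpleNamespace(lora_name=None)   # keyword before this commit
new_style = SimpleNamespace(lora_names=None)  # keyword after this commit

try:
    read_lora_names(old_style)
    old_style_ok = True
except AttributeError:
    # The singular attribute does not satisfy a lookup of the plural name.
    old_style_ok = False

new_style_ok = read_lora_names(new_style) is None
print(old_style_ok, new_style_ok)
```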