Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
change
sglang
Commits
c07526e4
Unverified
Commit
c07526e4
authored
Jul 22, 2024
by
zhyncs
Committed by
GitHub
Jul 22, 2024
Browse files
fix: update bench serving (#694)
parent
7b597475
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
3 deletions
+1
-3
python/sglang/bench_serving.py
python/sglang/bench_serving.py
+1
-3
No files found.
python/sglang/bench_serving.py
View file @
c07526e4
...
@@ -70,12 +70,10 @@ async def async_request_trt_llm(
...
@@ -70,12 +70,10 @@ async def async_request_trt_llm(
assert
api_url
.
endswith
(
"generate_stream"
)
assert
api_url
.
endswith
(
"generate_stream"
)
async
with
aiohttp
.
ClientSession
(
timeout
=
AIOHTTP_TIMEOUT
)
as
session
:
async
with
aiohttp
.
ClientSession
(
timeout
=
AIOHTTP_TIMEOUT
)
as
session
:
assert
not
request_func_input
.
use_beam_search
assert
request_func_input
.
best_of
==
1
payload
=
{
payload
=
{
"accumulate_tokens"
:
True
,
"accumulate_tokens"
:
True
,
"text_input"
:
request_func_input
.
prompt
,
"text_input"
:
request_func_input
.
prompt
,
"temperature"
:
0.0
,
"temperature"
:
0.0
00001
,
"top_p"
:
1.0
,
"top_p"
:
1.0
,
"max_tokens"
:
request_func_input
.
output_len
,
"max_tokens"
:
request_func_input
.
output_len
,
"stream"
:
True
,
"stream"
:
True
,
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment