Unverified Commit 05c50a82 authored by Roger Wang's avatar Roger Wang Committed by GitHub
Browse files

Minor bugfix on benchmark serving (#1005)

parent b568df5d
...@@ -195,6 +195,7 @@ async def async_request_openai_completions( ...@@ -195,6 +195,7 @@ async def async_request_openai_completions(
output.ttft = ttft output.ttft = ttft
# Decoding phase # Decoding phase
else:
output.itl.append(timestamp - most_recent_timestamp) output.itl.append(timestamp - most_recent_timestamp)
most_recent_timestamp = timestamp most_recent_timestamp = timestamp
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment