"git@developer.sourcefind.cn:renzhc/diffusers_dcu.git" did not exist on "f12d161d6763cff0f45b0ec3b3f6072a2b7c7f9d"
Unverified Commit 3ecae420 authored by Darinka's avatar Darinka Committed by GitHub
Browse files

Update api.md (#3945)



* Update api.md

Changed the calculation of tps (token/s) in the documentation

* Update docs/api.md

---------
Co-authored-by: default avatarJeffrey Morgan <jmorganca@gmail.com>
parent 4cbbf0e1
...@@ -95,7 +95,7 @@ The final response in the stream also includes additional data about the generat ...@@ -95,7 +95,7 @@ The final response in the stream also includes additional data about the generat
- `context`: an encoding of the conversation used in this response, this can be sent in the next request to keep a conversational memory - `context`: an encoding of the conversation used in this response, this can be sent in the next request to keep a conversational memory
- `response`: empty if the response was streamed, if not streamed, this will contain the full response - `response`: empty if the response was streamed, if not streamed, this will contain the full response
To calculate how fast the response is generated in tokens per second (token/s), divide `eval_count` / `eval_duration`. To calculate how fast the response is generated in tokens per second (token/s), divide `eval_count` / `eval_duration` * `10^9`.
```json ```json
{ {
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment