Unverified commit 51906c8c, authored by maang-h, committed by GitHub

[Docs] Improve `priority` parameter documentation (#29572)


Signed-off-by: maang <maang_h@163.com>
Signed-off-by: maang-h <55082429+maang-h@users.noreply.github.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
parent 0838b52e
@@ -405,6 +405,9 @@ class LLM:
            lora_request: LoRA request to use for generation, if any.
            priority: The priority of the requests, if any.
                Only applicable when priority scheduling policy is enabled.
                If provided, must be a list of integers matching the length
                of `prompts`, where each priority value corresponds to the prompt
                at the same index.
        Returns:
            A list of `RequestOutput` objects containing the
......
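The contract described in the updated docstring (one integer priority per prompt, matched by index) can be illustrated with a small standalone check. This is only a sketch: `pair_prompts_with_priorities` is a hypothetical helper written for this example and is not part of vLLM, and the sample prompts and priority values are made up.

```python
from typing import List, Optional, Sequence, Tuple


def pair_prompts_with_priorities(
    prompts: Sequence[str],
    priority: Optional[Sequence[int]] = None,
) -> List[Tuple[str, Optional[int]]]:
    """Hypothetical helper (not a vLLM API): enforce the documented contract.

    If `priority` is given, it must hold one integer per prompt, and
    priority[i] applies to prompts[i].
    """
    if priority is None:
        # No priorities supplied, so there is nothing to validate.
        return [(prompt, None) for prompt in prompts]

    if len(priority) != len(prompts):
        raise ValueError(
            f"got {len(priority)} priorities for {len(prompts)} prompts; "
            "the lengths must match"
        )

    for value in priority:
        # bool is a subclass of int, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, int):
            raise TypeError(f"priority values must be integers, got {value!r}")

    # Index-wise pairing: the i-th priority belongs to the i-th prompt.
    return list(zip(prompts, priority))


# Illustrative inputs (assumed, not taken from the commit).
prompts = ["Summarize this report.", "Translate 'hello' to French.", "Write a haiku."]
priorities = [2, 0, 1]

pairs = pair_prompts_with_priorities(prompts, priorities)
for prompt, prio in pairs:
    print(f"priority={prio}: {prompt}")
```

Assuming this docstring belongs to a generation method on `LLM`, the same two lists would presumably be passed as the `prompts` argument and the `priority=` keyword of that call, on an engine where the priority scheduling policy is enabled.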