Commit 8a93a598 authored by Wang, Yi, committed by GitHub

fix the issue that len(tokenizer(prompt)["input_ids"]) > prompt_len (#10524)


Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
parent 1cfde82f
@@ -54,6 +54,7 @@ async def async_request_tgi(
             "do_sample": True,
             "temperature": 0.01,  # TGI does not accept 0.0 temperature.
             "top_p": 0.99,  # TGI does not accept 1.0 top_p.
+            "truncate": request_func_input.prompt_len,
             # TGI does not accept ignore_eos flag.
         }
         payload = {
......
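
For readers who want to see the effect of the added line in isolation, here is a minimal sketch of a request body that carries `truncate`. It assumes TGI treats `truncate` as an upper bound on the number of input tokens, so a prompt that re-tokenizes to more than `prompt_len` tokens is cut back to that length on the server. `build_tgi_payload` is a hypothetical helper, not a function from this commit, and the `inputs`, `parameters`, and `max_new_tokens` names are assumptions about the surrounding code that the hunk does not show.

```python
def build_tgi_payload(prompt: str, prompt_len: int, output_len: int) -> dict:
    """Hypothetical helper: assemble a TGI-style request body.

    Only the four sampling/truncation keys below appear in the hunk;
    "max_new_tokens", "inputs" and "parameters" are assumed names.
    """
    params = {
        "max_new_tokens": output_len,  # assumed field, not shown in the hunk
        "do_sample": True,
        "temperature": 0.01,  # TGI does not accept 0.0 temperature.
        "top_p": 0.99,  # TGI does not accept 1.0 top_p.
        # Cap the input at the benchmark's expected prompt length so the
        # server-side token count cannot exceed prompt_len.
        "truncate": prompt_len,
    }
    return {"inputs": prompt, "parameters": params}


payload = build_tgi_payload("Hello, world", prompt_len=4, output_len=16)
```

Passing `prompt_len` through unchanged keeps the benchmark's accounting consistent: the number of input tokens the server processes is bounded by the same value the benchmark reports.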