fix: Set max_tokens_in_buffer on TRT-LLM cache transceiver (#4703)
Signed-off-by: Jacky <18255193+kthui@users.noreply.github.com>