feat: vllm. Use prefill-specific health check payload and use bos as token_id (#3126)
Signed-off-by:
tzulingk@nvidia.com <tzulingk@nvidia.com>
Showing
Please register or sign in to comment
Signed-off-by:
tzulingk@nvidia.com <tzulingk@nvidia.com>