[Core/DBO][1/N] Add Dual-Batch Overlap mechanism to VLLM (#23693)
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Sage Moore <sage@neuralmagic.com> Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Showing
This diff is collapsed.
Please register or sign in to comment