[BugFix] Fix OOM in vLLM replicas by ensuring consistent NCCL memory accounting (#25359)
Signed-off-by:
Kourosh Hakhamaneshi <kourosh@anyscale.com>
Showing
Please register or sign in to comment
Signed-off-by:
Kourosh Hakhamaneshi <kourosh@anyscale.com>