[Model Runner V2] Enable piecewise CUDA graphs for pipeline parallelism (#35162)
Signed-off-by:Zhanqiu Hu <zh338@cornell.edu> Signed-off-by:
Woosuk Kwon <woosuk@inferact.ai> Co-authored-by:
Woosuk Kwon <woosuk@inferact.ai>
Showing
Please register or sign in to comment