[Model Runner V2] support auto resolve cudagraph mode/sizes based on attn backend (#32936)
Signed-off-by:
zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>
Showing
Please register or sign in to comment
Signed-off-by:
zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>