[Perf] use cpu all reduce to avoid sync when async_scheduling & dp > 1 (#29311)
Signed-off-by: zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>