Merge branch 'main' into lmcafee/distrib-opt-nodupe
merge sequence parallelism's layernorm all-reduce into distributed optimizer.
Showing
224 KB
This diff is collapsed.
merge sequence parallelism's layernorm all-reduce into distributed optimizer.
224 KB