Merge branch 'main' into lmcafee/distrib-opt-nodupe
merge sequence parallelism's layernorm all-reduce into distributed optimizer.
Showing
This diff is collapsed.
tools/merge_datasets.py
0 → 100644
Please register or sign in to comment