sync-free Distributed LAMB + parameter reordering (#1055)
* sync free Distributed LAMB
* init lr with provided value
* wait l2 norm strem
* reorder param
* fix indent
Co-authored-by:
Kexin Yu <kexiny@nvidia.com>
Showing
This diff is collapsed.
Please register or sign in to comment