[training] use the lr when using 8bit adam. (#9796)
* use the lr when using 8bit adam.
* remove lr as we pack it in params_to_optimize.
---------
Co-authored-by:
Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>
Showing
Please register or sign in to comment