[feat] Add DynamicLossScaler class to support optimizer updates on the CPU (#635)
* dynamic loss scaler
* isort
* black
* flake8
* comments
* added the test to the CI file, added a line to catch the overflow error, fixed some formatting errors
* added type annotations
* added a TODO for adding more test cases for handling NaN gradients
* fixed some docstrings and comments, added more TODOs
* fixed two docstrings
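
For context, dynamic loss scaling generally works as follows: the loss is multiplied by a scale factor before the backward pass, gradients are divided by that factor before the optimizer step, the scale is reduced when an inf/NaN gradient is detected, and it is increased again after a run of overflow-free steps. The snippet below is a minimal pure-Python sketch of that general idea only. The class name `SketchLossScaler`, its parameters, and its defaults are hypothetical and are not taken from the `DynamicLossScaler` added in this commit.

```python
import math
from typing import Iterable, List


class SketchLossScaler:
    """Hypothetical illustration of dynamic loss scaling (not this commit's API)."""

    def __init__(
        self,
        init_scale: float = 2.0 ** 15,  # assumed default, for illustration only
        scale_factor: float = 2.0,
        scale_window: int = 2000,
        min_scale: float = 1.0,
    ) -> None:
        self.scale = init_scale
        self.scale_factor = scale_factor
        self.scale_window = scale_window
        self.min_scale = min_scale
        self._good_steps = 0  # consecutive steps without overflow

    @staticmethod
    def has_overflow(grads: Iterable[float]) -> bool:
        # An overflow is signalled by any inf or NaN gradient value.
        return any(math.isinf(g) or math.isnan(g) for g in grads)

    def unscale(self, grads: Iterable[float]) -> List[float]:
        # Undo the loss scaling before the optimizer consumes the gradients.
        return [g / self.scale for g in grads]

    def update(self, overflow: bool) -> None:
        if overflow:
            # Back off the scale (never below min_scale) and restart the count.
            self.scale = max(self.scale / self.scale_factor, self.min_scale)
            self._good_steps = 0
        else:
            self._good_steps += 1
            # Grow the scale after scale_window consecutive clean steps.
            if self._good_steps % self.scale_window == 0:
                self.scale *= self.scale_factor
```

In a training loop under these assumptions, a step would check `has_overflow` on the scaled gradients, skip the optimizer update when it returns `True`, and call `update` with the result either way.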