storing & logging gradient norm in trainer (#27326)
* report grad_norm during training
* support getting grad_norm from deepspeed
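
For illustration only, here is a minimal sketch of what reporting a gradient norm alongside the loss could look like. It is not the code from this commit: the helpers `global_grad_norm` and `build_log_entry` are hypothetical, gradients are plain lists of floats rather than tensors, and the DeepSpeed path is not modelled.

```python
import math


def global_grad_norm(grads):
    """Return the L2 norm of all gradient values, treated as one flat vector.

    `grads` is a list of per-parameter gradients, each a list of floats
    (a stand-in for real tensors; an assumption made for this sketch).
    """
    return math.sqrt(sum(g * g for tensor in grads for g in tensor))


def build_log_entry(loss, grads):
    """Hypothetical helper: bundle the loss and the gradient norm for logging."""
    return {"loss": loss, "grad_norm": global_grad_norm(grads)}


# Two parameters with gradients [3.0] and [4.0] give a global norm of 5.0.
entry = build_log_entry(0.5, [[3.0], [4.0]])
print(entry)
```

The norm is taken over all parameters at once, so a single scalar can be logged per step.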