"...git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "993a187c6ff03cd971c71ad234d1ddfb3beb020a"
storing & logging gradient norm in trainer (#27326)
* report grad_norm during training * support getting grad_norm from deepspeed
Showing
Please register or sign in to comment