"megatron/git@developer.sourcefind.cn:OpenDAS/megatron-lm.git" did not exist on "14e60427afafdeeac748a929664e32ffc525665b"
Prototype loss blowup recovery in the base trainer.
When loss is NaN, the weights should also be NaN. PiperOrigin-RevId: 341886095
Showing
Please register or sign in to comment