"tools/vscode:/vscode.git/clone" did not exist on "7a6349e67fb094ce125b5235de51b601e71730be"
do not scale gradient in bf16 mode (#21428)
* no dot scale gradient in bf16 mode * fix since args.fp16 might be none * fixed typo * typo * only do if grad scaling is true * self.amp_dtype == torch.float16 is true * put back prop when fsdp is not none
Showing
Please register or sign in to comment