"vscode:/vscode.git/clone" did not exist on "3ff641c40290aeeef593b15c16e65394a6eeee67"
Adafactor: avoid updating group["lr"] attributes (#9751)
This affects Adafactor with relative_step=False and scale_parameter=True. Updating group["lr"] makes the result of ._get_lr() depends on the previous call, i.e., on the scale of other parameters. This isn't supposed to happen.
Showing
Please register or sign in to comment