Adafactor: avoid updating group["lr"] attributes (#9751)
This affects Adafactor with relative_step=False and scale_parameter=True. Updating group["lr"] makes the result of ._get_lr() depend on the previous call, i.e., on the scale of other parameters. This isn't supposed to happen.
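
A rough illustration of the dependency described above, as a pure-Python sketch rather than the actual optimizer code. The helper names (get_lr, step_lrs_buggy, step_lrs_fixed, make_group), the "eps2" key, and the numeric values are hypothetical; only group["lr"], scale_parameter, and the idea of scaling the learning rate by a per-parameter RMS are taken from the commit message.

```python
def get_lr(group, state):
    """Simplified stand-in for ._get_lr() with relative_step=False (assumption)."""
    param_scale = 1.0
    if group["scale_parameter"]:
        # Scale the base lr by the parameter's RMS, floored by a small epsilon.
        param_scale = max(group["eps2"], state["RMS"])
    return param_scale * group["lr"]


def step_lrs_buggy(group, states):
    """Writes the scaled lr back into the shared group, as before this change."""
    lrs = []
    for state in states:
        group["lr"] = get_lr(group, state)  # overwrites the base lr for later parameters
        lrs.append(group["lr"])
    return lrs


def step_lrs_fixed(group, states):
    """Keeps the scaled lr in a local variable, leaving group["lr"] untouched."""
    lrs = []
    for state in states:
        lr = get_lr(group, state)
        lrs.append(lr)
    return lrs


def make_group():
    # Hypothetical settings: base lr 0.5, parameter scaling enabled.
    return {"lr": 0.5, "scale_parameter": True, "eps2": 1e-3}


# Two parameters with different RMS values (made-up numbers).
states = [{"RMS": 0.5}, {"RMS": 2.0}]

buggy_group = make_group()
fixed_group = make_group()
buggy = step_lrs_buggy(buggy_group, states)
fixed = step_lrs_fixed(fixed_group, states)

print("buggy:", buggy)  # second lr is computed from the already-scaled first lr
print("fixed:", fixed)  # each lr is computed from the base lr only
```

In the buggy variant the second parameter's learning rate is computed from the value left behind by the first parameter, so it depends on that parameter's scale. In the fixed variant each learning rate depends only on the base group["lr"] and the parameter's own RMS.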