Add option to use fused adamw optimizer
Summary: Pull Request resolved: https://github.com/facebookresearch/d2go/pull/471 adamw recently added an option to use a fused optimizer. It may give better performance than foreach argument. However, we cannot enable it by default, since it requires all parameters to be in CUDA and maybe some other restrictions. So, enable it on a per project basis. On DALLE2, it results about 23ms faster. Reviewed By: newstzpz Differential Revision: D43027327 fbshipit-source-id: 82c6855116094e86386ad2edeea3a74f9e555174
Showing
Please register or sign in to comment