Use parallel version of AdamW optimizer
Summary:
Pull Request resolved: https://github.com/facebookresearch/d2go/pull/448

Tracing d2go runners that use the AdamW optimizer showed many small operators being executed in the optimizer code. These can be fused together by using the foreach version of the optimizer. QPS gain is ~4.5%.

Reviewed By: miqueljubert

Differential Revision: D42004110

fbshipit-source-id: 807e0a297bb0b4272f67cc4348389294145a20eb
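
For illustration only, here is a minimal sketch of selecting the foreach (multi-tensor) AdamW in plain PyTorch. It is not the d2go change itself, which is not shown here. It assumes a PyTorch version whose `torch.optim.AdamW` accepts the `foreach` keyword; the toy model, batch, and hyperparameters are hypothetical.

```python
import torch

# Hypothetical toy model; the real d2go runners build much larger models.
model = torch.nn.Linear(4, 2)

# foreach=True requests the multi-tensor implementation, which updates
# groups of parameters with batched ops instead of one small op per tensor.
optimizer = torch.optim.AdamW(
    model.parameters(), lr=1e-3, weight_decay=0.01, foreach=True
)

before = model.weight.detach().clone()

# One ordinary training step on random data.
loss = model(torch.randn(8, 4)).sum()
loss.backward()
optimizer.step()
optimizer.zero_grad()
```

The update rule is the same as in the default per-tensor loop; only the way the work is dispatched changes, so any speedup depends on the model and hardware.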