Parallelize EMA optimizer
Summary: Pull Request resolved: https://github.com/facebookresearch/d2go/pull/451 Tracing d2go runners using adamw optimizer yielded small operators being executed in the EMA code. They can be fused together by using multi-tensor API. Reviewed By: tglik Differential Revision: D42098310 fbshipit-source-id: 544d7e214964530ec03674986827410b0f60951f
Showing
Please register or sign in to comment