fixed roberta finetuning with --find-unused-parameters on multiGPU
Summary: Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/806 Differential Revision: D16649933 fbshipit-source-id: 6eeda6e2caf8019228e3efc0c27ddfcc3c4d8674
Showing
Please register or sign in to comment