fixed bugs of masked_lm for fine-tuning (#744)
Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/744 Pull Request resolved: https://github.com/fairinternal/fairseq-py/pull/587 After we added additional prediciton layers for language model predictions. The fine-tuning is broken because of 2 reasons. 1. checkpoint cannot be loaded since we didn't update state_dict names 2. lm_output_learned_bias is not initialize if load_softmax is false Reviewed By: myleott Differential Revision: D15377380 fbshipit-source-id: d58544b1d2c549586abef42fec19ec8bf27a994a
Showing
Please register or sign in to comment