1. 02 May, 2019 4 commits
  2. 01 May, 2019 5 commits
  3. 30 Apr, 2019 6 commits
  4. 29 Apr, 2019 2 commits
  5. 27 Apr, 2019 2 commits
  6. 26 Apr, 2019 1 commit
  7. 25 Apr, 2019 6 commits
  8. 24 Apr, 2019 1 commit
  9. 22 Apr, 2019 2 commits
    • Max Ryabinin's avatar
      Fix generation with --no-early-stop (#627) · fa52d202
      Max Ryabinin authored
      Summary:
      Because the size of `unfinalized_scores` is equal to current `bsz` and not initial batch size, we need to index it by `unfin_idx` instead of `sent` in `is_finished`.
      Fixes #588.
      Pull Request resolved: https://github.com/pytorch/fairseq/pull/627
      
      Differential Revision: D15034641
      
      Pulled By: myleott
      
      fbshipit-source-id: 2638e68e877ae01256cac7d8e69b5b7fec8f7017
      fa52d202
    • Yongqiang Wang's avatar
      reduce memory footprint for average_checkpoints (#647) · d63477e1
      Yongqiang Wang authored
      Summary:
      Pull Request resolved: https://github.com/pytorch/fairseq/pull/647
      
      the current implementation of average_checkpoints requires loading all
      the model parameters into memory and then do the averaging. To average large
      models (e.g., transformer) over a large number of checkpoints (e.g., >50),
      it may require over 100GB memory.
      
      Loading all the parameters is not necessary, as we know the number of models in advance.
      
      Reviewed By: skritika
      
      Differential Revision: D15027513
      
      fbshipit-source-id: 0afe37c9a031a9ab0f1e78844a37be49ec5f76f1
      d63477e1
  10. 17 Apr, 2019 3 commits
  11. 16 Apr, 2019 1 commit
  12. 15 Apr, 2019 3 commits
  13. 12 Apr, 2019 1 commit
  14. 10 Apr, 2019 3 commits
    • Xian Li's avatar
      Fix sacrebleu (#630) · 58b912f6
      Xian Li authored
      Summary:
      Pull Request resolved: https://github.com/pytorch/fairseq/pull/630
      
      sacrebleu scorer has stopped working in pytorch_translate (maybe
      fairseq too) probably due to  a recent api change.
      
      Reviewed By: jmp84
      
      Differential Revision: D14792797
      
      fbshipit-source-id: c2a00246e08bc913c41e60c5fbf8ab4ab5e80d18
      58b912f6
    • Liezl Puzon's avatar
      Make TransformerEncoderLayer layer norm names more descriptive · e5ba94ab
      Liezl Puzon authored
      Summary:
      I added an upgrade_state_dict function so that loading old models will still work
      
      layer_norms[0] --> self_attn_layer_norm
      layer_norms[1] --> final_layer_norm
      
      Reviewed By: pipibjc
      
      Differential Revision: D14689849
      
      fbshipit-source-id: b2809262c11fe9d083e571fa31044798aefd48ce
      e5ba94ab
    • Kritika Singh's avatar
      Add anneal-eps argument · 309f2511
      Kritika Singh authored
      Summary: Used in fairspeq/train.py
      
      Reviewed By: myleott, yqwangustc
      
      Differential Revision: D14841512
      
      fbshipit-source-id: 02fd7b58841c32e2797e3159e65f2bef36f02da1
      309f2511