1. 08 May, 2019 4 commits
    • Cleanup LM + Flake8 · f2563c21
      Myle Ott authored
      Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/720
      
      Differential Revision: D15259091
      
      Pulled By: myleott
      
      fbshipit-source-id: 06a35996c06ccddb49fdc9e01e348ff3c9da334e
    • Fix indexing in TokenBlockDataset · eddcdf08
      Myle Ott authored
      Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/719
      
      Differential Revision: D15258483
      
      Pulled By: myleott
      
      fbshipit-source-id: dd00daa6f1c87264c1196a77dfffc8c876ebde7f
    • Bugfix · 0cb45bcb
      Myle Ott authored
      Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/717
      
      Differential Revision: D15254560
      
      Pulled By: myleott
      
      fbshipit-source-id: 2a07614e8d294636f706939e60f0091c73115494
    • bugfix data not in args · 6a7eb6ce
      Jay Mahadeokar authored
      Summary:
      D15214049 introduced a bug: if a task's args do not contain `data`, it raises the following error:
      ```
      File "/data/users/jaym/fbsource/fbcode/buck-out/dev/gen/deeplearning/projects/fairspeq/train#link-tree/train.py", line 119, in reload_train
         if len(args.data.split(":")) == 1:
      AttributeError: 'Namespace' object has no attribute 'data'
      ```
      
      This diff checks whether `data` is in args to avoid the above error.
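
A minimal sketch of this kind of guard, assuming the args object is an `argparse.Namespace`. The helper name `num_data_paths` is hypothetical and is not taken from the diff; it only illustrates checking for the attribute before calling `split`.

```python
from argparse import Namespace


def num_data_paths(args):
    """Hypothetical helper: count colon-separated data paths, or 0 if absent."""
    # getattr with a default avoids the AttributeError shown in the traceback
    # when the task's args carry no `data` attribute at all.
    data = getattr(args, "data", None)
    if data is None:
        return 0
    return len(data.split(":"))


if __name__ == "__main__":
    print(num_data_paths(Namespace()))
    print(num_data_paths(Namespace(data="shard0:shard1")))
```

`argparse.Namespace` also supports the `in` operator, so `"data" in args` would be an equivalent way to express the check.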
      
      Reviewed By: myleott, jmp84
      
      Differential Revision: D15253373
      
      fbshipit-source-id: 14fb9ad878ee50f1b7583349bb17e29c03c40815
  2. 07 May, 2019 5 commits
  3. 06 May, 2019 5 commits
  4. 05 May, 2019 3 commits
  5. 04 May, 2019 4 commits
  6. 03 May, 2019 2 commits
  7. 02 May, 2019 5 commits
    • Make learned positional embedding optional · 39264559
      Peng-Jen Chen authored
      Summary:
      - Add learned positional embedding binary flag to masked LM model.
      - Add a base arch config for the masked LM model which sets all the binary parameters to False. Otherwise some of the binary flag parameters will always be overridden by the config in `xlm_architecture` (e.g. encoder_learned_pos)
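
One way such layered defaults can work is sketched below. This is an illustration under stated assumptions, not the code from the diff: `base_architecture` and `xlm_like_architecture` are hypothetical names, and only the flag `encoder_learned_pos` comes from the summary above.

```python
from argparse import Namespace


def base_architecture(args):
    """Hypothetical base config: binary flags default to False unless already set."""
    args.encoder_learned_pos = getattr(args, "encoder_learned_pos", False)


def xlm_like_architecture(args):
    """Hypothetical derived config: turns the flag on only if nothing set it earlier."""
    args.encoder_learned_pos = getattr(args, "encoder_learned_pos", True)
    base_architecture(args)


if __name__ == "__main__":
    plain = Namespace()
    base_architecture(plain)
    print(plain.encoder_learned_pos)

    derived = Namespace()
    xlm_like_architecture(derived)
    print(derived.encoder_learned_pos)
```

With a base config available, a user who wants every flag off can select it directly instead of inheriting the derived config's defaults.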
      
      Reviewed By: liezl200
      
      Differential Revision: D15054487
      
      fbshipit-source-id: d78827f352b9160a89c9dc4f45b9fce15a2f234d
    • Move distributed_init into DistributedFairseqModel (#687) · 34726d56
      Myle Ott authored
      Summary:
      This should make rendezvous happen as lazily as possible.
      Pull Request resolved: https://github.com/pytorch/fairseq/pull/687
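
The general idea of deferring an expensive setup step until first use can be sketched as follows. `LazyRendezvous` and `fake_init` are hypothetical names for illustration only; this does not reproduce the actual fairseq change and does not call any distributed API.

```python
class LazyRendezvous:
    """Hypothetical wrapper that runs `init_fn` once, on first access."""

    def __init__(self, init_fn):
        self._init_fn = init_fn
        self._initialized = False
        self._value = None

    def get(self):
        # Initialization is postponed until something actually needs the result.
        if not self._initialized:
            self._value = self._init_fn()
            self._initialized = True
        return self._value


if __name__ == "__main__":
    calls = []

    def fake_init():
        # Stand-in for a costly rendezvous step.
        calls.append("init")
        return "process-group"

    lazy = LazyRendezvous(fake_init)
    print(len(calls))   # nothing has run yet
    lazy.get()
    lazy.get()
    print(len(calls))   # the setup ran a single time
```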
      
      Differential Revision: D15151145
      
      Pulled By: myleott
      
      fbshipit-source-id: d70816a85414c5d509a6b12e2b339b4736db2c88
    • Validate on all sets based on --save-interval-updates · fb18be00
      Myle Ott authored
      Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/693
      
      Differential Revision: D15174831
      
      fbshipit-source-id: 98688b1269ead5694e5116659ff64507d3c0d1c0
    • Fix inconsistent gradient check · 4a30a5f6
      Myle Ott authored
      Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/692
      
      Differential Revision: D15174954
      
      fbshipit-source-id: 1a7bff9aeed3e2cc658577be9d79e8c9f72314c2
    • Make CTC work with more encoder-only models · ffc9c8cc
      Kritika Singh authored
      Summary:
      Changes include:
      1. Added get_normalized_probabilities to the encoder-only base class FairseqEncoderModel
      2. Made CTCCriterion work for both batch_first (LSTMSubsampleEncoderModel) and batch_second (LSTMEncoderOnly) encoder types
      3. Added tests for different encoder and CTC combinations.
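
The layout handling in item 2 can be illustrated with a small pure-Python sketch. It assumes encoder outputs are nested lists of shape [batch][time][vocab] (batch-first) or [time][batch][vocab] (batch-second); the function names `to_time_first` and `log_softmax` are hypothetical and are not the methods added in the diff.

```python
import math


def log_softmax(row):
    """Numerically stable log-softmax over one vocabulary row."""
    m = max(row)
    log_z = m + math.log(sum(math.exp(v - m) for v in row))
    return [v - log_z for v in row]


def to_time_first(x, batch_first):
    """Return x as [time][batch][vocab], transposing only if it is batch-first."""
    if not batch_first:
        return x
    # zip(*x) groups the t-th step of every batch element together.
    return [list(step) for step in zip(*x)]


def normalized_log_probs(x, batch_first):
    """Hypothetical helper: time-first log-probabilities for a CTC-style loss."""
    time_first = to_time_first(x, batch_first)
    return [[log_softmax(row) for row in step] for step in time_first]


if __name__ == "__main__":
    # 3 batch elements, 2 time steps, vocabulary of size 2.
    batch_first_logits = [
        [[0.0, 0.0], [1.0, 1.0]],
        [[2.0, 2.0], [3.0, 3.0]],
        [[4.0, 4.0], [5.0, 5.0]],
    ]
    out = normalized_log_probs(batch_first_logits, batch_first=True)
    print(len(out), len(out[0]), len(out[0][0]))
```

Normalizing to a single time-first layout before the loss lets one criterion serve both encoder types without branching inside the loss computation itself.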
      
      TODO:
      CTC still doesn't work for VGGLSTMEncoderModel, so I have disabled it for that model. I will debug and send out a fix in another diff.
      
      Reviewed By: jay-mahadeokar
      
      Differential Revision: D15158818
      
      fbshipit-source-id: acb484bad705c937d676d2c3dcde3e3562d68ed9
  8. 01 May, 2019 5 commits
  9. 30 Apr, 2019 6 commits
  10. 29 Apr, 2019 1 commit