1. 10 Apr, 2019 3 commits
  2. 09 Apr, 2019 2 commits
  3. 07 Apr, 2019 1 commit
    • Haoran Li's avatar
      move distributed_init after get_batch_iterator · 34028c63
      Haoran Li authored
      Summary: There are constantly wait timeout issue for using multiple nodes, even setting copylocallytempdir:/ doesn't help, eg f105637629. It seems to be working after I moved distributed_init after get_batch_iterator, eg f106520580
      
      Reviewed By: myleott
      
      Differential Revision: D14817769
      
      fbshipit-source-id: edbb101a28d8082241c7bdd8c5500c9dad27647c
      34028c63
  4. 05 Apr, 2019 3 commits
  5. 04 Apr, 2019 1 commit
    • Jay Mahadeokar's avatar
      aligned training task and CE related changes · 3658fa32
      Jay Mahadeokar authored
      Summary:
      This diff adds:
      
      1. Aligned training task specifically for doing cross entropy criterion training using prod data and prod like models
      2. Few changes to correctly register the task and criterions.
      3. Changes to trainer code for propogating accuracy metrics which we care about for training.
      
      Couple of things are hacky right now:
      - The reporting is not modular (this needs to be thought about in general for fairseq).
      
      - The get dummy batch could be specific to task instead of specific for dataset.
      
      Reviewed By: myleott
      
      Differential Revision: D14670482
      
      fbshipit-source-id: dc077247b2ae9d26a8e842a386ec5faa5771e836
      3658fa32
  6. 03 Apr, 2019 2 commits
  7. 02 Apr, 2019 3 commits
  8. 29 Mar, 2019 5 commits
  9. 26 Mar, 2019 1 commit
  10. 19 Mar, 2019 2 commits
  11. 16 Mar, 2019 1 commit
  12. 15 Mar, 2019 1 commit
    • Myle Ott's avatar
      0.6.1 -> 0.6.2 (#577) · e6422528
      Myle Ott authored
      Summary:
      Changelog:
      - 998ba4f: Add language models from Baevski & Auli (2018)
      - 4294c4f6: Add mixture of experts code from Shen et al. (2019)
      - 00493490: Add example for multilingual training
      - 48d9afbe: Speed improvements, including fused operators from apex
      - 44d27e64: Add Tensorboard support
      - d17fa851: Add Adadelta optimizer
      - 9e1c880f: Add `FairseqEncoderModel`
      - b65c579b: Add `FairseqTask.inference_step` to modularize generate.py
      - 2ad1178e: Add back `--curriculum`
      - Misc bug fixes and other features
      
      Pull Request resolved: https://github.com/pytorch/fairseq/pull/577
      
      Differential Revision: D14481233
      
      Pulled By: myleott
      
      fbshipit-source-id: 4ff8625ef1c0b24273fc65df7c5658e3c932e8b7
      e6422528
  13. 14 Mar, 2019 2 commits
  14. 13 Mar, 2019 1 commit
  15. 12 Mar, 2019 3 commits
    • Dmytro Okhonko's avatar
      Handle 3+ dimensional input in sequence_generator + nits · 860010e9
      Dmytro Okhonko authored
      Summary: sequence_generator assumes that model input is 2d tensor of longs. But it can be something like 3d tensor of floats and we should be able to handle this as long as first dimension is batch size followed by source lengths.
      
      Reviewed By: myleott
      
      Differential Revision: D14420044
      
      fbshipit-source-id: bf8b1e42ad1873f7b803c1a377b0af21648db015
      860010e9
    • Dmytro Okhonko's avatar
      Adadelta optimizer · d17fa851
      Dmytro Okhonko authored
      Summary: Adding Adadelta optimizer to fairseq as wrapper around torch.optim.Adadelta
      
      Reviewed By: myleott
      
      Differential Revision: D14418635
      
      fbshipit-source-id: 6bf5ec008e905a4a2cbf7415e9492f5eea3ff07f
      d17fa851
    • Dmytro Okhonko's avatar
      FairseqEncoderModel · 9e1c880f
      Dmytro Okhonko authored
      Summary: Base class for encoder-only models. Some models doesn't have decoder part.
      
      Reviewed By: myleott
      
      Differential Revision: D14413406
      
      fbshipit-source-id: f36473b91dcf3c835fd6d50e2eb6002afa75f11a
      9e1c880f
  16. 11 Mar, 2019 2 commits
    • Matt Le's avatar
      Create fairseq_cli_lib · 7fc9a3be
      Matt Le authored
      Summary: This allows one to call fairseq_cli functions from within python without dispatching to bash.
      
      Reviewed By: myleott
      
      Differential Revision: D14404719
      
      fbshipit-source-id: 044eb652045bb15fc40e72ecbaf6fb10df9f8c61
      7fc9a3be
    • Jose Fonollosa's avatar
      Add missing parentheses in regex expression (#567) · fef4e002
      Jose Fonollosa authored
      Summary:
      The regex pattern without parentheses is not correct. The checkpoints are not sorted in descending order
      Pull Request resolved: https://github.com/pytorch/fairseq/pull/567
      
      Differential Revision: D14404380
      
      Pulled By: myleott
      
      fbshipit-source-id: 98cd0cfa8c92b78a03ffbb94840bc0f7a118eca1
      fef4e002
  17. 04 Mar, 2019 2 commits
  18. 02 Mar, 2019 1 commit
  19. 01 Mar, 2019 4 commits