- 02 Oct, 2018 2 commits
-
-
Michael Auli authored
Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/300
Differential Revision: D10154711
Pulled By: edunov
fbshipit-source-id: 859d1ac59923b67c1547b6f7acb94f801b0c3318
-
Liezl Puzon authored
Summary: Using argparse Namespace hides the actual args that are expected and makes code harder to read. Note the difference in style for the args list:

    def __init__(
        self,
        tgt_dataset,
        tgt_dict,
        backtranslation_model,
        unkpen,
        sampling,
        beam,
        max_len_a,
        max_len_b,
    ):

instead of

    def __init__(
        self, tgt_dataset, tgt_dict, backtranslation_model, unkpen,
        sampling, beam, max_len_a, max_len_b,
    ):

Reviewed By: dpacgopinath
Differential Revision: D10152331
fbshipit-source-id: 6539ccba09d48acf23759996b7e32fb329b3e3f6
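As a rough illustration of the point above (a minimal sketch; the class names below are hypothetical and the real BacktranslationDataset may store these fields differently), compare a constructor that names its arguments explicitly with one that unpacks them from an argparse Namespace:

    import argparse

    class ExplicitArgsDataset:
        # Explicit parameters: the expected arguments are visible in the signature.
        def __init__(self, tgt_dataset, tgt_dict, backtranslation_model,
                     unkpen, sampling, beam, max_len_a, max_len_b):
            self.tgt_dataset = tgt_dataset
            self.tgt_dict = tgt_dict
            self.backtranslation_model = backtranslation_model
            self.unkpen = unkpen
            self.sampling = sampling
            self.beam = beam
            self.max_len_a = max_len_a
            self.max_len_b = max_len_b

    class NamespaceArgsDataset:
        # Namespace-based: callers must read the body to learn which fields args needs.
        def __init__(self, tgt_dataset, tgt_dict, backtranslation_model, args: argparse.Namespace):
            self.tgt_dataset = tgt_dataset
            self.tgt_dict = tgt_dict
            self.backtranslation_model = backtranslation_model
            self.unkpen = args.unkpen
            self.sampling = args.sampling
            self.beam = args.beam
            self.max_len_a = args.max_len_a
            self.max_len_b = args.max_len_b

With explicit parameters, a missing or misspelled argument fails at the call site; with a Namespace, it only surfaces later as an AttributeError when the field is accessed.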
-
- 01 Oct, 2018 1 commit
-
-
alexeib authored
Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/296
Differential Revision: D10121830
Pulled By: alexeib
fbshipit-source-id: 1b73430bdfdcb20a9a6123abfca3472a0d307b3b
-
- 30 Sep, 2018 3 commits
-
-
Myle Ott authored
Summary: Changelog:
- `90f52a1`: Support loading subsets of the data on each worker with the `--fix-batches-to-gpus` flag. This should fix #217 and #266.
- `6eda0a9`: Update README for replicating the "Scaling Neural Machine Translation" paper
- `b14c7cf`: Fallback to no_c10d backend for pytorch 0.4.1 (fixes #294)
Pull Request resolved: https://github.com/pytorch/fairseq/pull/295
Differential Revision: D10121559
Pulled By: myleott
fbshipit-source-id: 41c84d0ee4cdd113544b5d3aa38ae8b23acc2c27
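The `--fix-batches-to-gpus` flag above is a training option. A minimal sketch of passing it to fairseq's train.py, driven from Python only to keep the example in one language; the data directory, architecture, and world size are placeholders, not values taken from this changelog:

    import subprocess

    # Hypothetical invocation; paths and hyperparameters below are placeholders.
    subprocess.run([
        "python", "train.py", "data-bin/wmt16_en_de",
        "--arch", "transformer_wmt_en_de",
        "--fix-batches-to-gpus",            # each worker keeps loading the same subset of batches
        "--distributed-world-size", "8",
    ], check=True)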
-
myleott authored
-
myleott authored
-
- 25 Sep, 2018 18 commits
-
-
Myle Ott authored
Co-authored-by: liezl200 <lie@fb.com>
-
Sergey Edunov authored
-
Myle Ott authored
-
alexeib authored
-
Alexei Baevski authored
-
Myle Ott authored
-
Myle Ott authored
-
Sergey Edunov authored
-
Sergey Edunov authored
-
Myle Ott authored
-
Myle Ott authored
-
Stephen Roller authored
-
Myle Ott authored
-
Myle Ott authored
-
Sergey Edunov authored
- no more FP16Trainer, we just have an FP16Optimizer wrapper
- most of the distributed code is moved to a new wrapper class called DistributedFairseqModel, which behaves like DistributedDataParallel and a FairseqModel at the same time
- Trainer now requires an extra dummy_batch argument at initialization, which we do fwd/bwd on when there's an uneven number of batches per worker. We hide the gradients from these dummy batches by multiplying the loss by 0
- Trainer.train_step now takes a list of samples, which will allow cleaner --update-freq
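The dummy-batch behavior described above can be illustrated with a small stand-alone sketch (plain PyTorch, not the actual Trainer code; the model, criterion, and batch shapes are made up): when a worker has fewer real batches than its peers, it still runs a forward/backward pass so all workers stay in lockstep, but the loss is multiplied by 0 so the dummy batch contributes no gradient.

    import torch
    import torch.nn as nn

    model = nn.Linear(16, 4)                 # stand-in for a FairseqModel
    criterion = nn.CrossEntropyLoss()

    def train_step(batch, is_dummy=False):
        # Forward/backward on one batch; dummy batches still run both passes
        # but their loss is scaled by 0, so they add nothing to the gradient.
        logits = model(batch["inputs"])
        loss = criterion(logits, batch["targets"])
        if is_dummy:
            loss = loss * 0.0
        loss.backward()
        return loss.item()

    real = {"inputs": torch.randn(8, 16), "targets": torch.randint(0, 4, (8,))}
    dummy = {"inputs": torch.randn(8, 16), "targets": torch.randint(0, 4, (8,))}

    train_step(real)                          # normal contribution to the gradient
    train_step(dummy, is_dummy=True)          # runs fwd/bwd but accumulates zero gradient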
-
Myle Ott authored
-
Stephen Roller authored
-
Stephen Roller authored
-
- 24 Sep, 2018 2 commits
-
-
Sergey Edunov authored
Update readme with WMT'18 model (#433)
-
Sergey Edunov authored
-
- 18 Sep, 2018 4 commits
-
-
Sergey Edunov authored
Oss master
-
Sergey Edunov authored
-
Sergey Edunov authored
-
Sergey Edunov authored
-
- 07 Sep, 2018 1 commit
-
-
Angela Fan authored
-
- 04 Sep, 2018 1 commit
-
-
Myle Ott authored
-
- 03 Sep, 2018 8 commits