Commits · d3795d6cd1c66ac05dc0f4861ce69ab4680bff3d · OpenDAS / Fairseq

02 Apr, 2018 1 commit

Merge internal changes (#136) · d3795d6c

Myle Ott authored Apr 02, 2018

Changes:
- 7d19e36: Add `--sampling` flag to generate.py to sample instead of doing beam search
- c777340: Add `scripts/average_checkpoints.py` to average multiple checkpoints into a combined model
- 3ea882c: Add `--max-update` option to train.py to stop training after a given number of updates
- small bugfixes for distributed training, LSTM, inverse square root LR scheduler

d3795d6c

28 Mar, 2018 4 commits

Merge pull request #134 from hitvoice/master · 48836525
Sergey Edunov authored Mar 28, 2018
```
Update training commands
```
48836525
Update training command for IWSLT14 · 0a141e3f
Runqi Yang authored Mar 29, 2018
```
specify a single GPU setup for IWSLT14
```
0a141e3f

Update training commands · 435ed351

Runqi Yang authored Mar 28, 2018

Update training commands in data/README to match the latest version of this project according to #132.

Continue from 3c072958: add omitted "\".

435ed351

Update training commands · 3c072958

Runqi Yang authored Mar 28, 2018

Update training commands in data/README to match the latest version of this project according to #132.

- Motivation: in the previous data/README, the commands are obsolete and will cause the error "unrecognized arguments: --label-smoothing 0.1 --force-anneal 50". 
- What's changed: add arguments "--criterion label_smoothed_cross_entropy" and "--lr-scheduler fixed" to the training commands of all 3 datasets.
- Result: the new commands run without error on all 3 datasets.

3c072958

27 Mar, 2018 1 commit
- Merge remote-tracking branch 'upstream/master' · 4972056e
  杨润琦 authored Mar 28, 2018
  
  4972056e
26 Mar, 2018 1 commit
- fix typo in data/README (#131) · 6268f20e
  Runqi Yang authored Mar 26, 2018
```
Change "awailable" to "available".
```
  6268f20e
25 Mar, 2018 1 commit
- fix typo in data/README · 261d1822
  Runqi Yang authored Mar 25, 2018
```
Change "awailable" to "available".
```
  261d1822
07 Mar, 2018 1 commit
- Enforce upper-bound on maximum generation length (#121) · 49aeab2d
  Myle Ott authored Mar 07, 2018
  
  49aeab2d
05 Mar, 2018 4 commits
- Merge pull request #116 from facebookresearch/oss-merge-internal · cbaf59d4
  Sergey Edunov authored Mar 05, 2018
```
Oss merge internal
```
  cbaf59d4
- Allow more flexible pre-processing and generation (#227) · b03b53b4
  Sergey Edunov authored Mar 05, 2018
```
* Allow more flexible pre-processing and generation

* Addressing CR comments

* small fix
```
  b03b53b4
- Filter padding properly in LabelSmoothedCrossEntropyCriterion (#229) · e73fddf4
  Myle Ott authored Mar 04, 2018
  
  e73fddf4
- Small fixes · 5f29d123
  Myle Ott authored Mar 02, 2018
  
  5f29d123
02 Mar, 2018 1 commit
- Use ATen built-in conv_tbc method (#66) · 56f9ec3c
  James Reed authored Mar 01, 2018
```
Remove custom ConvTBC code
```
  56f9ec3c
01 Mar, 2018 2 commits
- More updates for PyTorch (#114) · 6e4d370a
  Myle Ott authored Mar 01, 2018
  
  6e4d370a
- More fixes for recent PyTorch (incl. topk issue) (#113) · 3bde773d
  Myle Ott authored Mar 01, 2018
  
  3bde773d
27 Feb, 2018 10 commits
- Merge pull request #107 from facebookresearch/oss-merge-internal · 21b8fb5c
  Sergey Edunov authored Feb 27, 2018
```
Oss merge internal changes
```
  21b8fb5c
- Making our code compatible with the latest pytorch (#223) · 2f976aae
  Sergey Edunov authored Feb 27, 2018
```
* Making our code compatible with the latest pytorch

* revert

* torch.nn.utils.clip_grad_norm now returns tensor
```
  2f976aae
- Refactor incremental generation to be more explicit and less magical (#222) · 9438019f
  Myle Ott authored Feb 24, 2018
  
  9438019f
- Fix LabelSmoothedCrossEntropy test · e7094b14
  Myle Ott authored Feb 23, 2018
  
  e7094b14
- pytorch update: no need to rewrap variable in backward() · 78a6ef02
  Myle Ott authored Feb 23, 2018
  
  78a6ef02
- Add support to prefixes (#221) · 866b27d5
  Dario Pavllo authored Feb 23, 2018
```
* Add prefix

* Fixes

* Keep original scores with prefix

* Improve prefix code

* Replace 'repeat' with 'expand'
```
  866b27d5
- More unit test fixes · 0d90e35f
  Myle Ott authored Feb 15, 2018
  
  0d90e35f
- Fix tests and flake8 · 29c82741
  Myle Ott authored Feb 15, 2018
  
  29c82741
- Add OOM counter back to logging output · b9f2d427
  Myle Ott authored Feb 14, 2018
  
  b9f2d427
- fairseq-py goes distributed (#106) · 66415206
  Myle Ott authored Feb 27, 2018
```
This PR includes breaking API changes to modularize fairseq-py and adds support for distributed training across multiple nodes.

Changes:
- c7033ef: add support for distributed training! See updated README for usage.
- e016299: modularize fairseq-py, adding support for register_model, register_criterion, register_optimizer, etc.
- 154e440: update LSTM implementation to use PackedSequence objects in the encoder, better following best practices and improving perf
- 90c2973 and 1da6265: improve unit test coverage
```
  66415206
12 Feb, 2018 1 commit
- Allow larger maxlen (fixes #100) (#101) · 7e86e30c
  Myle Ott authored Feb 12, 2018
  
  7e86e30c
09 Feb, 2018 1 commit
- Adjust weight decay by the current learning rate to make it work correctly during annealing · 9a951216
  Sergey Edunov authored Feb 08, 2018
  
  9a951216
31 Jan, 2018 5 commits
- Merge pull request #91 from facebookresearch/prepare_wmt · e4c935aa
  Sergey Edunov authored Jan 31, 2018
```
Prepare scripts for WMT14 (#88)
```
  e4c935aa
- spelling · 52b6119a
  Sergey Edunov authored Jan 31, 2018
  
  52b6119a
- Update README with new models · 2c18c273
  Sergey Edunov authored Jan 31, 2018
  
  2c18c273
- Merge pull request #95 from bastings/patch-1 · fb366144
  Sergey Edunov authored Jan 31, 2018
```
BLEU ratio should be predlen/reflen not reflen/predlen
```
  fb366144
- Adding README and more parameters to En2De script · 971c2d63
  Sergey Edunov authored Jan 31, 2018
  
  971c2d63
29 Jan, 2018 1 commit
- Ratio should be predlen/reflen not reflen/predlen · 1ff3efce
  Joost Bastings authored Jan 29, 2018
```
To be compatible with multi-bleu.
This seems to only affect the result_string.
```
  1ff3efce
27 Jan, 2018 2 commits
- Merge branch 'master' of github.com:facebookresearch/fairseq-py into prepare_wmt · d9f46c54
  Sergey Edunov authored Jan 26, 2018
  
  d9f46c54
- Switch to news-commentary-v12 · 4185d3ed
  Sergey Edunov authored Jan 26, 2018
  
  4185d3ed
22 Jan, 2018 4 commits
- Fixed Weight Decay Regularization in Adam · ee36a6f3
  Michael Auli authored Jan 19, 2018
```
See https://arxiv.org/abs/1711.05101
```
  ee36a6f3
- Fix tests · 66d9fcf5
  Myle Ott authored Jan 22, 2018
  
  66d9fcf5
- Output correct perplexity when training with --sentence-avg · f9362e87
  Myle Ott authored Jan 19, 2018
  
  f9362e87
- Fix max_positions calculation in train.py · 81ace092
  Myle Ott authored Jan 19, 2018
  
  81ace092