- 02 Apr, 2018 1 commit
-
-
Myle Ott authored
Changes: - 7d19e36: Add `--sampling` flag to generate.py to sample instead of doing beam search - c777340: Add `scripts/average_checkpoints.py` to average multiple checkpoints into a combined model - 3ea882c: Add `--max-update` option to train.py to stop training after a given number of updates - small bugfixes for distributed training, LSTM, inverse square root LR scheduler
-
- 28 Mar, 2018 4 commits
-
-
Sergey Edunov authored
Update training commands
-
Runqi Yang authored
specify a single GPU setup for IWSLT14
-
Runqi Yang authored
Update training commands in data/README to match the latest version of this project according to #132. Continue from 3c072958: add omitted "\".
-
Runqi Yang authored
Update training commands in data/README to match the latest version of this project according to #132. - Motivation: in the previous data/README, the commands are obsolete and will cause the error "unrecognized arguments: --label-smoothing 0.1 --force-anneal 50". - What's changed: add arguments "--criterion label_smoothed_cross_entropy" and "--lr-scheduler fixed" to the training commands of all 3 datasets. - Result: the new commands run without error on all 3 datasets.
-
- 27 Mar, 2018 1 commit
-
-
杨润琦 authored
-
- 26 Mar, 2018 1 commit
-
-
Runqi Yang authored
Change "awailable" to "available".
-
- 25 Mar, 2018 1 commit
-
-
Runqi Yang authored
Change "awailable" to "available".
-
- 07 Mar, 2018 1 commit
-
-
Myle Ott authored
-
- 05 Mar, 2018 4 commits
-
-
Sergey Edunov authored
Oss merge internal
-
Sergey Edunov authored
* Allow more flexible pre-processing and generation * Addressing CR comments * small fix
-
Myle Ott authored
-
Myle Ott authored
-
- 02 Mar, 2018 1 commit
-
-
James Reed authored
Remove custom ConvTBC code
-
- 01 Mar, 2018 2 commits
- 27 Feb, 2018 10 commits
-
-
Sergey Edunov authored
Oss merge internal changes
-
Sergey Edunov authored
* Making our code compatible with the latest pytorch * revert * torch.nn.utils.clip_grad_norm now returns tensor
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
-
Dario Pavllo authored
* Add prefix * Fixes * Keep original scores with prefix * Improve prefix code * Replace 'repeat' with 'expand'
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
This PR includes breaking API changes to modularize fairseq-py and adds support for distributed training across multiple nodes. Changes: - c7033ef: add support for distributed training! See updated README for usage. - e016299: modularize fairseq-py, adding support for register_model, register_criterion, register_optimizer, etc. - 154e440: update LSTM implementation to use PackedSequence objects in the encoder, better following best practices and improving perf - 90c2973 and 1da6265: improve unit test coverage
-
- 12 Feb, 2018 1 commit
-
-
Myle Ott authored
-
- 09 Feb, 2018 1 commit
-
-
Sergey Edunov authored
-
- 31 Jan, 2018 5 commits
-
-
Sergey Edunov authored
Prepare scripts for WMT14 (#88)
-
Sergey Edunov authored
-
Sergey Edunov authored
-
Sergey Edunov authored
BLEU ratio should be predlen/reflen not reflen/predlen
-
Sergey Edunov authored
-
- 29 Jan, 2018 1 commit
-
-
Joost Bastings authored
To be compatible with multi-bleu. This seems to only affect the result_string.
-
- 27 Jan, 2018 2 commits
-
-
Sergey Edunov authored
-
Sergey Edunov authored
-
- 22 Jan, 2018 4 commits
-
-
Michael Auli authored
See https://arxiv.org/abs/1711.05101
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
-