- 15 Jun, 2018 25 commits
-
-
Myle Ott authored
This reverts commit b2e119c209363e6ff6d2878a69c7d1a507a2e9be.
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
Improve dataloader speed and deprecate concept of batch_offset (use --sample-without-replacement instead)
-
Sergey Edunov authored
-
Myle Ott authored
-
Sergey Edunov authored
-
Myle Ott authored
-
Myle Ott authored
-
alexeib authored
-
Alexei Baevski authored
- process only last time step during generation - cache keys and values - dont apply masking during generation
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
-
alexeib authored
-
alexeib authored
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
-
Myle Ott authored
-
- 24 May, 2018 1 commit
-
-
Myle Ott authored
-
- 22 May, 2018 1 commit
-
-
- 21 May, 2018 1 commit
-
-
Myle Ott authored
-
- 09 May, 2018 3 commits
- 01 May, 2018 3 commits
- 02 Apr, 2018 1 commit
-
-
Myle Ott authored
Changes: - 7d19e36: Add `--sampling` flag to generate.py to sample instead of doing beam search - c777340: Add `scripts/average_checkpoints.py` to average multiple checkpoints into a combined model - 3ea882c: Add `--max-update` option to train.py to stop training after a given number of updates - small bugfixes for distributed training, LSTM, inverse square root LR scheduler
-
- 28 Mar, 2018 4 commits
-
-
Sergey Edunov authored
Update training commands
-
Runqi Yang authored
specify a single GPU setup for IWSLT14
-
Runqi Yang authored
Update training commands in data/README to match the latest version of this project according to #132. Continue from 3c072958: add omitted "\".
-
Runqi Yang authored
Update training commands in data/README to match the latest version of this project according to #132. - Motivation: in the previous data/README, the commands are obsolete and will cause the error "unrecognized arguments: --label-smoothing 0.1 --force-anneal 50". - What's changed: add arguments "--criterion label_smoothed_cross_entropy" and "--lr-scheduler fixed" to the training commands of all 3 datasets. - Result: the new commands run without error on all 3 datasets.
-
- 27 Mar, 2018 1 commit
-
-
杨润琦 authored
-