• Myle Ott's avatar
    Merge internal changes (#136) · d3795d6c
    Myle Ott authored
    Changes:
    - 7d19e36: Add `--sampling` flag to generate.py to sample instead of doing beam search
    - c777340: Add `scripts/average_checkpoints.py` to average multiple checkpoints into a combined model
    - 3ea882c: Add `--max-update` option to train.py to stop training after a given number of updates
    - small bugfixes for distributed training, LSTM, inverse square root LR scheduler
    d3795d6c
singleprocess_train.py 10.9 KB