- 22 Jan, 2018 17 commits
-
-
Myle Ott authored
-
- 05 Jan, 2018 1 commit
-
-
Yann N. Dauphin authored
See https://arxiv.org/pdf/1711.05101.pdf (Loshchilov & Hutter, on decoupling weight decay from the gradient-based update in Adam)
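The linked paper argues that weight decay should be applied directly to the weights rather than folded into the gradient, so Adam's adaptive learning rates do not rescale the decay term. A minimal pure-Python sketch of a single Adam step in both variants (illustration only; not fairseq's actual optimizer code — names and defaults here are hypothetical):

```python
import math

def adam_step(w, grad, m, v, t, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8,
              weight_decay=0.01, decoupled=True):
    """One Adam update for a scalar weight w at step t (t starts at 1)."""
    if not decoupled:
        # Classic L2 regularization: decay enters through the gradient,
        # so it gets rescaled by the adaptive denominator below.
        grad = grad + weight_decay * w
    m = b1 * m + (1 - b1) * grad            # first-moment estimate
    v = b2 * v + (1 - b2) * grad * grad     # second-moment estimate
    m_hat = m / (1 - b1 ** t)               # bias correction
    v_hat = v / (1 - b2 ** t)
    w = w - lr * m_hat / (math.sqrt(v_hat) + eps)
    if decoupled:
        # Decoupled form: decay is a separate step on the weights,
        # independent of the adaptive scaling.
        w = w - lr * weight_decay * w
    return w, m, v
```

With plain SGD the two forms coincide; under Adam they produce different updates even from the same gradient, which is the paper's point.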
-
- 06 Dec, 2017 9 commits
- 02 Dec, 2017 1 commit
-
-
toothlessdragon authored
-
- 24 Nov, 2017 1 commit
-
-
Zrachel authored
-
- 13 Nov, 2017 9 commits
- 12 Nov, 2017 2 commits
-
-
Myle Ott authored
Release notes:
- 5c7f4954: Added a simple LSTM model with input feeding and attention
- 6e4b7e22: Refactored model definitions and incremental generation to be cleaner
- 7ae79c12: Split interactive generation out of generate.py and into a new binary: interactive.py
- 19a3865d: Subtle correctness fix in the beam search decoder. Previously, for a beam size of k, we might emit a hypothesis if the <eos> was among the top 2*k candidates. Now we only emit hypotheses for which the <eos> is among the top-k candidates. This may subtly change generation results, and in the case of k=1 we will now produce strictly greedy outputs.
- 97d7fcb9: Fixed a bug in padding direction, where previously we right-padded the source and left-padded the target. We now left-pad the source and right-pad the target. This should not affect existing trained models, but may change (usually improve) the quality of new models.
- f442f896: Added support for batching based on the number of sentences (`--max-sentences`) in addition to the number of tokens (`--max-tokens`). When batching by the number of sentences, one can optionally normalize the gradients by the number of sentences with `--sentence-avg` (the default is to normalize by the number of tokens).
- c6d6256b: Added a `--log-format` option and a JSON logger
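The beam search fix in 19a3865d can be sketched as a single predicate: with beam size k, a finished hypothesis is emitted only when <eos> ranks among the top-k candidate continuations, not merely the top 2*k. A hypothetical pure-Python illustration (not fairseq's decoder code):

```python
EOS = 0  # assumed token id for <eos> in this sketch

def can_emit_eos(scores, k):
    """Return True if <eos> is among the k highest-scoring candidates.

    scores: candidate scores indexed by token id; higher is better.
    """
    top_k = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    return EOS in top_k
```

For k=1 this reduces to emitting <eos> only when it is the single best candidate, which is why generation becomes strictly greedy in that case.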
-
Myle Ott authored
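The padding convention from 97d7fcb9 (left-pad the source, right-pad the target) can be illustrated with a minimal collate sketch; the function names and pad id here are hypothetical, not fairseq's actual data-loading API:

```python
PAD = 0  # assumed padding token id

def left_pad(seq, size):
    # Pad tokens go before the sequence, so source tokens end at the boundary.
    return [PAD] * (size - len(seq)) + list(seq)

def right_pad(seq, size):
    # Pad tokens go after the sequence, so target tokens begin at the boundary.
    return list(seq) + [PAD] * (size - len(seq))

def collate(sources, targets):
    """Batch variable-length pairs: left-pad sources, right-pad targets."""
    src_len = max(len(s) for s in sources)
    tgt_len = max(len(t) for t in targets)
    return ([left_pad(s, src_len) for s in sources],
            [right_pad(t, tgt_len) for t in targets])
```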
-