"tools/vscode:/vscode.git/clone" did not exist on "6776299322ae0c8108b34af947e6e62ccd61b5ca"
- 10 Dec, 2019 (28 commits)
  - thomwolf authored
  - thomwolf authored
  - Thomas Wolf authored: clean up PT <=> TF conversion
  - Rémi Louf authored
  - thomwolf authored
  - thomwolf authored
  - Thomas Wolf authored: [WIP] Squad refactor
  - Thomas Wolf authored: create encoder attention mask from shape of hidden states
  - thomwolf authored
  - Julien Chaumond authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored
  - Rémi Louf authored:
    We currently save the pretrained weights of the encoder and decoder in two separate directories, `encoder` and `decoder`. However, for the `from_pretrained` function to work with automodels, we need to specify the type of model in the path to the weights. The path to the encoder/decoder weights is handled by the `PreTrainedEncoderDecoder` class in its `save_pretrained` function. Since there is no easy way to infer the type of model that was initialized for the encoder and decoder, we add a `model_type` parameter to the function. This is not an ideal solution, as it is error-prone; the model type should somehow be carried by the model classes. This is a temporary fix that should be changed before merging.
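A minimal sketch of the scheme this commit describes, assuming a hypothetical layout in which `model_type` is embedded in the subdirectory names; the actual `PreTrainedEncoderDecoder.save_pretrained` may differ:

```python
import os

class PreTrainedEncoderDecoder:
    """Wrapper holding an encoder and a decoder model (simplified)."""

    def __init__(self, encoder, decoder):
        self.encoder = encoder
        self.decoder = decoder

    def save_pretrained(self, save_directory, model_type):
        # Hypothetical: prefix the subdirectories with `model_type`
        # (e.g. "bert") so that AutoModel.from_pretrained can later
        # infer the architecture from the path alone.
        encoder_dir = os.path.join(save_directory, model_type + "_encoder")
        decoder_dir = os.path.join(save_directory, model_type + "_decoder")
        os.makedirs(encoder_dir, exist_ok=True)
        os.makedirs(decoder_dir, exist_ok=True)
        self.encoder.save_pretrained(encoder_dir)
        self.decoder.save_pretrained(decoder_dir)
```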
  - Rémi Louf authored:
    Since I started my PR, the `add_special_token_single_sequence` function has been deprecated in favor of another; I replaced it with the new function.
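For illustration, a hedged before/after of such a substitution; the replacement name `build_inputs_with_special_tokens` is an assumption about the tokenizer API of that period, not something the commit states:

```python
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
ids = tokenizer.convert_tokens_to_ids(tokenizer.tokenize("Hello world"))

# Deprecated call of the kind the commit replaces:
# tokenizer.add_special_token_single_sequence(ids)

# Assumed replacement: wrap the ids with the model's special tokens,
# e.g. [CLS] ... [SEP] for BERT.
ids_with_special_tokens = tokenizer.build_inputs_with_special_tokens(ids)
```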
- 09 Dec, 2019 (12 commits)
  - Pierric Cistac authored
  - Bilal Khan authored
  - Bilal Khan authored
  - Bilal Khan authored
  - Bilal Khan authored
  - Bilal Khan authored
  - Bilal Khan authored
  - thomwolf authored
  - thomwolf authored
  - thomwolf authored
  - LysandreJik authored
  - Lysandre Debut authored