"test/vscode:/vscode.git/clone" did not exist on "f2ddea0a205a14c76063c876084942aa1d3e1b17"
11 Dec, 2019 (3 commits)

- Stefan Schweter authored
- Julien Chaumond authored
- LysandreJik authored

10 Dec, 2019 (27 commits)

- Thomas Wolf authored: Progress indicator improvements when downloading pre-trained models.
- Leo Dirac authored
- LysandreJik authored
- Lysandre authored
- Thomas Wolf authored: clean up PT <=> TF conversion
- Rémi Louf authored
- Thomas Wolf authored: [WIP] Squad refactor
- Thomas Wolf authored: create encoder attention mask from shape of hidden states
- Julien Chaumond authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored: We currently save the pretrained weights of the encoder and decoder in two separate directories, `encoder` and `decoder`. However, for the `from_pretrained` function to work with the auto models, we need to specify the type of model in the path to the weights. The path to the encoder/decoder weights is handled by the `PreTrainedEncoderDecoder` class in its `save_pretrained` function. Since there is no easy way to infer the type of model that was initialized for the encoder and decoder, we add a `model_type` parameter to the function. This is not an ideal solution, as it is error-prone; the model type should somehow be carried by the model classes themselves. This is a temporary fix that should be changed before merging.
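  To make the described layout concrete, here is a minimal sketch of what such a `save_pretrained` could look like. The `encoder`/`decoder` split and the `model_type` parameter come from the message above; the exact directory naming, the attribute names, and the assumption that both sub-models expose their own `save_pretrained` are illustrative, not the actual implementation.

  ```python
  import os

  class PreTrainedEncoderDecoder:
      """Sketch of an encoder-decoder wrapper; attribute names are assumed."""

      def __init__(self, encoder, decoder):
          self.encoder = encoder
          self.decoder = decoder

      def save_pretrained(self, save_directory, model_type):
          # Encoder and decoder weights are written to two separate
          # sub-directories. `model_type` (e.g. "bert") is baked into the path
          # so that an AutoModel-style `from_pretrained` can later tell which
          # architecture to instantiate from the path alone; as noted in the
          # commit message, this is error-prone and meant as a temporary fix.
          encoder_dir = os.path.join(save_directory, model_type, "encoder")
          decoder_dir = os.path.join(save_directory, model_type, "decoder")
          os.makedirs(encoder_dir, exist_ok=True)
          os.makedirs(decoder_dir, exist_ok=True)
          self.encoder.save_pretrained(encoder_dir)
          self.decoder.save_pretrained(decoder_dir)
  ```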
- Rémi Louf authored: Since I started my PR, the `add_special_token_single_sequence` function has been deprecated in favour of another; I replaced it with the new function.

09 Dec, 2019 (10 commits)

- Pierric Cistac authored
- Bilal Khan authored
- Bilal Khan authored
- Bilal Khan authored
- Bilal Khan authored
- Bilal Khan authored
- Bilal Khan authored
- LysandreJik authored
- Lysandre Debut authored
- Rémi Louf authored: We currently create encoder attention masks (when they are not provided) based on the shape of the inputs to the encoder. This is wrong, since sequences in a batch can have different lengths. We now create the encoder attention mask from the batch size and sequence length of the encoder hidden states.
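  As a rough illustration of the fix this commit message describes, a default mask can be derived from the hidden-states tensor itself. The helper name below is hypothetical and not part of the library's API; it only shows the shape-handling idea.

  ```python
  import torch

  def default_encoder_attention_mask(encoder_hidden_states: torch.Tensor) -> torch.Tensor:
      # Hypothetical helper: build an all-ones attention mask whose shape
      # (batch_size, sequence_length) is taken from the encoder hidden states
      # rather than from the encoder inputs, as described in the commit message.
      batch_size, sequence_length = encoder_hidden_states.shape[:2]
      return torch.ones(batch_size, sequence_length, device=encoder_hidden_states.device)
  ```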