- 10 Dec, 2019 23 commits

- thomwolf authored
- Thomas Wolf authored: [WIP] Squad refactor
- Thomas Wolf authored: create encoder attention mask from shape of hidden states
- thomwolf authored
- Julien Chaumond authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
- Rémi Louf authored
We currently save the pretrained weights of the encoder and the decoder in two separate directories, `encoder` and `decoder`. However, for the `from_pretrained` function to work with automodels, we need to specify the type of model in the path to the weights. The path to the encoder/decoder weights is handled by the `PreTrainedEncoderDecoder` class in its `save_pretrained` function. Since there is no easy way to infer the type of model that was used to initialize the encoder and the decoder, we add a `model_type` parameter to the function. This is not an ideal solution, as it is error prone, and the model type should eventually be carried by the model classes themselves. It is a temporary fix that should be changed before merging.
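For illustration only, here is a minimal sketch of how such a `model_type` argument could be threaded into the saving logic so the model type shows up in the weight paths. The directory naming scheme and the `self.encoder` / `self.decoder` attribute names are assumptions made for this example, not the exact implementation in the PR.

```python
import os


class PreTrainedEncoderDecoder:
    # Only the saving logic is sketched here; the rest of the class is omitted.

    def save_pretrained(self, save_directory, model_type="bert"):
        """Save encoder and decoder weights in two sub-directories.

        `model_type` is included in the sub-directory names so that an
        automodel's `from_pretrained` can later infer the architecture
        from the path (e.g. `my_model/bert_encoder`).
        """
        encoder_dir = os.path.join(save_directory, model_type + "_encoder")
        decoder_dir = os.path.join(save_directory, model_type + "_decoder")
        os.makedirs(encoder_dir, exist_ok=True)
        os.makedirs(decoder_dir, exist_ok=True)

        # Each sub-model is a regular pretrained model that serializes itself.
        self.encoder.save_pretrained(encoder_dir)
        self.decoder.save_pretrained(decoder_dir)
```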
- Rémi Louf authored
Since I started my PR, the `add_special_token_single_sequence` function has been deprecated in favor of another; I replaced it with the new function.
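As a rough sketch of the kind of substitution this refers to, the replacement is assumed here to be `build_inputs_with_special_tokens` (present on current tokenizers); the exact function used in the PR may differ.

```python
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
ids = tokenizer.convert_tokens_to_ids(tokenizer.tokenize("Hello world"))

# Old, deprecated helper originally used in the PR:
# ids_with_special = tokenizer.add_special_token_single_sequence(ids)

# Assumed replacement: adds the model-specific special tokens
# (e.g. [CLS] ... [SEP] for BERT) around a single sequence.
ids_with_special = tokenizer.build_inputs_with_special_tokens(ids)
```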

- 09 Dec, 2019 14 commits

- Pierric Cistac authored
- Bilal Khan authored
- Bilal Khan authored
- Bilal Khan authored
- Bilal Khan authored
- Bilal Khan authored
- Bilal Khan authored
- thomwolf authored
- thomwolf authored
- thomwolf authored
- LysandreJik authored
- Lysandre Debut authored
- thomwolf authored
- Rémi Louf authored
We currently create encoder attention masks (when they are not provided) based on the shape of the inputs to the encoder. This is wrong, as sequences can be of different lengths. We now create the encoder attention mask based on the `batch_size` and `sequence_length` of the encoder hidden states.
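A minimal sketch of the idea follows; the helper name and its placement in the model code are assumptions for illustration, not the PR's actual code.

```python
import torch


def default_encoder_attention_mask(encoder_hidden_states):
    """Build an all-ones attention mask that matches the encoder outputs.

    The mask is derived from the (batch_size, sequence_length) of the
    hidden states themselves, so it always has the right length even
    when the other inputs have a different sequence length.
    """
    batch_size, sequence_length = encoder_hidden_states.shape[:2]
    return torch.ones(
        batch_size,
        sequence_length,
        dtype=torch.long,
        device=encoder_hidden_states.device,
    )
```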

- 07 Dec, 2019 1 commit

- Aymeric Augustin authored

- 06 Dec, 2019 2 commits

- Michael Watkins authored
- Aymeric Augustin authored
* Switch to plain unittest for skipping slow tests. Add a RUN_SLOW environment variable for running them.
* Switch to plain unittest for the PyTorch dependency.
* Switch to plain unittest for the TensorFlow dependency.
* Avoid leaking open files in the test suite. This prevents spurious warnings when running tests.
* Fix a unicode warning on Python 2 when running tests. The warning was: `UnicodeWarning: Unicode equal comparison failed to convert both arguments to Unicode - interpreting them as being unequal`
* Support running PyTorch tests on a GPU. Reverts 27e015bd.
* Tests no longer require pytest.
* Make tests pass on CUDA.
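For illustration, a minimal sketch of the plain-unittest skipping pattern described above; the decorator name and the exact handling of the RUN_SLOW variable are assumptions and may not match the repository's helpers.

```python
import os
import unittest


def slow(test_case):
    """Skip the decorated test unless RUN_SLOW is set to a truthy value."""
    if os.environ.get("RUN_SLOW", "").lower() not in ("1", "true", "yes"):
        return unittest.skip("slow test; set RUN_SLOW=1 to run it")(test_case)
    return test_case


class ExampleModelTest(unittest.TestCase):
    @slow
    def test_full_pretrained_forward(self):
        # Placeholder for an expensive end-to-end check.
        self.assertTrue(True)


if __name__ == "__main__":
    unittest.main()
```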