- 10 Dec, 2019 13 commits
-
-
Rémi Louf authored
-
Rémi Louf authored
-
Rémi Louf authored
-
Rémi Louf authored
-
Rémi Louf authored
-
Rémi Louf authored
-
Rémi Louf authored
-
Rémi Louf authored
-
Rémi Louf authored
-
Rémi Louf authored
-
Rémi Louf authored
-
Rémi Louf authored
We currently save the pretrained weights of the encoder and decoder in two separate directories, `encoder` and `decoder`. However, for the `from_pretrained` function to work with automodels we need to specify the type of model in the path to the weights. The path to the encoder/decoder weights is handled by the `PreTrainedEncoderDecoder` class in the `save_pretrained` function. Since there is no easy way to infer the type of model that was initialized for the encoder and decoder, we add a `model_type` parameter to the function. This is not an ideal solution as it is error prone, and the model type should really be carried by the model classes somehow. This is a temporary fix that should be changed before merging.
-
Rémi Louf authored
Since I started my PR, the `add_special_token_single_sequence` function has been deprecated in favor of another; I replaced it with the new function.
-
- 09 Dec, 2019 6 commits
-
-
Bilal Khan authored
-
Bilal Khan authored
-
Bilal Khan authored
-
Bilal Khan authored
-
Bilal Khan authored
-
Bilal Khan authored
-
- 05 Dec, 2019 5 commits
-
-
VictorSanh authored
-
VictorSanh authored
-
Rosanne Liu authored
* license
* changes
* ok
* Update paper link and commands to run
* pointer to uber repo
-
Julien Plu authored
-
thomwolf authored
-
- 04 Dec, 2019 2 commits
-
-
thomwolf authored
-
Julien Plu authored
Create a NER example similar to the Pytorch one. It takes the same options, and can be run the same way.
-
- 03 Dec, 2019 14 commits
-
-
Julien Chaumond authored
-
VictorSanh authored
-
Julien Chaumond authored
Co-Authored-By: Piero Molino <w4nderlust@gmail.com>
-
Ethan Perez authored
When evaluating, shouldn't we always use the SequentialSampler instead of DistributedSampler? Evaluation only runs on 1 GPU no matter what, so if you use the DistributedSampler with N GPUs, I think you'll only evaluate on 1/N of the evaluation set. That's at least what I'm finding when I run an older/modified version of this repo.
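The point about shard coverage can be illustrated without torch at all. The sketch below mimics the round-robin sharding a distributed sampler performs (padding omitted): a single evaluating process at rank 0 of N sees only 1/N of the examples, whereas a sequential sampler yields them all. The function names are illustrative, not the torch API.

```python
def sequential_indices(n):
    """All indices in order, like torch's SequentialSampler."""
    return list(range(n))

def distributed_indices(n, num_replicas, rank):
    """One replica's shard, mimicking DistributedSampler's
    round-robin assignment (no padding shown)."""
    return list(range(rank, n, num_replicas))

full = sequential_indices(8)          # every example, exactly once
shard = distributed_indices(8, 4, 0)  # rank 0 of 4 sees only 1/4 of them
```

Only the union of all ranks' shards covers the dataset, which is why evaluating on a single process with a distributed sampler silently drops data.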
-
Julien Chaumond authored
-
Julien Chaumond authored
Co-Authored-By: Rosanne Liu <mimosavvy@gmail.com>
-
Julien Chaumond authored
-
Piero Molino authored
-
w4nderlust authored
-
w4nderlust authored
-
w4nderlust authored
Improvements: `model_path` renamed to `pretrained_model`; the tokenizer is loaded from `pretrained_model`; `pretrained_model` is set to the discriminator's when `discrim` is specified; `sample = False` by default, but a CLI parameter was introduced. To obtain identical samples, call the CLI with --sample.
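A minimal argparse sketch of the interface change described above, assuming the script parses its options this way (the default model name and help strings are illustrative):

```python
import argparse

parser = argparse.ArgumentParser()
parser.add_argument("--pretrained_model", default="gpt2-medium",
                    help="formerly `model_path`; the tokenizer is loaded from it")
parser.add_argument("--sample", action="store_true",
                    help="enable sampling (off by default, so runs are deterministic)")

args = parser.parse_args([])  # no flags: args.sample is False
```

With `action="store_true"`, sampling stays off unless `--sample` is passed explicitly, which matches the "identical samples" behavior the message describes.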
-
w4nderlust authored
-
piero authored
-
piero authored
-