- 03 Apr, 2020 9 commits
-
-
Lysandre Debut authored
* Electra wip
* helpers
* Electra wip
* Electra v1
* ELECTRA may be saved/loaded
* Generator & Discriminator
* Embedding size instead of halving the hidden size
* ELECTRA Tokenizer
* Revert BERT helpers
* ELECTRA Conversion script
* Archive maps
* PyTorch tests
* Start fixing tests
* Tests pass
* Same configuration for both models
* Compatible with base + large
* Simplification + weight tying
* Archives
* Auto + Renaming to standard names
* ELECTRA is uncased
* Tests
* Slight API changes
* Update tests
* wip
* ElectraForTokenClassification
* temp
* Simpler arch + tests (removed ElectraForPreTraining, which will be in a script)
* Conversion script
* Auto model
* Update links to S3
* Split ElectraForPreTraining and ElectraForTokenClassification
* Actually test PreTraining model
* Remove num_labels from configuration
* wip
* wip
* From discriminator and generator to electra
* Slight API changes
* Better naming
* TensorFlow ELECTRA tests
* Accurate conversion script
* Added to conversion script
* Fast ELECTRA tokenizer
* Style
* Add ELECTRA to README
* Modeling PyTorch doc + real style
* TF docs
* Docs
* Correct links
* Correct model initialized
* random fixes
* style
* Addressing Patrick's and Sam's comments
* Correct links in docs
-
Yohei Tamura authored
* BertJapaneseTokenizer accepts options for mecab
* black
* fix mecab_option to Optional[str]
-
HUSEIN ZOLKEPLI authored
* add bert bahasa readme
* update readme
* update readme
* added xlnet
* added tiny-bert and fix xlnet readme
* added albert base
-
ahotrod authored
Update AutoModel & AutoTokenizer loading.
-
ahotrod authored
-
HenrykBorzymowski authored
* added model_cards for polish squad models
* corrected mistake in polish model cards

Co-authored-by: Henryk Borzymowski <henryk.borzymowski@pwc.com>
-
redewiedergabe authored
* Create README.md
* added meta block (language: german)
* Added additional information about test data
-
ahotrod authored
-
Henryk Borzymowski authored
-
- 02 Apr, 2020 5 commits
-
-
Patrick von Platen authored
-
Nicolas authored
* Resizing the embedding matrix after sending it to the optimizer prevents updating the newly resized matrix.
* Remove space for style
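The fix above is about ordering: resize the embedding matrix *before* constructing the optimizer. A minimal pure-Python sketch of why the order matters (illustrative stand-in classes, not the actual transformers/PyTorch code): an optimizer captures references to the parameter objects it is given at construction time, so replacing the embedding matrix afterwards leaves the optimizer updating a stale, discarded tensor.

```python
class Optimizer:
    """Stands in for torch.optim.*: captures parameter references at construction."""
    def __init__(self, params):
        self.params = list(params)

class Model:
    def __init__(self, vocab_size=4):
        self.embedding = [0.0] * vocab_size  # stands in for the embedding weight matrix
    def resize_embedding(self, new_size):
        # replaces the matrix with a *new* object, as a resize does
        self.embedding = (self.embedding + [0.0] * new_size)[:new_size]
    def parameters(self):
        return [self.embedding]

# Wrong order: build the optimizer, then resize -> optimizer holds a stale matrix
m1 = Model()
opt1 = Optimizer(m1.parameters())
m1.resize_embedding(6)
stale = opt1.params[0] is not m1.embedding  # True: the resized matrix is never updated

# Right order: resize first, then build the optimizer
m2 = Model()
m2.resize_embedding(6)
opt2 = Optimizer(m2.parameters())
fresh = opt2.params[0] is m2.embedding      # True: optimizer tracks the resized matrix
```

The same reasoning applies to any framework whose optimizer holds parameter references: any operation that swaps a parameter object out must happen before the optimizer is created (or the optimizer must be rebuilt).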
-
Mark Kockerbeck authored
-
Patrick von Platen authored
* solve conflicts
* improve comments
-
Patrick von Platen authored
* replace heavy t5 models with tiny random models, as was done by sshleifer
* fix isort
-
- 01 Apr, 2020 7 commits
-
-
Patrick von Platen authored
* change tf t5 argument naming for TF 2.2
* correct bug in testing
-
Patrick von Platen authored
-
Anirudh Srinivasan authored
-
Patrick von Platen authored
[T5, Tests] Add extensive hard-coded integration tests and make sure PT and TF give equal results (#3550)

* add some t5 integration tests
* finish summarization and translation integration tests for T5 - results look good
* add tf test
* fix == vs is bug
* fix tf beam search error and make tf t5 tests pass
-
HUSEIN ZOLKEPLI authored
* add bert bahasa readme
* update readme
* update readme
* added xlnet
* added tiny-bert and fix xlnet readme
-
Manuel Romero authored
Create model card for: distilbert-multi-finetuned-for-xqua-on-tydiqa
-
Julien Chaumond authored
* Start cleaning examples
* Fixup
-
- 31 Mar, 2020 16 commits
-
-
Patrick von Platen authored
* add bad words list
* make style
* add bad_words_tokens
* make style
* better naming
* make style
* fix typo
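The bad-words feature above blocks chosen token sequences during generation. A minimal sketch of the underlying mechanism (the function name and shapes here are illustrative, not the library's internals): before each decoding step, any token whose emission would complete a banned sequence has its logit forced to negative infinity, so it can never be sampled.

```python
import math

def ban_bad_words(logits, generated, bad_words_ids):
    """Mask any token whose emission would complete a banned token sequence.

    logits: per-token scores for the next step (mutated in place)
    generated: token ids produced so far
    bad_words_ids: list of banned token-id sequences
    """
    for bad in bad_words_ids:
        prefix, last = bad[:-1], bad[-1]
        # a one-token ban always applies; a longer ban applies only when the
        # already-generated tokens end with its prefix
        if not prefix or generated[-len(prefix):] == prefix:
            logits[last] = -math.inf
    return logits

scores = [0.1, 0.5, 0.2, 0.9]
# ban token 2 everywhere, and token 3 only immediately after token 1
masked = ban_bad_words(scores, generated=[0, 1], bad_words_ids=[[2], [1, 3]])
# tokens 2 and 3 are now masked to -inf; tokens 0 and 1 are untouched
```

Applying the mask per step, conditioned on the generated prefix, is what lets multi-token phrases be banned without banning their constituent tokens everywhere.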
-
Patrick von Platen authored
* fix conflicts
* add model size argument to summarization
* correct wrong import
* fix isort
* correct imports
* other isort make style
* make style
-
Manuel Romero authored
- Show that the last uploaded version was trained on more data (custom_license files)
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Manuel Romero authored
Fix typo
-
Manuel Romero authored
-
Branden Chan authored
* Create README.md
* Update README.md
-
Manuel Romero authored
-
Gabriele Sarti authored
-
Manuel Romero authored
-
Manuel Romero authored
-
Sho Arora authored
-
Leandro von Werra authored
* feat: add model card bert-imdb
* feat: add model card gpt2-imdb-pos
* feat: add model card gpt2-imdb
-
- 30 Mar, 2020 3 commits
-
-
Ethan Perez authored
* Using loaded checkpoint with --do_predict

  Without this fix, I'm getting near-random validation performance for a trained model, and the validation performance differs per validation run. I think this happens because the `model` variable isn't set to the loaded checkpoint, so I'm using a randomly initialized model. Looking at the model activations, they differ each time I run evaluation (but they don't with this fix).
* Update checkpoint loading
* Fixing model loading
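The fix above reduces to one rule: explicitly restore the saved checkpoint into the model before predicting, rather than evaluating whatever (possibly freshly initialized) model object happens to be in scope. A minimal framework-free sketch of that rule, with `pickle` standing in for the real checkpoint format and all names illustrative:

```python
import os
import pickle
import tempfile

def save_checkpoint(weights, path):
    with open(path, "wb") as f:
        pickle.dump(weights, f)

def load_checkpoint(path):
    with open(path, "rb") as f:
        return pickle.load(f)

ckpt_path = os.path.join(tempfile.mkdtemp(), "best.ckpt")

trained = {"w": 1.5}               # weights at the end of training
save_checkpoint(trained, ckpt_path)

model = {"w": 0.0}                 # freshly initialized model: predicting with it
                                   # now gives the near-random results described
model = load_checkpoint(ckpt_path) # the fix: restore weights before predicting
```

Run-to-run variation in evaluation metrics, as described in the commit, is a telltale symptom of this bug: random initialization differs each run, while a loaded checkpoint is deterministic.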
-
Sam Shleifer authored
-
dougian authored
Co-authored-by: Ioannis Douratsos <ioannisd@amazon.com>
-