- 02 Apr, 2020 1 commit
-
Patrick von Platen authored
* replace heavy t5 models with tiny random models as was done by sshleifer
* fix isort
-
- 01 Apr, 2020 7 commits
-
Patrick von Platen authored
* change tf t5 argument naming for TF 2.2
* correct bug in testing
-
Patrick von Platen authored
-
Anirudh Srinivasan authored
-
Patrick von Platen authored
[T5, Tests] Add extensive hard-coded integration tests and make sure PT and TF give equal results (#3550)
* add some t5 integration tests
* finish summarization and translation integration tests for T5 - results look good
* add tf test
* fix == vs is bug
* fix tf beam search error and make tf t5 tests pass
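The "== vs is" fix above is a classic Python pitfall worth spelling out: `is` checks object identity, not value equality, so two equal values can still fail an `is` comparison (a generic illustration, not the actual fix from the commit):

```python
a = [1, 2, 3]
b = [1, 2, 3]

# == compares values: True, the two lists hold the same elements
values_equal = a == b

# is compares identity: False, these are two distinct list objects
same_object = a is b
```

Tests that accidentally use `is` on computed values can pass or fail depending on interpreter caching, which is why such bugs often surface only on some inputs.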
-
HUSEIN ZOLKEPLI authored
* add bert bahasa readme
* update readme
* update readme
* added xlnet
* added tiny-bert and fix xlnet readme
-
Manuel Romero authored
Create model card for: distilbert-multi-finetuned-for-xqua-on-tydiqa
-
Julien Chaumond authored
* Start cleaning examples
* Fixup
-
- 31 Mar, 2020 16 commits
-
Patrick von Platen authored
* add bad words list
* make style
* add bad_words_tokens
* make style
* better naming
* make style
* fix typo
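Functionally, a bad-words list amounts to masking the banned token ids in the logits before the next token is picked. A minimal single-token sketch in plain Python (hypothetical helper, not the transformers API, which also handles multi-token bad-word sequences):

```python
import math

def ban_bad_words(scores, bad_words_ids):
    """Set the logit of each banned token id to -inf so greedy search
    or sampling can never select it."""
    banned = set(bad_words_ids)
    return [-math.inf if i in banned else s for i, s in enumerate(scores)]

scores = [0.1, 2.5, -0.3, 1.7]          # logits over a 4-token vocabulary
masked = ban_bad_words(scores, [1, 3])  # ban tokens 1 and 3
best = max(range(len(masked)), key=masked.__getitem__)  # argmax over masked logits
```

Setting the logit to `-inf` (rather than merely lowering it) guarantees the token's probability is exactly zero after the softmax.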
-
Patrick von Platen authored
* fix conflicts
* add model size argument to summarization
* correct wrong import
* fix isort
* correct imports
* other isort make style
* make style
-
Manuel Romero authored
- Show that the last uploaded version was trained on more data (custom_license files)
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Manuel Romero authored
Fix typo
-
Manuel Romero authored
-
Branden Chan authored
* Create README.md
* Update README.md
-
Manuel Romero authored
-
Gabriele Sarti authored
-
Manuel Romero authored
-
Manuel Romero authored
-
Sho Arora authored
-
Leandro von Werra authored
* feat: add model card bert-imdb
* feat: add model card gpt2-imdb-pos
* feat: add model card gpt2-imdb
-
- 30 Mar, 2020 11 commits
-
Ethan Perez authored
* Using loaded checkpoint with --do_predict
  Without this fix, I'm getting near-random validation performance for a trained model, and the validation performance differs per validation run. I think this happens since the `model` variable isn't set with the loaded checkpoint, so I'm using a randomly initialized model. Looking at the model activations, they differ each time I run evaluation (but they don't with this fix).
* Update checkpoint loading
* Fixing model loading
-
Sam Shleifer authored
-
dougian authored
Co-authored-by: Ioannis Douratsos <ioannisd@amazon.com>
-
Julien Chaumond authored
-
Julien Plu authored
* Update the NER TF script to remove the softmax and set the pad token label id to -1
* Reformat for quality and style

Co-authored-by: Julien Plu <julien.plu@adevinta.com>
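The pad-label convention above only works if padded positions are excluded downstream; a minimal sketch of filtering them out before computing loss or metrics (hypothetical helper name, assuming -1 marks padding as in the commit):

```python
def active_predictions(preds, label_ids, pad_label_id=-1):
    """Keep only positions whose gold label is not the padding label,
    so padded tokens don't contribute to loss or metrics."""
    return [(p, l) for p, l in zip(preds, label_ids) if l != pad_label_id]

preds = [3, 1, 0, 2]
label_ids = [3, -1, 0, -1]  # positions 1 and 3 are padding
pairs = active_predictions(preds, label_ids)
```

Using a sentinel label id keeps the tensors rectangular while still letting the evaluation code ignore padding exactly.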
-
LysandreJik authored
-
LysandreJik authored
-
LysandreJik authored
-
Patrick von Platen authored
-
Patrick von Platen authored
* make decoder input ids optional for t5 training
* lm_labels should not be shifted in t5
* add tests
* finish shift right functionality for PT T5
* move shift right to correct class
* cleaner code
* replace -100 values with pad token id
* add assert statement
* remove unnecessary for loop
* make style
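The shift-right functionality described above can be sketched in plain Python: the decoder input ids are the labels shifted one position to the right, with a decoder start token prepended and any -100 ignore-index values replaced by the pad token id (a simplified sketch, not the actual tensor-based T5 implementation):

```python
def shift_right(labels, decoder_start_token_id, pad_token_id):
    """Build decoder_input_ids from labels: prepend the start token,
    drop the last label, and replace -100 (loss ignore index) with pad."""
    shifted = [decoder_start_token_id] + labels[:-1]
    return [pad_token_id if t == -100 else t for t in shifted]

labels = [42, 7, 99, -100, -100]
decoder_input_ids = shift_right(labels, decoder_start_token_id=0, pad_token_id=0)
```

This is what makes decoder input ids optional at training time: they can always be derived from the labels.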
-
Patrick von Platen authored
* Add clear description of how to train T5
* correct docstring in T5
* correct typo
* correct docstring format
* update t5 model docs
* implement collins feedback
* fix typo and add more explanation for sentinel tokens
* delete unnecessary todos
-
- 29 Mar, 2020 2 commits
-
Sam Shleifer authored
-
Sam Shleifer authored
-
- 27 Mar, 2020 3 commits
-
Stefan Schweter authored
-
Patrick von Platen authored
* force bleu
* fix wrong file name
* rename file
* different filenames for each example test
* test files should clean up after themselves
* test files should clean up after themselves
* do not force bleu
* correct typo
* fix isort
-
Patrick von Platen authored
-