Commits · ab5d06a094ad32722018a069c2fb9707d1c6a8b1 · chenpangpang / transformers

02 Apr, 2020 1 commit
- [T5, examples] replace heavy t5 models with tiny random models (#3556) · ab5d06a0
  Patrick von Platen authored Apr 02, 2020
```
* replace heavy t5 models with tiny random models as was done by sshleifer

* fix isort
```
  ab5d06a0
01 Apr, 2020 1 commit
- Tokenizers: Start cleaning examples a little (#3455) · 50e15c82
  Julien Chaumond authored Apr 01, 2020
```
* Start cleaning examples

* Fixup
```
  50e15c82
31 Mar, 2020 1 commit

[Examples] Clean summarization and translation example testing files for T5 and Bart (#3514) · ae6834e0

Patrick von Platen authored Mar 31, 2020

* fix conflicts

* add model size argument to summarization

* correct wrong import

* fix isort

* correct imports

* other isort make style

* make style

ae6834e0

30 Mar, 2020 3 commits

[Bug fix] Using loaded checkpoint with --do_predict (instead of… (#3437) · e5c393dc

Ethan Perez authored Mar 30, 2020

* Using loaded checkpoint with --do_predict

Without this fix, I'm getting near-random validation performance for a trained model, and the validation performance differs per validation run. I think this happens since the `model` variable isn't set with the loaded checkpoint, so I'm using a randomly initialized model. Looking at the model activations, they differ each time I run evaluation (but they don't with this fix).

* Update checkpoint loading

* Fixing model loading

e5c393dc

[bart-tiny-random] Put a 5MB model on S3 to allow faster exampl… (#3488) · 8deff3ac
Sam Shleifer authored Mar 30, 2020

8deff3ac

Update the NER TF script (#3511) · d38bbb22

Julien Plu authored Mar 30, 2020



* Update the NER TF script to remove the softmax and make the pad token label id to -1

* Reformat the quality and style
Co-authored-by: Julien Plu <julien.plu@adevinta.com>

d38bbb22

29 Mar, 2020 1 commit
- [Docs] examples/summarization/bart: Simplify CNN/DM preprocessi… (#3516) · 33ef7002
  Sam Shleifer authored Mar 29, 2020
  
  33ef7002
27 Mar, 2020 4 commits

Fix circle ci flaky fail of wmt example (#3485) · 17dceae7

Patrick von Platen authored Mar 27, 2020

* force bleu

* fix wrong file name

* rename file

* different filenames for each example test

* test files should clean up after themselves

* test files should clean up after themselves

* do not force bleu

* correct typo

* fix isort

17dceae7

run_ner.py / bert-base-multilingual-cased can output empty tokens (#2991) · b08259a1

Funtowicz Morgan authored Mar 27, 2020



* Use tokenizer.num_added_tokens to count number of added special_tokens instead of hardcoded numbers.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* run_ner.py - Do not add a label to the labels_ids if word_tokens is empty.

This can happen when using bert-base-multilingual-cased with an input containing an unique space.
In this case, the tokenizer will output just an empty word_tokens thus leading to an non-consistent behavior
over the labels_ids tokens adding one more tokens than tokens vector.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

b08259a1

Rename `t5-large` to `t5-base` in README.md · f4f49468
Patrick von Platen authored Mar 27, 2020

f4f49468
Add option to choose T5 model size. (#3480) · ff80b731
Lysandre Debut authored Mar 27, 2020
```
T5-small in test


isort
```
ff80b731

26 Mar, 2020 3 commits

Add wmt translation example (#3428) · 5ad2ea06

Patrick von Platen authored Mar 26, 2020

* add translation example

* make style

* adapt docstring

* add gpu device as input for example

* small renaming

* better README

5ad2ea06

Add t5 summarization example (#3411) · e703e923

Patrick von Platen authored Mar 26, 2020

* rebase to master

* change tf to pytorch

* change to pytorch

* small fix

* renaming

* add gpu training possibility

* renaming

* improve README

* incoorporate collins feedback

* better Readme

* better README.md

e703e923

Force the return of token type IDs (#3439) · ffcffebe
Lysandre Debut authored Mar 26, 2020

ffcffebe

25 Mar, 2020 1 commit
- BART for summarization training with CNN/DM using pytorch-lightning · 3d76df3a
  Andre Carrera authored Mar 24, 2020
  
  3d76df3a
24 Mar, 2020 3 commits
- [run_language_modeling] Fix: initialize a new model from a config object · eaabaaf7
  Julien Chaumond authored Mar 24, 2020
  
  eaabaaf7
- Expose missing mappings (see #3415) · f8823bad
  Julien Chaumond authored Mar 24, 2020
  
  f8823bad
- [examples] Use AutoModels in more examples · a8e3336a
  Julien Chaumond authored Mar 23, 2020
  
  a8e3336a
23 Mar, 2020 1 commit
- [BertAbs] Move files around for more consistent naming · f7dcf8fc
  Julien Chaumond authored Mar 23, 2020
  
  f7dcf8fc
20 Mar, 2020 3 commits
- One last reorder of {scheduler,optimizer}.step() · cf72479b
  Julien Chaumond authored Mar 20, 2020
  
  cf72479b
- fixes lr_scheduler warning · 634bf6cf
  Elijah Rippeth authored Mar 20, 2020
```
For more details, see https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate
```
  634bf6cf
- Clean special token init in modeling_....py (#3264) · 95e00d08
  Patrick von Platen authored Mar 20, 2020
```
* make style

* fix conflicts
```
  95e00d08
19 Mar, 2020 3 commits
- removing torch.cuda.empty_cache() from TF function (#3267) · 8becb732
  Nitish Shirish Keskar authored Mar 19, 2020
```
torch.cuda.empty_cache() was being called from a TF function (even when torch is unavailable)
not sure any replacement is needed if TF OOMs
```
  8becb732
- Fix #3305: run_ner only possible on ModelForTokenClassification models · 656e1386
  Julien Chaumond authored Mar 18, 2020
  
  656e1386
- [FIX] not training when epoch is small (#3006) · c44a17db
  mataney authored Mar 19, 2020
```
* solving bug where for small epochs and large gradient_accumulation_steps we never train

* black formatting

* no need to change these files
```
  c44a17db
17 Mar, 2020 4 commits

Update examples/ner/run_ner.py to use AutoModel (#3305) · 2b60a26b
J.P Lee authored Mar 18, 2020
```
* Update examples/ner/run_ner.py to use AutoModel

* Fix missing code and apply `make style` command
```
2b60a26b

[WIP] Lightning glue example (#3290) · 930c9412

Nathan Raw authored Mar 17, 2020

* ✨ Alter base pl transformer to use automodels

* 🐛 Add batch size env variable to function call

* 💄 Apply black code style from Makefile

* 🚚 Move lightning base out of ner directory

* ✨ Add lightning glue example

* 💄 self

* move _feature_file to base class

* ✨ Move eval logging to custom callback

* 💄 Apply black code style

* 🐛 Add parent to pythonpath, remove copy command

* 🐛 Add missing max_length kwarg

930c9412

[generate] do_sample default back to False (#3298) · e8f44af5

Patrick von Platen authored Mar 17, 2020

* change do_samples back

* None better default as boolean

* adapt do_sample to True in test example

* make style

e8f44af5

CPU/GPU memory benchmarking utilities - Remove support for python 3.5 (now only 3.6+) (#3186) · 2187c49f

Thomas Wolf authored Mar 17, 2020

* memory benchmark rss

* have both forward pass and line-by-line mem tracing

* cleaned up tracing

* refactored and cleaning up API

* no f-strings yet...

* add GPU mem logging

* fix GPU memory monitoring

* style and quality

* clean up and doc

* update with comments

* Switching to python 3.6+

* fix quality

2187c49f

16 Mar, 2020 1 commit
- [BART] Remove unused kwargs (#3279) · 5ea8ba67
  Sam Shleifer authored Mar 15, 2020
```
* Remove unused kwargs
* dont call forward in tests
```
  5ea8ba67
13 Mar, 2020 3 commits

make style · 4f75d380
Patrick von Platen authored Mar 13, 2020

4f75d380
update file to new starting token logic · c2ee3840
Patrick von Platen authored Mar 13, 2020

c2ee3840

Bump psutil from 5.6.3 to 5.6.6 in /examples/distillation · afea70c0

dependabot[bot] authored Mar 12, 2020

Bumps [psutil](https://github.com/giampaolo/psutil) from 5.6.3 to 5.6.6.
- [Release notes](https://github.com/giampaolo/psutil/releases)
- [Changelog](https://github.com/giampaolo/psutil/blob/master/HISTORY.rst)
- [Commits](https://github.com/giampaolo/psutil/compare/release-5.6.3...release-5.6.6

)
Signed-off-by: dependabot[bot] <support@github.com>

afea70c0

12 Mar, 2020 1 commit
- Bart: update example for #3140 compatibility (#3233) · 2e81b9d8
  Sam Shleifer authored Mar 12, 2020
```
* Update bart example docs
```
  2e81b9d8
11 Mar, 2020 1 commit
- renamed min_len to min_length · 5b3000d9
  Patrick von Platen authored Mar 05, 2020
  
  5b3000d9
10 Mar, 2020 1 commit

NER - pl example (#3180) · 5ca356a4

Shubham Agarwal authored Mar 10, 2020

* 1. seqeval required by ner pl example. install from examples/requirements. 2. unrecognized arguments: save_steps

* pl checkpoint callback filenotfound error: make directory and pass

* #3159 pl checkpoint path difference

* 1. Updated Readme for pl 2. pl script now also correct displays logs 3. pass gpu ids compared to number of gpus

* Updated results in readme

* 1. updated readme 2. removing deprecated pl methods 3. finalizing scripts

* comment length check

* using deprecated validation_end for stable results

* style related changes

5ca356a4

09 Mar, 2020 2 commits
- Bart example: model.to(device) (#3194) · 3aca02ef
  Sam Shleifer authored Mar 09, 2020
  
  3aca02ef
- cased -> uncased in BERT SQuAD example · eb3e6cb0
  Lysandre authored Mar 09, 2020
```
closes #3183
```
  eb3e6cb0
05 Mar, 2020 2 commits
- Rename BartForMaskedLM -> BartForConditionalGeneration (#3114) · 857e0a0d
  Sam Shleifer authored Mar 05, 2020
```
* improved documentation
```
  857e0a0d
- undo chg · c203509d
  sshleifer authored Mar 05, 2020
  
  c203509d