1. 13 Apr, 2020 1 commit
  2. 10 Apr, 2020 5 commits
    • Jin Young Sohn's avatar
      Add `run_glue_tpu.py` that trains models on TPUs (#3702) · 551b4505
      Jin Young Sohn authored
      * Initial commit to get BERT + run_glue.py on TPU
      
      * Add README section for TPU and address comments.
      
      * Cleanup TPU bits from run_glue.py (#3)
      
      TPU runner is currently implemented in:
      https://github.com/pytorch-tpu/transformers/blob/tpu/examples/run_glue_tpu.py.
      
      We plan to upstream this directly into `huggingface/transformers`
      (either `master` or `tpu`) branch once it's been more thoroughly tested.
      
      * No need to call `xm.mark_step()` explicitly (#4)
      
      For gradient accumulation we iterate over batches from a
      `ParallelLoader` instance, which already marks the step itself on
      `next()`.
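      A pure-Python sketch of why the explicit `xm.mark_step()` is redundant
      here. `StepMarkingLoader` below is a hypothetical stand-in for
      torch_xla's `ParallelLoader`, whose `next()` already marks the XLA
      step; the training loop itself never has to.

```python
# Illustrative sketch only (no torch_xla): the loader marks a "step"
# each time a batch is fetched, mimicking ParallelLoader's behavior.
class StepMarkingLoader:
    def __init__(self, batches):
        self.batches = batches
        self.steps_marked = 0  # how many times a step was marked

    def __iter__(self):
        for batch in self.batches:
            self.steps_marked += 1  # stands in for xm.mark_step()
            yield batch

def train(loader, grad_accum_steps):
    updates = 0
    for i, batch in enumerate(loader):
        _ = sum(batch)  # placeholder for the forward/backward pass
        if (i + 1) % grad_accum_steps == 0:
            updates += 1  # optimizer.step(); no extra mark_step needed
    return updates

loader = StepMarkingLoader([[1, 2], [3, 4], [5, 6], [7, 8]])
print(train(loader, grad_accum_steps=2))  # 2 optimizer updates
print(loader.steps_marked)                # 4 steps, one per batch
```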
      
      * Resolve R/W conflicts from multiprocessing (#5)
      
      * Add XLNet in list of models for `run_glue_tpu.py` (#6)
      
      * Add RoBERTa to list of models in TPU GLUE (#7)
      
      * Add RoBERTa and DistilBert to list of models in TPU GLUE (#8)
      
      * Use barriers to reduce duplicate work/resources (#9)
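      A small sketch of the barrier pattern, with Python threads standing in
      for TPU processes and `threading.Barrier` standing in for a TPU
      rendezvous: one worker does the shared work once, the rest wait at the
      barrier instead of duplicating it.

```python
import threading

# Sketch: rank 0 performs a shared task (e.g. caching features) exactly
# once; the other ranks wait at the barrier instead of redoing the work.
NUM_WORKERS = 4
shared = {}
barrier = threading.Barrier(NUM_WORKERS)

def worker(rank):
    if rank == 0:
        shared["features"] = "cached"  # expensive work, done by one rank
    barrier.wait()                     # rendezvous: all ranks sync here
    # After the barrier, every rank can safely read the shared result.
    assert shared["features"] == "cached"

threads = [threading.Thread(target=worker, args=(r,)) for r in range(NUM_WORKERS)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(shared)  # the work ran once and is visible to all workers
```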
      
      * Shard eval dataset and aggregate eval metrics (#10)
      
      * Shard eval dataset and aggregate eval metrics
      
      Also, instead of calling `eval_loss.item()` on every step, do the
      summation with tensors on device.
      
      * Change defaultdict to float
      
      * Reduce the pred, label tensors instead of metrics
      
      As brought up during review, some metrics like F1 cannot be aggregated
      via averaging. GLUE task metrics depend largely on the dataset, so
      instead we sync the prediction and label tensors so that the metrics
      can be computed accurately on those.
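      A toy, pure-Python illustration of that point: averaging per-shard F1
      is not the same as computing F1 over the pooled predictions, which is
      why the tensors are synced rather than the metrics. The shard data
      below is made up.

```python
# Binary F1 from scratch: F1 = 2*TP / (2*TP + FP + FN).
def f1(preds, labels):
    tp = sum(p == 1 and y == 1 for p, y in zip(preds, labels))
    fp = sum(p == 1 and y == 0 for p, y in zip(preds, labels))
    fn = sum(p == 0 and y == 1 for p, y in zip(preds, labels))
    return 2 * tp / (2 * tp + fp + fn)

# Two hypothetical eval shards: (predictions, labels).
shard_a = ([1, 1, 0, 0], [1, 0, 0, 0])
shard_b = ([0, 0, 0, 1], [1, 1, 0, 1])

averaged = (f1(*shard_a) + f1(*shard_b)) / 2                   # ~0.583
pooled = f1(shard_a[0] + shard_b[0], shard_a[1] + shard_b[1])  # ~0.571
print(averaged, pooled)  # the two disagree
```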
      
      * Only use tb_writer from master (#11)
      
      * Apply huggingface black code formatting
      
      * Style
      
      * Remove `--do_lower_case` as example uses cased
      
      * Add option to specify tensorboard logdir
      
      This is needed for our testing framework, which checks regressions
      against key metrics written by the summary writer.
      
      * Using configuration for `xla_device`
      
      * Prefix TPU specific comments.
      
      * num_cores clarification and namespace eval metrics
      
      * Cache features file under `args.cache_dir`
      
      Instead of under `args.data_dir`. This is needed as our test infra uses
      data_dir with a read-only filesystem.
      
      * Rename `run_glue_tpu` to `run_tpu_glue`
      Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
      551b4505
    • Julien Chaumond's avatar
      [examples] Generate argparsers from type hints on dataclasses (#3669) · b169ac9c
      Julien Chaumond authored
      * [examples] Generate argparsers from type hints on dataclasses
      
      * [HfArgumentParser] way simpler API
      
      * Restore run_language_modeling.py for easier diff
      
      * [HfArgumentParser] final tweaks from code review
      b169ac9c
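      A minimal sketch of the idea behind this commit (not the real
      `HfArgumentParser` implementation): derive an `argparse` parser from a
      dataclass's fields and type hints. The `TrainingArgs` fields here are
      invented for illustration.

```python
import argparse
from dataclasses import dataclass, fields

@dataclass
class TrainingArgs:
    learning_rate: float = 5e-5
    num_epochs: int = 3
    output_dir: str = "out"

def parser_from_dataclass(cls):
    """Build an argparse parser with one flag per dataclass field."""
    parser = argparse.ArgumentParser()
    for f in fields(cls):
        # The field's type hint becomes the argparse converter, and the
        # field's default becomes the flag's default.
        parser.add_argument(f"--{f.name}", type=f.type, default=f.default)
    return parser

ns = parser_from_dataclass(TrainingArgs).parse_args(
    ["--learning_rate", "3e-5", "--num_epochs", "5"]
)
print(TrainingArgs(**vars(ns)))  # typed args, defaults filled in
```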
    • Julien Chaumond's avatar
      Big cleanup of `glue_convert_examples_to_features` (#3688) · f98d0ef2
      Julien Chaumond authored
      * Big cleanup of `glue_convert_examples_to_features`
      
      * Use batch_encode_plus
      
      * Cleaner wrapping of glue_convert_examples_to_features for TF
      
      @lysandrejik
      
      * Cleanup syntax, thanks to @mfuntowicz
      
      * Raise explicit error in case of user error
      f98d0ef2
  3. 07 Apr, 2020 3 commits
  4. 06 Apr, 2020 1 commit
    • Ethan Perez's avatar
      Fix RoBERTa/XLNet Pad Token in run_multiple_choice.py (#3631) · e52d1258
      Ethan Perez authored
      * Fix RoBERTa/XLNet Pad Token in run_multiple_choice.py
      
      `convert_examples_to_features` sets `pad_token=0` by default, which is correct for BERT but incorrect for RoBERTa (`pad_token=1`) and XLNet (`pad_token=5`). I think the other arguments to `convert_examples_to_features` are correct, but it would be helpful if someone more familiar with this part of the codebase checked them.
      
      * Simplifying change to match recent commits
      e52d1258
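      A hedged illustration of the bug class fixed here: the padding id must
      match the model rather than being hardcoded to 0. The ids are the ones
      named in the commit message; the helper below is illustrative, not the
      example script's actual code.

```python
# Pad ids per model family, as stated in the commit above.
PAD_TOKEN_ID = {"bert": 0, "roberta": 1, "xlnet": 5}

def pad_to_length(input_ids, max_length, pad_id):
    """Right-pad a list of token ids with the model's own pad id."""
    return input_ids + [pad_id] * (max_length - len(input_ids))

# Hardcoding pad_id=0 would be wrong for RoBERTa and XLNet:
print(pad_to_length([101, 2023, 102], 5, PAD_TOKEN_ID["bert"]))
# [101, 2023, 102, 0, 0]
print(pad_to_length([0, 713, 2], 5, PAD_TOKEN_ID["roberta"]))
# [0, 713, 2, 1, 1]
```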
  5. 02 Apr, 2020 3 commits
  6. 01 Apr, 2020 1 commit
  7. 31 Mar, 2020 1 commit
  8. 30 Mar, 2020 3 commits
  9. 29 Mar, 2020 1 commit
  10. 27 Mar, 2020 4 commits
  11. 26 Mar, 2020 3 commits
  12. 25 Mar, 2020 1 commit
  13. 24 Mar, 2020 3 commits
  14. 23 Mar, 2020 1 commit
  15. 20 Mar, 2020 3 commits
  16. 19 Mar, 2020 3 commits
  17. 17 Mar, 2020 3 commits
    • J.P Lee's avatar
      Update examples/ner/run_ner.py to use AutoModel (#3305) · 2b60a26b
      J.P Lee authored
      * Update examples/ner/run_ner.py to use AutoModel
      
      * Fix missing code and apply `make style` command
      2b60a26b
    • Nathan Raw's avatar
      [WIP] Lightning glue example (#3290) · 930c9412
      Nathan Raw authored
      *  Alter base pl transformer to use automodels
      
      * 🐛 Add batch size env variable to function call
      
      * 💄 Apply black code style from Makefile
      
      * 🚚 Move lightning base out of ner directory
      
      *  Add lightning glue example
      
      * 💄 self
      
      * move _feature_file to base class
      
      *  Move eval logging to custom callback
      
      * 💄 Apply black code style
      
      * 🐛 Add parent to pythonpath, remove copy command
      
      * 🐛 Add missing max_length kwarg
      930c9412
    • Patrick von Platen's avatar
      [generate] do_sample default back to False (#3298) · e8f44af5
      Patrick von Platen authored
      * change do_samples back
      
      * None better default as boolean
      
      * adapt do_sample to True in test example
      
      * make style
      e8f44af5