1. 27 May, 2020 1 commit
    • Add back --do_lower_case to uncased models (#4245) · a9aa7456
      Hao Tan authored
      The option `--do_lower_case` is currently required by the uncased models (i.e., bert-base-uncased, bert-large-uncased); a tokenizer-level sketch of its effect follows the results below.
      
      Results:
      BERT-BASE without --do_lower_case: exact = 73.83, f1 = 82.22
      BERT-BASE with --do_lower_case:    exact = 81.02, f1 = 88.34
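      For context, a minimal sketch of what the flag controls at the tokenizer level (this uses the Python API equivalent of the command-line option; the checkpoint name is the one from the commit):

```python
# Sketch: --do_lower_case maps to the tokenizer's do_lower_case flag.
# Uncased checkpoints were pretrained on lowercased text, so their
# vocabulary only contains lowercase entries.
from transformers import BertTokenizer

tok = BertTokenizer.from_pretrained("bert-base-uncased", do_lower_case=True)
print(tok.tokenize("Hello World"))  # ['hello', 'world']

# Without lowercasing, cased input degrades into [UNK]/unexpected subwords
# the model never saw in pretraining, consistent with the F1 drop above.
tok_cased = BertTokenizer.from_pretrained("bert-base-uncased", do_lower_case=False)
print(tok_cased.tokenize("Hello World"))
```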
  2. 25 May, 2020 1 commit
  3. 21 May, 2020 2 commits
  4. 19 May, 2020 2 commits
  5. 18 May, 2020 2 commits
  6. 15 May, 2020 3 commits
  7. 14 May, 2020 2 commits
  8. 13 May, 2020 2 commits
  9. 12 May, 2020 1 commit
    • Add MultipleChoice to TFTrainer [WIP] (#4270) · e4512aab (a usage sketch follows this entry)
      Viktor Alm authored
      
      
      * catch gpu count of 1 and set to gpu0
      
      * Add mpc to trainer
      
      * Add MPC for TF
      
      * fix TF automodel for MPC and add Albert
      
      * Apply style
      
      * Fix import
      
      * Note to self: double check
      
      * Make shape None, None for datasetgenerator output shapes
      
      * Add from_pt bool, which doesn't seem to work
      
      * Original checkpoint dir
      
      * Fix docstrings for automodel
      
      * Update readme and apply style
      
      * Colab should probably not be from users
      
      * Colabs should probably not be from users
      
      * Add colab
      
      * Update README.md
      
      * Update README.md
      
      * Cleanup __init__
      
      * Cleanup flake8 trailing comma
      
      * Update src/transformers/training_args_tf.py
      
      * Update src/transformers/modeling_tf_auto.py
      Co-authored-by: Viktor Alm <viktoralm@pop-os.localdomain>
      Co-authored-by: Julien Chaumond <chaumond@gmail.com>
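      A hedged usage sketch of the TF multiple-choice auto class this PR wires up (modern tokenizer call syntax; the bare bert-base-uncased checkpoint loads a randomly initialized choice head, so treat this purely as a shape/API illustration):

```python
# Sketch: scoring answer choices with TFAutoModelForMultipleChoice.
# Inputs are shaped (batch, num_choices, seq_len).
import tensorflow as tf
from transformers import AutoTokenizer, TFAutoModelForMultipleChoice

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = TFAutoModelForMultipleChoice.from_pretrained("bert-base-uncased")

prompt = "The capital of France is"
choices = ["Paris.", "Berlin."]

# Encode one (prompt, choice) pair per choice, then add a batch dimension.
enc = tokenizer([prompt] * len(choices), choices, padding=True, return_tensors="tf")
inputs = {k: tf.expand_dims(v, 0) for k, v in enc.items()}

logits = model(inputs)[0]  # shape (1, num_choices)
print(choices[int(tf.argmax(logits, axis=-1)[0])])
```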
  10. 11 May, 2020 1 commit
  11. 08 May, 2020 1 commit
  12. 07 May, 2020 5 commits
  13. 06 May, 2020 2 commits
    • TF version of the trainer (#4017) · aad50151 (a usage sketch follows this entry)
      Julien Plu authored
      * First commit to add a TF version of the trainer.
      
      * Make the TF trainer closer to what the PT trainer looks like
      
      * Refactor common code between the PT and TF trainers into a utility file.
      
      * Some bugfixes + better similarity with the PT trainer
      
      * Add missing class in transformers init
      
      * Bugfix in prediction + use classification report instead of simple metrics
      
      * Fix name error
      
      * Fix optimization tests + style
      
      * Apply style
      
      * Several bugfix for multi-gpu training
      
      * Apply style
      
      * Apply style
      
      * Add glue example for the TF trainer
      
      * Several bugfixes + address the reviews
      
      * Fixes to the TF training args file
      
      * Add a debug mode
      
      * Bugfix in utils_ner.py when segment_ids is None
      
      * Apply style
      
      * Apply style
      
      * Add TPU strategy
      
      * Fix selection strategy
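      A hedged sketch of the TFTrainer API introduced here (argument names from that era's transformers; TFTrainer was later deprecated in favor of plain Keras training, and the toy dataset is purely illustrative):

```python
# Sketch: minimal TFTrainer usage. The trainer expects an *unbatched*
# tf.data.Dataset of (features_dict, label) pairs and batches it itself.
import tensorflow as tf
from transformers import (
    TFAutoModelForSequenceClassification,
    TFTrainer,
    TFTrainingArguments,
)

model = TFAutoModelForSequenceClassification.from_pretrained("bert-base-uncased")

# Toy data: eight copies of a 4-token example with alternating labels.
features = {
    "input_ids": tf.constant([[101, 2023, 2003, 102]] * 8),
    "attention_mask": tf.constant([[1, 1, 1, 1]] * 8),
}
labels = tf.constant([0, 1] * 4)
train_ds = tf.data.Dataset.from_tensor_slices((features, labels))

args = TFTrainingArguments(output_dir="./tf_trainer_out", num_train_epochs=1)
trainer = TFTrainer(model=model, args=args, train_dataset=train_ds)
trainer.train()
```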
    • Simone Primarosa
  14. 02 May, 2020 3 commits
  15. 01 May, 2020 1 commit
  16. 29 Apr, 2020 1 commit
    • CDN urls (#4030) · 455c6390 (a usage sketch follows this entry)
      Julien Chaumond authored
      * [file_utils] use_cdn + documentation
      
      * Move to CDN URLs for weights
      
      * [urls] Hotfix for bert-base-japanese
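      A hedged sketch of the use_cdn switch (the file_utils API of that era; these URL schemes were later replaced by huggingface.co resolve endpoints):

```python
# Sketch: choosing between the CDN (large, immutable weight files) and the
# raw bucket (small, frequently edited files) after this PR.
from transformers.file_utils import hf_bucket_url

# Weights -> CloudFront CDN URL.
print(hf_bucket_url("bert-base-uncased", "pytorch_model.bin", use_cdn=True))

# Configs -> plain bucket URL, so edits propagate without CDN cache delays.
print(hf_bucket_url("bert-base-uncased", "config.json", use_cdn=False))
```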
  17. 28 Apr, 2020 2 commits
  18. 24 Apr, 2020 2 commits
  19. 22 Apr, 2020 2 commits
    • Fixes #3877 · 1dc9b3c7
      Julien Chaumond authored
    • Trainer (#3800) · dd9d483d (a usage sketch follows this entry)
      Julien Chaumond authored
      * doc
      
      * [tests] Add sample files for a regression task
      
      * [HUGE] Trainer
      
      * Feedback from @sshleifer
      
      * Feedback from @thomwolf + logging tweak
      
      * [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes
      
      * [glue] Use default max_seq_length of 128 like before
      
      * [glue] move DataTrainingArguments around
      
      * [ner] Change interface of InputExample, and align run_{tf,pl}
      
      * Re-align the pl scripts a little bit
      
      * ner
      
      * [ner] Add integration test
      
      * Fix language_modeling with API tweak
      
      * [ci] Tweak loss target
      
      * Don't break console output
      
      * amp.initialize: the model must be on the right device beforehand
      
      * [multiple-choice] update for Trainer
      
      * Re-align to 827d6d6e
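      A hedged sketch of the Trainer API this PR lands (modern argument names, which differ slightly from the 2020 signature; the toy dataset is purely illustrative):

```python
# Sketch: minimal Trainer usage. The default collator expects each dataset
# item to be a dict of tensors that includes "labels".
import torch
from torch.utils.data import Dataset
from transformers import AutoModelForSequenceClassification, Trainer, TrainingArguments

class ToyDataset(Dataset):
    def __len__(self):
        return 8

    def __getitem__(self, i):
        return {
            "input_ids": torch.tensor([101, 2023, 2003, 102]),
            "attention_mask": torch.ones(4, dtype=torch.long),
            "labels": torch.tensor(i % 2),
        }

model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased")
args = TrainingArguments(output_dir="./trainer_out", num_train_epochs=1)
trainer = Trainer(model=model, args=args, train_dataset=ToyDataset())
trainer.train()
```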
  20. 20 Apr, 2020 3 commits
  21. 18 Apr, 2020 1 commit
    • Cleanup fast tokenizers integration (#3706) · 827d6d6e (a usage sketch follows this entry)
      Thomas Wolf authored
      
      
      * First pass on utility classes and python tokenizers
      
      * finishing cleanup pass
      
      * style and quality
      
      * Fix tests
      
      * Updating following @mfuntowicz's comment
      
      * style and quality
      
      * Fix Roberta
      
      * fix batch_size/seq_length in BatchEncoding
      
      * add alignment methods + tests
      
      * Fix OpenAI and Transfo-XL tokenizers
      
      * adding trim_offsets=True default for GPT2 and RoBERTa
      
      * style and quality
      
      * fix tests
      
      * add_prefix_space in roberta
      
      * bump up tokenizers to rc7
      
      * style
      
      * unfortunately tensorflow doesn't like these - removing shape/seq_len for now
      
      * Update src/transformers/tokenization_utils.py
      Co-Authored-By: Stefan Schweter <stefan@schweter.it>
      
      * Adding doc and docstrings
      
      * making flake8 happy
      Co-authored-by: Stefan Schweter <stefan@schweter.it>
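      A hedged sketch of the alignment methods and offset options from this PR (modern call syntax; roberta-base is used because of the add_prefix_space/trim_offsets notes, and this works only with the fast tokenizer):

```python
# Sketch: BatchEncoding alignment helpers (fast tokenizers only).
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("roberta-base", use_fast=True, add_prefix_space=True)
enc = tok("Hello world", return_offsets_mapping=True)

print(enc.tokens())           # the underlying BPE tokens
print(enc.char_to_token(6))   # index of the token covering character 6 ('w')
print(enc["offset_mapping"])  # per-token (start, end) character spans
# With trim_offsets=True (the new default), offsets exclude the leading space.
```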