Commits · 8ffc01a76ad4c446b16322c3b893a8a3f39c14c0 · chenpangpang / transformers

23 Nov, 2020 1 commit

Add early stopping callback to pytorch trainer (#8581) · 8ffc01a7

Colin Brochtrup authored Nov 23, 2020

* Add early stopping patience and minimum threshold metric must improve to prevent early stopping to pytorch trainer

* Add early stopping test

* Set patience counter to 0 if best metric not defined yet

* Make early stopping a callback. Add callback event for updating the best metric for early stopping callback to trigger on.

* Run make style

* make funciton name sensible

* Improve new argument docstring wording and hope that flakey CI test passes.

* Use on_evaluation callback instead of custom. Remove some debug printing

* Move early stopping arguments and state into early stopping callback

* Run make style

* Remove old code

* Fix docs formatting. make style went rogue on me.

* Remove copied attributes and fix variable

* Add assertions on training arguments instead of mutating them. Move comment out of public docs.

* Make separate test for early stopping callback. Add test of invalid arguments.

* Run make style... I remembered before CI this time!

* appease flake8

* Add EarlyStoppingCallback to callback docs

* Make docstring EarlyStoppingCallabck match other callbacks.

* Fix typo in docs

8ffc01a7

19 Nov, 2020 1 commit
- Better filtering of the model outputs in Trainer (#8633) · 4208f496
  Sylvain Gugger authored Nov 19, 2020
```
* Better filtering of the model outputs in Trainer

* Fix examples tests

* Add test for Lysandre
```
  4208f496
18 Nov, 2020 1 commit
- Fixes the training resuming with gradient accumulation (#8624) · 1e62e999
  Sylvain Gugger authored Nov 18, 2020
  
  1e62e999
05 Nov, 2020 1 commit

Make Trainer evaluation handle dynamic seq_length (#8336) · 04e442d5

Sylvain Gugger authored Nov 05, 2020

* Make Trainer evaluation handle dynamic seq_length

* Document behavior.

* Fix test

* Better fix

* Fixes for realsies this time

* Address review comments

* Without forgetting to save...

04e442d5

03 Nov, 2020 1 commit
- Clean Trainer tests and datasets dep (#8268) · 4c19f3ba
  Sylvain Gugger authored Nov 03, 2020
  
  4c19f3ba
21 Oct, 2020 1 commit

TensorBoard/Wandb/optuna/raytune integration improvements. (#7935) · e174bfeb

François Lagunas authored Oct 21, 2020

Improved TensorBoard and Wandb integration, as well as optuna and ray/tune support, with minor modifications to trainer core code.

e174bfeb

19 Oct, 2020 1 commit

Trainer with Iterable Dataset (#7858) · a09fe140

Julien Rossi authored Oct 19, 2020

* fix 5990

* accomodate iterable dataset without predefined length
* set it as 1 use case: provide max_steps, and NO num_epochs
* Is a merge of master and PR 5995

* fix trainer test under TF

* fix only for torch
* TF trainer untouched
* trainer tests are skipped when no torch

* address comments

* fix quality checks

* remove torch.dataset from test_trainer

* unnecessary inheritance
* RegressionDataset implements all needed methods __len__ and __getitem__

* fix quality checks

* restore RegressionDataset

* was wrongly under is_torch_available()

a09fe140

18 Oct, 2020 1 commit

[Dependencies|tokenizers] Make both SentencePiece and Tokenizers optional dependencies (#7659) · ba8c4d0a

Thomas Wolf authored Oct 18, 2020

* splitting fast and slow tokenizers [WIP]

* [WIP] splitting sentencepiece and tokenizers dependencies

* update dummy objects

* add name_or_path to models and tokenizers

* prefix added to file names

* prefix

* styling + quality

* spliting all the tokenizer files - sorting sentencepiece based ones

* update tokenizer version up to 0.9.0

* remove hard dependency on sentencepiece 🎉

* and removed hard dependency on tokenizers 🎉



* update conversion script

* update missing models

* fixing tests

* move test_tokenization_fast to main tokenization tests - fix bugs

* bump up tokenizers

* fix bert_generation

* update ad fix several tokenizers

* keep sentencepiece in deps for now

* fix funnel and deberta tests

* fix fsmt

* fix marian tests

* fix layoutlm

* fix squeezebert and gpt2

* fix T5 tokenization

* fix xlnet tests

* style

* fix mbart

* bump up tokenizers to 0.9.2

* fix model tests

* fix tf models

* fix seq2seq examples

* fix tests without sentencepiece

* fix slow => fast  conversion without sentencepiece

* update auto and bert generation tests

* fix mbart tests

* fix auto and common test without tokenizers

* fix tests without tokenizers

* clean up tests lighten up when tokenizers + sentencepiece are both off

* style quality and tests fixing

* add sentencepiece to doc/examples reqs

* leave sentencepiece on for now

* style quality split hebert and fix pegasus

* WIP Herbert fast

* add sample_text_no_unicode and fix hebert tokenization

* skip FSMT example test for now

* fix style

* fix fsmt in example tests

* update following Lysandre and Sylvain's comments

* Update src/transformers/testing_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/testing_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/tokenization_utils_base.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/tokenization_utils_base.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

ba8c4d0a

14 Oct, 2020 1 commit

Add predict step accumulation (#7767) · a1d1b332

Sylvain Gugger authored Oct 14, 2020



* Add eval_accumulation_step and clean distributed eval

* Add TPU test

* Add TPU stuff

* Fix arg name

* Fix Seq2SeqTrainer

* Fix total_size

* Update src/transformers/trainer_pt_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Doc and add test to TPU

* Add unit test

* Adapt name
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

a1d1b332

13 Oct, 2020 1 commit
- Fix typo · 7968051a
  Sylvain Gugger authored Oct 13, 2020
  
  7968051a
10 Oct, 2020 1 commit
- Fix flaky test in test_trainer (#7689) · c6e18de9
  Sylvain Gugger authored Oct 09, 2020
  
  c6e18de9
05 Oct, 2020 1 commit
- Expand test to locate flakiness (#7580) · d3adb985
  Sylvain Gugger authored Oct 05, 2020
  
  d3adb985
01 Oct, 2020 1 commit

Clean the Trainer state (#7490) · 29baa8fa

Sylvain Gugger authored Oct 01, 2020

* Trainer should not modify its TrainingArguments

* Trainer should not modify its TrainingArguments

* Trainer should not modify its TrainingArguments

* Add test of resumed training

* Fixes

* Non multiGPU test

* Clean Trainer state

* Add more to the state

* Documentation

* One last test

* Make resume training test more complete

* Unwanted changes

29baa8fa

29 Sep, 2020 2 commits
- Fix Trainer tests in a multiGPU env (#7458) · 8546dc55
  Sylvain Gugger authored Sep 29, 2020
  
  8546dc55
- Add automatic best model loading to Trainer (#7431) · 52e8392b
  Sylvain Gugger authored Sep 29, 2020
```
* Add automatic best model loading to Trainer

* Some small fixes

* Formatting
```
  52e8392b
28 Sep, 2020 1 commit
- Flos fix (#7384) · 4083a55a
  Marcin Zabłocki authored Sep 28, 2020
  
  4083a55a
22 Sep, 2020 1 commit

Mark big downloads slow (#7325) · 1ee2194f

Sylvain Gugger authored Sep 22, 2020

* Make big downloads as slow

* Add import

* Right order for slow decorator

* More slow tests

1ee2194f

17 Sep, 2020 1 commit
- Trainer multi label (#7191) · 492bb6aa
  Sylvain Gugger authored Sep 17, 2020
```
* Trainer accep multiple labels

* Missing import

* Fix dosctrings
```
  492bb6aa
15 Sep, 2020 3 commits

fix ZeroDivisionError and epoch counting (#7125) · 4c62c602

Yih-Dar authored Sep 15, 2020

* fix ZeroDivisionError and epoch counting

* Add test for num_train_epochs calculation in trainer.py

* Remove @require_non_multigpu for test_num_train_epochs_in_training

4c62c602

Multi predictions trainer (#7126) · 7186ca62

Sylvain Gugger authored Sep 15, 2020

* Allow multiple outputs

* Formatting

* Move the unwrapping before metrics

* Fix typo

* Add test for non-supported config options

7186ca62

Fix reproducible tests in Trainer (#7119) · 2bf70e21
Sylvain Gugger authored Sep 15, 2020
```
* Fix reproducible tests in Trainer

* Deal with multiple GPUs
```
2bf70e21

14 Sep, 2020 1 commit
- Temporarily skip failing tests due to dependency change (#7118) · bb3106f7
  Lysandre Debut authored Sep 14, 2020
```
* Temporarily skip failing tests due to dependency change

* Remove trace
```
  bb3106f7
10 Sep, 2020 2 commits
- these tests require non-multigpu env (#7059) · 8fcbe486
  Stas Bekman authored Sep 10, 2020
```
* these tests require non-multigpu env

* cleanup

* clarify
```
  8fcbe486
- Fix CI with change of name of nlp (#7054) · 51448673
  Sylvain Gugger authored Sep 10, 2020
```
* nlp -> datasets

* More nlp -> datasets

* Woopsie

* More nlp -> datasets

* One last
```
  51448673
27 Aug, 2020 1 commit
- [testing] replace hardcoded paths to allow running tests from anywhere (#6523) · e6b811f0
  Stas Bekman authored Aug 27, 2020
```
* [testing] replace hardcoded paths to allow running tests from anywhere

* fix the merge conflict
```
  e6b811f0
26 Aug, 2020 1 commit
- Black 20 release · a75c64d8
  Lysandre authored Aug 26, 2020
  
  a75c64d8
25 Aug, 2020 1 commit
- More tests to Trainer (#6699) · abc02021
  Sylvain Gugger authored Aug 25, 2020
```
* More tests to Trainer

* Add warning in the doc
```
  abc02021
20 Aug, 2020 1 commit

Add tests to Trainer (#6605) · 573bdb0a

Sylvain Gugger authored Aug 20, 2020

* Add tests to Trainer

* Test if removing long breaks everything

* Remove ugly hack

* Fix distributed test

* Use float for number of epochs

573bdb0a

20 Jul, 2020 1 commit

Trainer support for iterabledataset (#5834) · 290b6e18

Pradhy729 authored Jul 20, 2020

* Don't pass sampler for iterable dataset

* Added check for test and eval dataloaders.

* Formatting

* Don't pass sampler for iterable dataset

* Added check for test and eval dataloaders.

* Formatting

* Cleaner if nesting.

* Added test for trainer and iterable dataset

* Formatting for test

* Fixed import when torch is available only.

* Added require torch decorator to helper class

* Moved dataset class inside unittest

* Removed nested if and changed model in test

* Checking torch availability for IterableDataset

290b6e18

07 Jul, 2020 1 commit

Added data collator for permutation (XLNet) language modeling and related calls (#5522) · 3dcb748e

Shashank Gupta authored Jul 07, 2020

* Added data collator for XLNet language modeling and related calls

Added DataCollatorForXLNetLanguageModeling in data/data_collator.py
to generate necessary inputs for language modeling training with
XLNetLMHeadModel. Also added related arguments, logic and calls in
examples/language-modeling/run_language_modeling.py.

Resolves: #4739, #2008 (partially)

* Changed name to `DataCollatorForPermutationLanguageModeling`

Changed the name of `DataCollatorForXLNetLanguageModeling` to the more general `DataCollatorForPermutationLanguageModelling`.
Removed the `--mlm` flag requirement for the new collator and defined a separate `--plm_probability` flag for its use.
CTRL uses a CLM loss just like GPT and GPT-2, so should work out of the box with this script (provided `past` is taken care of
similar to `mems` for XLNet).
Changed calls and imports appropriately.

* Added detailed comments, changed variable names

Added more detailed comments to `DataCollatorForPermutationLanguageModeling` in `data/data_collator.py` to explain working. Also cleaned up variable names and made them more informative.

* Added tests for new data collator

Added tests in `tests/test_trainer.py` for DataCollatorForPermutationLanguageModeling based on those in DataCollatorForLanguageModeling. A specific test has been added to check for odd-length sequences.

* Fixed styling issues

3dcb748e

01 Jul, 2020 2 commits
- Fix tensor label type inference in default collator (#5250) · 35befd9c
  Joe Davison authored Jul 01, 2020
```
* allow tensor label inputs to default collator

* replace try/except with type check
```
  35befd9c
- Move tests/utils.py -> transformers/testing_utils.py (#5350) · 13deb95a
  Sam Shleifer authored Jul 01, 2020
  
  13deb95a
18 Jun, 2020 1 commit
- Fix #5114 (#5122) · 5f721ad6
  Sylvain Gugger authored Jun 18, 2020
  
  5f721ad6
17 Jun, 2020 1 commit
- Make default_data_collator more flexible and deprecate old behavior (#5060) · 20fa8289
  Sylvain Gugger authored Jun 17, 2020
```
* Make default_data_collator more flexible

* Accept tensors for all features

* Document code

* Refactor

* Formatting
```
  20fa8289
15 Jun, 2020 1 commit

Make DataCollator a callable (#5015) · 1affde2f

Sylvain Gugger authored Jun 15, 2020



* Make DataCollator a callable

* Update src/transformers/data/data_collator.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

1affde2f

05 Jun, 2020 1 commit
- Fix argument label (#4792) · 4dd5cf22
  Sylvain Gugger authored Jun 05, 2020
```
* Fix argument label

* Fix test
```
  4dd5cf22
21 May, 2020 1 commit

Adds predict stage for glue tasks, and generate result files which can be... · 49296533

Zhangyx authored May 21, 2020


Adds predict stage for glue tasks, and generate result files which can be submitted to gluebenchmark.com (#4463)

* Adds predict stage for glue tasks, and generate result files which could be submitted to gluebenchmark.com website.

* Use Split enum + always output the label name
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

49296533

13 May, 2020 1 commit

(v2) Improvements to the wandb integration (#4324) · 24175910

Julien Chaumond authored May 12, 2020



* Improvements to the wandb integration

* small reorg + no global necessary

* feat(trainer): log epoch and final metrics

* Simplify logging a bit

* Fixup

* Fix crash when just running eval
Co-authored-by: Chris Van Pelt <vanpelt@gmail.com>
Co-authored-by: Boris Dayma <boris.dayma@gmail.com>

24175910

07 May, 2020 1 commit

BIG Reorganize examples (#4213) · 0ae96ff8

Julien Chaumond authored May 07, 2020

* Created using Colaboratory

* [examples] reorganize files

* remove run_tpu_glue.py as superseded by TPU support in Trainer

* Bugfix: int, not tuple

* move files around

0ae96ff8

22 Apr, 2020 1 commit

Trainer (#3800) · dd9d483d

Julien Chaumond authored Apr 21, 2020

* doc

* [tests] Add sample files for a regression task

* [HUGE] Trainer

* Feedback from @sshleifer

* Feedback from @thomwolf + logging tweak

* [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes

* [glue] Use default max_seq_length of 128 like before

* [glue] move DataTrainingArguments around

* [ner] Change interface of InputExample, and align run_{tf,pl}

* Re-align the pl scripts a little bit

* ner

* [ner] Add integration test

* Fix language_modeling with API tweak

* [ci] Tweak loss target

* Don't break console output

* amp.initialize: model must be on right device before

* [multiple-choice] update for Trainer

* Re-align to 827d6d6e

dd9d483d