- 08 Mar, 2021 3 commits

Sylvain Gugger authored
This reverts commit b35e7b68.

Sylvain Gugger authored

Stas Bekman authored

- 05 Mar, 2021 1 commit

Joakim Warholm authored

- 04 Mar, 2021 2 commits

Sylvain Gugger authored
* Rework TPU checkpointing in Trainer
* Wrap the barrier in a dist test
* Address review comments
* Remove line

Philipp Schmid authored
* removed overwrites
* remove default value for output_dir
* adjusted typing

- 03 Mar, 2021 2 commits

Sylvain Gugger authored
* Fix gradient accumulation for SM Model Parallelism
* Style and divide loss by grad accum steps

Stas Bekman authored
* remap classes to strings
* missing new util
* style
* doc
* move the autogenerated file
* Trigger CI

- 01 Mar, 2021 1 commit

Patrick von Platen authored
* add encode labels function to tokenizer
* start adding finetuning
* init dropout
* upload
* correct convert script
* apply changes
* fix second typo
* make first dummy training run
* adapt convert script
* push config for comparison
* remove conf
* finish training
* adapt data collator
* add research folder
* update according to fairseq feedback
* some minor corrections
* refactor masking indices a bit
* some minor changes
* clean tokenizer
* finish clean-up
* remove previous logic
* update run script
* correct training
* finish changes
* finish model
* correct bug
* fix training a bit more
* add some tests
* finish gradient checkpointing
* finish example
* correct gradient checkpointing
* improve tokenization method
* revert changes in tokenizer
* revert general change
* adapt fine-tuning
* update
* save intermediate test
* Update README.md
* finish finetuning
* delete conversion script
* Update src/transformers/models/wav2vec2/configuration_wav2vec2.py
* Update src/transformers/models/wav2vec2/processing_wav2vec2.py
* finish wav2vec2 script
* finish wav2vec2 fine-tuning
* finalize test
* correct test
* adapt tests
* finish
* remove test file
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

- 27 Feb, 2021 2 commits

Stas Bekman authored
* refactors
* typo

Amog Kamsetty authored
* fixes
* update resources
* formatting
* remove import
* add log statement
* use fstring
* add period
* Update src/transformers/integrations.py

- 25 Feb, 2021 1 commit

Sylvain Gugger authored
* Add support for ZeRO-2/3 and ZeRO-offload in fairscale
* Quality
* Rework from review comments
* Add doc
* Apply suggestions from code review
* Address review comments
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

- 24 Feb, 2021 2 commits

Stas Bekman authored
* move secondary methods into a separate file
* cleanup
* style

Stas Bekman authored
* handle get_last_lr() before first step()
* abstract away the lr getting logic
* cleanup
* add test
* move to utils

- 22 Feb, 2021 4 commits

Sylvain Gugger authored

Stas Bekman authored
* make logging and saving trainer built-in
* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Tanmay Garg authored
Enhance the resume_from_checkpoint argument of Trainer.train to also accept a bool. If True is given, the last saved checkpoint in self.args.output_dir is loaded. (#10280)

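Resolving True into a concrete checkpoint means picking the checkpoint-&lt;step&gt; subdirectory with the highest step number. A minimal sketch; the helper name here is illustrative, though transformers ships a similar get_last_checkpoint utility in trainer_utils:

```python
import os
import re
import tempfile  # used in the usage example below

_CHECKPOINT_RE = re.compile(r"^checkpoint-(\d+)$")

def resolve_last_checkpoint(output_dir):
    """Return the checkpoint-<step> subdirectory of output_dir with the
    highest step number, or None if no checkpoint exists."""
    candidates = []
    for name in os.listdir(output_dir):
        match = _CHECKPOINT_RE.match(name)
        if match and os.path.isdir(os.path.join(output_dir, name)):
            candidates.append((int(match.group(1)), name))
    if not candidates:
        return None
    # max() compares on the integer step, not the string name
    return os.path.join(output_dir, max(candidates)[1])
```

With resume_from_checkpoint=True, Trainer.train would first look up this latest directory, then restore model weights and trainer state from it.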
Stas Bekman authored
* implement gradient_accumulation_steps support in DeepSpeed integration
* typo
* cleanup
* cleanup

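Gradient accumulation, here and in the SM Model Parallelism fix above, comes down to scaling each micro-batch loss by 1/accumulation_steps and only stepping the optimizer every accumulation_steps micro-batches. A framework-free sketch of that arithmetic, with illustrative names:

```python
def train_with_accumulation(losses, accumulation_steps):
    """Simulate gradient accumulation: each micro-batch loss is divided by
    accumulation_steps so the accumulated value matches the average loss of
    the full effective batch. Returns the values the optimizer would step on."""
    steps = []
    accumulated = 0.0
    for i, loss in enumerate(losses, start=1):
        accumulated += loss / accumulation_steps  # scale before accumulating
        if i % accumulation_steps == 0:
            steps.append(accumulated)  # optimizer.step() would happen here
            accumulated = 0.0          # optimizer.zero_grad()
    return steps
```

Dividing before accumulating is what keeps the effective gradient magnitude independent of the accumulation factor.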
- 19 Feb, 2021 1 commit

Stas Bekman authored
* implement --fp16_full_eval
* Apply suggestions from code review
* style
* add test
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

- 18 Feb, 2021 2 commits

Stas Bekman authored
* memory tracker metrics
* go back to eval for somewhat consistency
* handle no-gpu case
* deal with stackable eval calls
* restore callback order
* style
* simplify the API
* add test
* docs
* consistently use eval_ prefix
* improve docs
* Update src/transformers/trainer_utils.py
* rename method
* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

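The "consistently use eval_ prefix" point means memory metrics are namespaced by training stage so train and eval numbers never collide. A sketch in that spirit, using stdlib tracemalloc; the function and metric-key names are illustrative, not the trainer's actual memory tracker (which also covers GPU memory):

```python
import tracemalloc

def measure_peak_memory(stage, fn, *args):
    """Run fn and return (result, metrics), with metric keys namespaced by
    the stage prefix (e.g. eval_). tracemalloc only tracks Python heap
    allocations, so this is a CPU-side approximation."""
    tracemalloc.start()
    result = fn(*args)
    _, peak = tracemalloc.get_traced_memory()  # (current, peak) in bytes
    tracemalloc.stop()
    metrics = {f"{stage}_mem_cpu_peaked_bytes": peak}
    return result, metrics
```

Keying metrics by stage is also what makes stacked eval calls (eval inside train) report without overwriting each other.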
Tanmay Garg authored
Introduce a warmup_ratio training argument in both the TrainingArguments and TFTrainingArguments classes (#6673)

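A warmup ratio has to be resolved into a concrete warmup step count from the total number of training steps. A sketch of that arithmetic; transformers' own helper for this is TrainingArguments.get_warmup_steps, and the precedence rule below is an assumption stated for illustration:

```python
import math

def resolve_warmup_steps(num_training_steps, warmup_steps=0, warmup_ratio=0.0):
    """An explicit warmup_steps takes precedence; otherwise the ratio is
    converted to steps, rounded up so any non-zero ratio warms up for at
    least one step."""
    if warmup_steps > 0:
        return warmup_steps
    return math.ceil(num_training_steps * warmup_ratio)
```

Rounding up rather than truncating matters for short runs, where int(steps * ratio) could silently become zero.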
- 17 Feb, 2021 1 commit

Stas Bekman authored
* refactor place_model_on_device logic, add deepspeed
* doc
* style

- 16 Feb, 2021 2 commits

Stas Bekman authored
* [trainer] fix ignored columns logger

  This PR fixes a confusing log entry that says:

  ```
  The following columns in the evaluation set don't have a corresponding argument in `T5ForConditionalGeneration.forward` and have been ignored: .
  ```

  when everything is in order.
* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Sylvain Gugger authored

- 15 Feb, 2021 1 commit

Sylvain Gugger authored

- 11 Feb, 2021 2 commits

Sylvain Gugger authored
* Refactor things out of main train
* Store signature
* Add SageMakerTrainer
* Init + Copyright
* Address review comments

Stas Bekman authored
* init devices/setup explicitly
* docs + test
* simplify
* cleanup
* cleanup
* cleanup
* correct the required dist setup
* derive local_rank from env LOCAL_RANK

- 10 Feb, 2021 1 commit

Stas Bekman authored
* free up memory at the end of train
* rework tests
* consistent formatting
* correction

- 08 Feb, 2021 1 commit

Stas Bekman authored
* deepspeed bug fixes and tests
* manual wrap?

- 04 Feb, 2021 2 commits

Sylvain Gugger authored

Stas Bekman authored
* trainer fixes
* don't switch the model just for deepspeed and mp
* correct the fix

- 03 Feb, 2021 1 commit

yylun authored
* fix steps_in_epoch variable when using max_steps
* redundant sentence
* Revert "redundant sentence"

  This reverts commit ad5c0e9b6e66d65732dee2239cdc9c76dfa0dc5a.
* remove redundant sentence
Co-authored-by: wujindou <wujindou@sogou-inc.com>

- 02 Feb, 2021 1 commit

Sylvain Gugger authored

- 29 Jan, 2021 1 commit

Sylvain Gugger authored
* When on SageMaker, use their env variables for saves
* Address review comments
* Quality

- 28 Jan, 2021 4 commits

abhishek thakur authored

abhishek thakur authored

Sylvain Gugger authored

abhishek thakur authored
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

- 27 Jan, 2021 2 commits

Sylvain Gugger authored
* When resuming training from checkpoint, Trainer loads model
* Finish cleaning tests
* Address review comment
* Use global_step from state

Sylvain Gugger authored
* Add a flag for find_unused_parameters
* Apply suggestions from code review
* Remove negation
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
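The find_unused_parameters flag is forwarded to PyTorch's DistributedDataParallel, whose keyword argument of the same name controls the per-step scan for parameters that received no gradient. A hedged config fragment, assuming the flag is exposed on TrainingArguments as ddp_find_unused_parameters (its name in later transformers releases):

```python
from transformers import TrainingArguments

# With gradient checkpointing, some parameters legitimately receive no
# gradient in a given pass, so DDP's unused-parameter scan must stay on;
# otherwise disabling it skips a costly per-step graph traversal.
args = TrainingArguments(
    output_dir="out",
    ddp_find_unused_parameters=False,  # forwarded to DistributedDataParallel
)
```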