- 18 Feb, 2021 1 commit
-
Stas Bekman authored
* memory tracker metrics (see the sketch below)
* go back to eval for some consistency
* handle the no-GPU case
* deal with stackable eval calls
* restore callback order
* style
* simplify the API
* add test
* docs
* consistently use the eval_ prefix
* improve docs
* Update src/transformers/trainer_utils.py
* rename method
* style

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
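As a rough illustration of what the memory tracker metrics measure (a sketch assuming plain PyTorch; the metric key names are invented for the example, not the library's exact keys):

    import torch

    def measure_gpu_memory(fn, prefix="eval"):
        """Run fn() and report GPU memory deltas keyed by a stage prefix."""
        if not torch.cuda.is_available():  # the no-GPU case
            return fn(), {}
        torch.cuda.reset_peak_memory_stats()
        before = torch.cuda.memory_allocated()
        result = fn()
        after = torch.cuda.memory_allocated()
        peak = torch.cuda.max_memory_allocated()
        return result, {
            f"{prefix}_mem_gpu_alloc_delta": after - before,  # net allocation
            f"{prefix}_mem_gpu_peaked_delta": peak - after,   # transient peak
        }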
-
- 17 Feb, 2021 1 commit
-
Stas Bekman authored
* fix invalid port
* missing requirements
-
- 16 Feb, 2021 1 commit
-
Zhang Cheng authored
-
- 15 Feb, 2021 2 commits
-
Suraj Patil authored
* move old s2s scripts to legacy
* add the tests back
* proper rename
* restore
* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
-
Stas Bekman authored
* fix run_seq2seq.py; port the DeepSpeed tests to it
* unrefactor
* defensive programming
* defensive programming 2
* port the rest of the trainer tests
* style
* a cleaner scripts dir finder (see the sketch below)
* cleanup
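For the "cleaner scripts dir finder", a sketch of the general pattern, assuming the tests resolve the examples directory relative to the test file instead of the working directory; the helper name and repo layout here are illustrative, not the repo's exact helper:

    from pathlib import Path

    def get_scripts_dir(subdir="seq2seq"):
        # assumes this file lives one level below the repo root (e.g. tests/)
        repo_root = Path(__file__).resolve().parents[1]
        return repo_root / "examples" / subdir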
-
- 12 Feb, 2021 1 commit
-
Suraj Patil authored
* fix ROUGE metrics and task-specific params
* fix typo
* round metrics (see the sketch below)
* typo
* remove task_specific_params
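A minimal sketch of the metric rounding mentioned above; the helper and key names are illustrative, not the script's exact output:

    def round_metrics(metrics, ndigits=4):
        # round float values so logged ROUGE scores stay readable
        return {k: round(v, ndigits) if isinstance(v, float) else v
                for k, v in metrics.items()}

    print(round_metrics({"rouge1": 43.123456, "rouge2": 20.987654, "gen_len": 17}))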
-
- 11 Feb, 2021 2 commits
-
Stas Bekman authored
* init devices/setup explicitly
* docs + test
* simplify
* cleanup
* cleanup
* cleanup
* correct the required dist setup
* derive local_rank from the LOCAL_RANK env var (see the sketch below)
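A sketch of the explicit device/distributed setup described here, assuming the process was started by a torch distributed launcher (which sets LOCAL_RANK along with the rendezvous variables); error handling is omitted:

    import os
    import torch

    local_rank = int(os.environ.get("LOCAL_RANK", -1))
    if local_rank != -1:
        # distributed run: bind this process to its GPU, then join the group
        torch.cuda.set_device(local_rank)
        torch.distributed.init_process_group(backend="nccl")
        device = torch.device("cuda", local_rank)
    else:
        device = torch.device("cuda" if torch.cuda.is_available() else "cpu")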
-
Qbiwan authored
* remove xnli_compute_metrics; add load_dataset, load_metric, set_seed, metric.compute (see the sketch below)
* everything works
* fix init
* special treatment for sepconv1d
* style
* 🙏🏽
* add doc and cleanup
* fix doc
* Apply suggestions from code review
* make style
* proposal that should work
* remove needless code
* fix test
* amend README
* removed data_args.task_name and replaced it with task_name = "xnli"; use the split argument to load the train and validation datasets separately; remove __post_init__; remove the --task_name flag from the README
* removed the task_to_keys dict and use the string "xnli" instead of a task_name variable; change preprocess_function to use examples["premise"] and examples["hypothesis"] directly, removing sentence1_key and sentence2_key; change compute_metrics to cater only to the accuracy metric; handle train_language being None when calling datasets.load_dataset()
* removed `torch.distributed.barrier()` and `import torch`, as `from_pretrained` is able to do the work
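A condensed sketch of the datasets/metrics pattern the rewrite moves to, assuming single-language fine-tuning; argument handling is simplified relative to the actual run_xnli.py:

    import numpy as np
    from datasets import load_dataset, load_metric

    train_language = "en"
    train_dataset = load_dataset("xnli", train_language, split="train")
    eval_dataset = load_dataset("xnli", train_language, split="validation")
    metric = load_metric("xnli")  # reduces to plain accuracy for XNLI

    def compute_metrics(logits, labels):
        preds = np.argmax(logits, axis=1)
        return metric.compute(predictions=preds, references=labels)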
-
- 10 Feb, 2021 2 commits
-
Stas Bekman authored
* free up memory at the end of train (see the sketch below)
* rework tests
* consistent formatting
* correction
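A sketch of the "free up memory at the end of train" idea: drop references that are no longer needed and release cached GPU blocks. The attribute names are assumptions for illustration, not necessarily what this commit touches:

    import gc
    import torch

    def free_memory_after_train(trainer):
        trainer.optimizer = None      # assumed attribute names, for illustration
        trainer.lr_scheduler = None
        gc.collect()
        if torch.cuda.is_available():
            torch.cuda.empty_cache()  # return cached blocks to the driver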
-
Lysandre Debut authored
-
- 09 Feb, 2021 2 commits
-
Boris Dayma authored
* doc: update W&B related doc
* doc(wandb): mention report_to (see the sketch below)
* doc(wandb): commit suggestion
* doc(wandb): fix typo
* doc(wandb): remove WANDB_DISABLED

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
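A minimal sketch of opting into W&B logging via report_to, which the updated docs favor over the old WANDB_DISABLED environment variable; output_dir is a placeholder:

    from transformers import TrainingArguments

    args = TrainingArguments(
        output_dir="out",     # placeholder
        report_to=["wandb"],  # or [] to disable all logging integrations
    )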
-
Suraj Patil authored
* add do_predict; pass eval_beams during eval
* update help
* apply suggestions from code review
-
- 08 Feb, 2021 6 commits
-
Stas Bekman authored
-
Stas Bekman authored
* DeepSpeed bug fixes and tests
* manual wrap?
-
Olivier authored
* replace -100 token ids with the tokenizer's pad_token_id for compute_metrics (see the sketch below)
* fix typo in label_ids
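A sketch of the pattern this fix describes: -100 is the loss-masking value in the labels, so it has to be swapped back to the pad token id before decoding; the helper name is illustrative:

    import numpy as np

    def decode_labels(label_ids, tokenizer):
        # -100 marks positions ignored by the loss; restore pad ids to decode
        label_ids = np.where(label_ids != -100, label_ids, tokenizer.pad_token_id)
        return tokenizer.batch_decode(label_ids, skip_special_tokens=True)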
-
Sylvain Gugger authored
-
Stas Bekman authored
-
Stas Bekman authored
-
- 05 Feb, 2021 2 commits
-
Stas Bekman authored
* make executable
* make executable
* same for the template
* cleanup
-
Suraj Patil authored
* add prepare_decoder_input_ids_from_labels to s2s models (see the sketch below)
* support label smoothing and encoder/embedding freezing
* fix freezing
* use pad_token_id from the config
* remove embedding freezing and add a warning
* prepare decoder_input_ids inside DataCollatorForSeq2Seq
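A hand-rolled sketch of what preparing decoder_input_ids from labels amounts to in a seq2seq model: shift the labels one position to the right, insert the decoder start token, and replace the masked -100 positions with pad_token_id. This is a simplification, not a copy of the library code:

    import torch

    def shift_labels_right(labels, pad_token_id, decoder_start_token_id):
        shifted = labels.new_zeros(labels.shape)
        shifted[:, 1:] = labels[:, :-1].clone()
        shifted[:, 0] = decoder_start_token_id
        # -100 masks the loss but is not a valid token id for the decoder input
        shifted.masked_fill_(shifted == -100, pad_token_id)
        return shifted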
-
- 03 Feb, 2021 2 commits
-
Suraj Patil authored
-
Stas Bekman authored
bleach looks like a vulnerability and isn't really used anywhere in the code, so we may as well remove it completely from the deps. https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/bleach/open
-
- 02 Feb, 2021 1 commit
-
Patrick von Platen authored
* change tokenizer requirement
* split line
* correct typo from list to str
* improve style
* make the other function pretty as well
* add comment
* correct typo
* add new test
* pass tests for tokenizers without a padding token (see the sketch below)
* Apply suggestions from code review
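For context on tokenizers without a padding token, one common workaround (a sketch of the general situation, not necessarily what this commit implements) is to fall back to the EOS token:

    from transformers import AutoTokenizer

    tok = AutoTokenizer.from_pretrained("gpt2")  # gpt2 ships with no pad token
    if tok.pad_token is None:
        tok.pad_token = tok.eos_token  # reuse EOS so padding code paths work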
-
- 01 Feb, 2021 3 commits
-
Sylvain Gugger authored
* Remove subclass for sortish sampler
* Use old Seq2SeqTrainer in script
* Styling
-
wlhgtc authored
* MOD: fit Chinese WWM to the new datasets
* MOD: move WWM to a new folder
* MOD: format code
* Styling
* MOD: add param and recover trainer

Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
-
Stas Bekman authored
-
- 29 Jan, 2021 1 commit
-
Stas Bekman authored
-
- 28 Jan, 2021 1 commit
-
Sylvain Gugger authored
-
- 27 Jan, 2021 1 commit
-
Sylvain Gugger authored
-
- 26 Jan, 2021 3 commits
-
Yusuke Mori authored
-
Magdalena Biesialska authored
-
Andrea Cappelli authored
* Pad to 8x for fp16 multiple choice example (#9752) (see the sketch below)
* Pad to 8x for fp16 squad trainer example (#9752)
* Pad to 8x for fp16 ner example (#9752)
* Pad to 8x for fp16 swag example (#9752)
* Pad to 8x for fp16 qa beam search example (#9752)
* Pad to 8x for fp16 qa example (#9752)
* Pad to 8x for fp16 seq2seq example (#9752)
* Pad to 8x for fp16 glue example (#9752)
* Pad to 8x for fp16 new ner example (#9752)
* update script template (#9752)
* Update examples/multiple-choice/run_swag.py
* Update examples/question-answering/run_qa.py
* Update examples/question-answering/run_qa_beam_search.py
* improve code quality (#9752)

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
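A sketch of the change these commits roll out: pad batches to a multiple of 8 under fp16 so recent GPUs can use their tensor cores efficiently; the checkpoint name is a placeholder:

    from transformers import AutoTokenizer, DataCollatorWithPadding

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")  # placeholder
    use_fp16 = True
    data_collator = DataCollatorWithPadding(
        tokenizer,
        pad_to_multiple_of=8 if use_fp16 else None,
    )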
-
- 25 Jan, 2021 1 commit
-
Sylvain Gugger authored
* Auto-resume training from checkpoint (see the sketch below)
* Update examples/text-classification/run_glue.py
* Roll out to other examples

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
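A sketch of the auto-resume flow, assuming the get_last_checkpoint helper from transformers.trainer_utils and a Trainer built as in the example scripts:

    import os
    from transformers.trainer_utils import get_last_checkpoint

    def train_with_auto_resume(trainer, output_dir):
        last_checkpoint = None
        if os.path.isdir(output_dir):
            # newest checkpoint-* subdirectory, or None if there is none
            last_checkpoint = get_last_checkpoint(output_dir)
        return trainer.train(resume_from_checkpoint=last_checkpoint)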
-
- 23 Jan, 2021 1 commit
-
Wilfried L. Bounsi authored
-
- 22 Jan, 2021 2 commits
-
Sylvain Gugger authored
* Fixes to run_seq2seq and instructions
* Add more defaults for summarization
-
Stefan Schweter authored
-
- 21 Jan, 2021 1 commit
-
Sylvain Gugger authored
* Fix memory regression in Seq2Seq example
* Fix test and properly deal with -100
* Easier condition with device safety
* Patch for MBartTokenizerFast
-
- 20 Jan, 2021 2 commits
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Restrain tokenizer.model_max_length default (see the sketch below)
* Fix indent
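A sketch of the idea behind restraining the default: some tokenizers report the huge sentinel int(1e30) as model_max_length instead of a real limit, so scripts should clamp it before use; the helper and fallback value are assumptions for illustration:

    def effective_max_length(tokenizer, fallback=1024):
        # int(1e30) is the sentinel used when no real limit is known
        if tokenizer.model_max_length >= int(1e30):
            return fallback
        return tokenizer.model_max_length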
-
- 19 Jan, 2021 1 commit
-
Sylvain Gugger authored
* New run_seq2seq script (see the sketch below)
* Add tests
* Mark as slow
* Update examples/seq2seq/run_seq2seq.py
* Update src/transformers/data/data_collator.py
* Address review comments

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
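A sketch of the collator wiring a script like this relies on: DataCollatorForSeq2Seq pads the inputs and pads the labels with -100 so padding is ignored by the loss. The checkpoint name is a placeholder, and the model argument reflects the Feb 5 change above that moved decoder_input_ids preparation into the collator:

    from transformers import (AutoModelForSeq2SeqLM, AutoTokenizer,
                              DataCollatorForSeq2Seq)

    tokenizer = AutoTokenizer.from_pretrained("t5-small")  # placeholder
    model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")
    data_collator = DataCollatorForSeq2Seq(
        tokenizer,
        model=model,              # used to prepare decoder_input_ids
        label_pad_token_id=-100,  # padded label positions ignored by the loss
    )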
-