Commits · b9720dd6f296ae222961a26cf43477cfc8d24d80 · chenpangpang / transformers

03 Feb, 2021 2 commits

[run_clm.py] fix getting extention · bca0dd5e
Suraj Patil authored Feb 03, 2021

bca0dd5e

[research proj] [lxmert] rm bleach dependency (#9970) · d55e10be

Stas Bekman authored Feb 03, 2021

Looks like a vulnerability and it's not really used anywhere in the code, so just as well remove it completely from deps.
https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/bleach/open

d55e10be

02 Feb, 2021 1 commit

[Tokenizer Utils Base] Make pad function more flexible (#9928) · 538b3b46

Patrick von Platen authored Feb 02, 2021

* change tokenizer requirement

* split line

* Correct typo from list to str

* improve style

* make other function pretty as well

* add comment

* correct typo

* add new test

* pass tests for tok without padding token

* Apply suggestions from code review

538b3b46

01 Feb, 2021 3 commits

Remove subclass for sortish sampler (#9907) · 115d97dd
Sylvain Gugger authored Feb 01, 2021
```
* Remove subclass for sortish sampler

* Use old Seq2SeqTrainer in script

* Styling
```
115d97dd

Fit chinese wwm to new datasets (#9887) · 1682804e

wlhgtc authored Feb 01, 2021



* MOD: fit chinese wwm to new datasets

* MOD: move wwm to new folder

* MOD: formate code

* Styling

* MOD add param and recover trainer
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

1682804e

fix logger format for non-main process (#9911) · 6bab8368
Stas Bekman authored Feb 01, 2021

6bab8368

29 Jan, 2021 1 commit
- correctly handle mt5 (#9879) · 6bf94bc0
  Stas Bekman authored Jan 29, 2021
  
  6bf94bc0
28 Jan, 2021 1 commit
- Deprecate model_path in Trainer.train (#9854) · b4e559cf
  Sylvain Gugger authored Jan 28, 2021
  
  b4e559cf
27 Jan, 2021 1 commit
- Setup logging with a stdout handler (#9816) · f2fabedb
  Sylvain Gugger authored Jan 27, 2021
  
  f2fabedb
26 Jan, 2021 3 commits

Fix a bug in run_glue.py (#9812) (#9815) · 059bb258
Yusuke Mori authored Jan 27, 2021

059bb258
Fix fine-tuning translation scripts (#9809) · 8f6c12d3
Magdalena Biesialska authored Jan 26, 2021

8f6c12d3

Improve pytorch examples for fp16 (#9796) · 10e5f282

Andrea Cappelli authored Jan 26, 2021



* Pad to 8x for fp16 multiple choice example (#9752)

* Pad to 8x for fp16 squad trainer example (#9752)

* Pad to 8x for fp16 ner example (#9752)

* Pad to 8x for fp16 swag example (#9752)

* Pad to 8x for fp16 qa beam search example (#9752)

* Pad to 8x for fp16 qa example (#9752)

* Pad to 8x for fp16 seq2seq example (#9752)

* Pad to 8x for fp16 glue example (#9752)

* Pad to 8x for fp16 new ner example (#9752)

* update script template #9752

* Update examples/multiple-choice/run_swag.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/question-answering/run_qa.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/question-answering/run_qa_beam_search.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* improve code quality #9752
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

10e5f282

25 Jan, 2021 1 commit

Auto-resume training from checkpoint (#9776) · caf4abf7

Sylvain Gugger authored Jan 25, 2021



* Auto-resume training from checkpoint

* Update examples/text-classification/run_glue.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Roll out to other examples
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

caf4abf7

23 Jan, 2021 1 commit
- Fix broken [Open in Colab] links (#9761) · 9152f160
  Wilfried L. Bounsi authored Jan 23, 2021
  
  9152f160
22 Jan, 2021 2 commits
- Fixes to run_seq2seq and instructions (#9734) · 411c5821
  Sylvain Gugger authored Jan 22, 2021
```
* Fixes to run_seq2seq and instructions

* Add more defaults for summarization
```
  411c5821
- examples: fix XNLI url (#9741) · 08b22722
  Stefan Schweter authored Jan 22, 2021
  
  08b22722
21 Jan, 2021 1 commit

Fix memory regression in Seq2Seq example (#9713) · 5f80c15e

Sylvain Gugger authored Jan 21, 2021

* Fix memory regression in Seq2Seq example

* Fix test and properly deal with -100

* Easier condition with device safety

* Patch for MBartTokenzierFast

5f80c15e

20 Jan, 2021 2 commits
- Use datasets squad_v2 metric in run_qa (#9677) · 582f516a
  Sylvain Gugger authored Jan 20, 2021
  
  582f516a
- Restrain tokenizer.model_max_length default (#9681) · a1ad16a4
  Sylvain Gugger authored Jan 20, 2021
```
* Restrain tokenizer.model_max_length default

* Fix indent
```
  a1ad16a4
19 Jan, 2021 2 commits

New run_seq2seq script (#9605) · e4c06ed6

Sylvain Gugger authored Jan 19, 2021



* New run_seq2seq script

* Add tests

* Mark as slow

* Update examples/seq2seq/run_seq2seq.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/data/data_collator.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Update src/transformers/data/data_collator.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Address review comments
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

e4c06ed6

Fix old Seq2SeqTrainer (#9675) · 97b787fb
Sylvain Gugger authored Jan 19, 2021

97b787fb

15 Jan, 2021 1 commit
- deepspeed + grad acumm (#9622) · c60e0e1e
  Stas Bekman authored Jan 15, 2021
  
  c60e0e1e
14 Jan, 2021 2 commits

Upstream (and rename) sortish sampler (#9574) · 329fe274

Sylvain Gugger authored Jan 14, 2021



* Upstream (and rename) sortish sampler

* Use proper sampler

* Update src/transformers/trainer_pt_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

329fe274

Switch metrics in run_ner to datasets (#9567) · 46ed56cf

Sylvain Gugger authored Jan 14, 2021

* Switch metrics in run_ner to datasets

* Add flag to return all metrics

* Upstream (and rename) sortish_sampler

* Revert "Upstream (and rename) sortish_sampler"

This reverts commit e07d0dcf650c2bae36da011dd76c77a8bb4feb0d.

46ed56cf

13 Jan, 2021 3 commits

Update run_glue for do_predict with local test data (#9442) (#9486) · eabad8fd

Yusuke Mori authored Jan 13, 2021

* Update run_glue for do_predict with local test data (#9442)

* Update run_glue (#9442): fix comments ('files' to 'a file')

* Update run_glue (#9442): reflect the code review

* Update run_glue (#9442): auto format

* Update run_glue (#9442): reflect the code review

eabad8fd

Fix classification script: enable dynamic padding with truncation (#9554) · 27d0e01d
Pavel Tarashkevich authored Jan 13, 2021
```
Co-authored-by: Pavel Tarashkevich <Pavel.Tarashkievich@orange.com>
```
27d0e01d

[trainer] deepspeed integration (#9211) · 2df34f4a

Stas Bekman authored Jan 12, 2021



* deepspeed integration

* style

* add test

* ds wants to do its own backward

* fp16 assert

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* style

* for clarity extract what args are being passed to deepspeed

* introduce the concept of self.wrapped_model

* s/self.wrapped_model/self.model_wrapped/

* complete transition to self.wrapped_model / self.model

* fix

* doc

* give ds its own init

* add custom overrides, handle bs correctly

* fix test

* clean up model_init logic, fix small bug

* complete fix

* collapse --deepspeed_config into --deepspeed

* style

* start adding doc notes

* style

* implement hf2ds optimizer and scheduler configuration remapping

* oops

* call get_num_training_steps absolutely when needed

* workaround broken auto-formatter

* deepspeed_config arg is no longer needed - fixed in deepspeed master

* use hf's fp16 args in config

* clean

* start on the docs

* rebase cleanup

* finish up --fp16

* clarify the supported stages

* big refactor thanks to discovering deepspeed.init_distributed

* cleanup

* revert fp16 part

* add checkpoint-support

* more init ds into integrations

* extend docs

* cleanup

* unfix docs

* clean up old code

* imports

* move docs

* fix logic

* make it clear which file it's referring to

* document nodes/gpus

* style

* wrong format

* style

* deepspeed handles gradient clipping

* easier to read

* major doc rewrite

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* docs

* switch to AdamW optimizer

* style

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* clarify doc
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

2df34f4a

07 Jan, 2021 1 commit
- Remove nested lxmert (#9440) · 3ec40299
  Sylvain Gugger authored Jan 07, 2021
  
  3ec40299
06 Jan, 2021 1 commit
- Allow example to use a revision and work with private models (#9407) · 453a70d4
  Sylvain Gugger authored Jan 06, 2021
```
* Allow example to use a revision and work with private models

* Copy to other examples and template

* Styling
```
  453a70d4
05 Jan, 2021 2 commits

[PyTorch Bart] Split Bart into different models (#9343) · eef66035

Patrick von Platen authored Jan 05, 2021

* first try

* remove old template

* finish bart

* finish mbart

* delete unnecessary line

* init pegasus

* save intermediate

* correct pegasus

* finish pegasus

* remove cookie cutter leftover

* add marian

* finish blenderbot

* replace in file

* correctly split blenderbot

* delete "old" folder

* correct "add statement"

* adapt config for tf comp

* correct configs for tf

* remove ipdb

* fix more stuff

* fix mbart

* push pegasus fix

* fix mbart

* more fixes

* fix research projects code

* finish docs for bart, mbart, and marian

* delete unnecessary file

* correct attn typo

* correct configs

* remove pegasus for seq class

* correct peg docs

* correct peg docs

* finish configs

* further improve docs

* add copied from statements to mbart

* fix copied from in mbart

* add copy statements to marian

* add copied from to marian

* add pegasus copied from

* finish pegasus

* finish copied from

* Apply suggestions from code review

* make style

* backward comp blenderbot

* apply lysandres and sylvains suggestions

* apply suggestions

* push last fixes

* fix docs

* fix tok tests

* fix imports code style

* fix doc

eef66035

[examples/text-classification] Fix a bug for using one's own dataset of a regression task (#9411) · 57a66269
Yusuke Mori authored Jan 05, 2021

57a66269

04 Jan, 2021 3 commits

Bump notebook from 6.1.4 to 6.1.5 in /examples/research_projects/lxmert (#9402) · 5dd389d1

dependabot[bot] authored Jan 04, 2021

Bumps [notebook](https://github.com/jupyter/jupyterhub) from 6.1.4 to 6.1.5.
- [Release notes](https://github.com/jupyter/jupyterhub/releases)
- [Changelog](https://github.com/jupyterhub/jupyterhub/blob/master/CHECKLIST-Release.md)
- [Commits](https://github.com/jupyter/jupyterhub/commits

)
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

5dd389d1

Put back LXMert example (#9401) · 23a71449
Sylvain Gugger authored Jan 04, 2021

23a71449
simplify marian distillation script (#9394) · 8eb7f26d
Sam Shleifer authored Jan 04, 2021

8eb7f26d

03 Jan, 2021 1 commit

Fix typos in README and bugs in RAG example code for end-to-end evaluation and finetuning (#9355) · d944966b

Yoshitomo Matsubara authored Jan 03, 2021

* fix a bug in eval_batch_retrieval

* should return parser as well as other staticmethod

* remove duplicate argument

* these kwargs are no longer accepted (cause TypeError in self.generator.generate of modeling_rag.py)

* fixed file paths in README

* moved an arg to add_ray_specific_args

d944966b

23 Dec, 2020 1 commit
- Adapt to new name of `label_smoothing_factor` training arg (#9282) · a1cb6e98
  Sylvain Gugger authored Dec 23, 2020
  
  a1cb6e98
22 Dec, 2020 4 commits
- Revert renaming in finetune_trainer (#9262) · e6c1f1ca
  Sylvain Gugger authored Dec 22, 2020
  
  e6c1f1ca
- Add speed metrics to all example scripts + template (#9260) · ab177588
  Sylvain Gugger authored Dec 22, 2020
  
  ab177588
- Fix link to bertabs/README.md (#9255) · 37d6fb5d
  Manuel Romero authored Dec 22, 2020
  
  37d6fb5d
- Fix link to old language modeling script (#9254) · 189c1b91
  Manuel Romero authored Dec 22, 2020
  
  189c1b91