Commits · 3312e96bfbebbad67ff29539d2df9211923ddd70 · chenpangpang / transformers

12 Apr, 2021 2 commits
- Fix typo (#11188) · cb251ba6
  Takuya Makino authored Apr 13, 2021
  
  cb251ba6
- model_path should be ignored as the checkpoint path (#11157) · ef102c48
  Masatoshi TSUCHIYA authored Apr 12, 2021
```
* model_path is refered as the path of the trainer, and should be ignored as the checkpoint path.

* Improved according to Sgugger's comment.
```
  ef102c48
09 Apr, 2021 3 commits
- [examples run_clm] fix _LazyModule hasher error (#11168) · 07f0bb69
  Stas Bekman authored Apr 09, 2021
```
* fix _LazyModule hasher error

* reword
```
  07f0bb69
- [examples/translation] support mBART-50 and M2M100 fine-tuning (#11170) · c161dd56
  Suraj Patil authored Apr 09, 2021
```
* keep a list of multilingual tokenizers

* add forced_bos_token argument
```
  c161dd56
- Update README.md (#11161) · 60607465
  Saviour Owolabi authored Apr 09, 2021
```
Corrected a typo ('Downlowd' to 'Download')
```
  60607465
08 Apr, 2021 4 commits

[tests] relocate core integration tests (#11146) · 66446909

Stas Bekman authored Apr 08, 2021

* relocate core integration tests

* add sys.path context manager

* cleanup

* try

* try2

* fix path

* doc

* style

* add dep

* add 2 more deps

66446909

Run mlm pad to multiple for fp16 (#11128) · 6c40e497
Andrea Cappelli authored Apr 08, 2021
```
* Add mlm collator pad to multiple option (#10627)

* Use padding to 8x in run mlm (#10627)
```
6c40e497

[DeepSpeed] ZeRO Stage 3 (#10753) · c6d66484

Stas Bekman authored Apr 08, 2021



* synced gpus

* fix

* fix

* need to use t5-small for quality tests

* notes

* complete merge

* fix a disappearing std stream problem

* start zero3 tests

* wip

* tune params

* sorting out the pre-trained model loading

* reworking generate loop wip

* wip

* style

* fix tests

* split the tests

* refactor tests

* wip

* parameterized

* fix

* workout the resume from non-ds checkpoint pass + test

* cleanup

* remove no longer needed code

* split getter/setter functions

* complete the docs

* suggestions

* gpus and their compute capabilities link

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* style

* remove invalid paramgd

* automatically configure zero3 params that rely on hidden size

* make _get_resized_embeddings zero3-aware

* add test exercising resize_token_embeddings()

* add docstring
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

c6d66484

[run_clm] clarify why we get the tokenizer warning on long input (#11145) · acc851e1

Stas Bekman authored Apr 08, 2021



* clarify why we get the warning here

* Update examples/language-modeling/run_clm.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* wording

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

acc851e1

07 Apr, 2021 2 commits
- [examples] fix white space (#11099) · 424419f5
  Stas Bekman authored Apr 07, 2021
```
these get concatenated without whitespace, so fix it
```
  424419f5
- fix: The 'warn' method is deprecated (#11105) · c9035e45
  Stas Bekman authored Apr 07, 2021
```
* The 'warn' method is deprecated

* fix test
```
  c9035e45
06 Apr, 2021 5 commits

Style · fd338abd
Sylvain Gugger authored Apr 06, 2021

fd338abd

accelerate question answering examples with no trainer (#11091) · aef4cf8c

SHYAM SUNDER KUMAR authored Apr 07, 2021



* accelerate question answering examples with no trainer

* removed train and eval flags also fixed fill np array function

* Update examples/question-answering/run_qa_beam_search_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/question-answering/run_qa_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

aef4cf8c

Development on v4.6.0dev0 · 9853c5dd
Lysandre authored Apr 06, 2021

9853c5dd
Release v4.5.0 · 4906a29f
Lysandre authored Apr 06, 2021

4906a29f
Add Readme for language modeling scripts with accelerate (#11073) · 6ab7d1a4
Hemil Desai authored Apr 06, 2021

6ab7d1a4

05 Apr, 2021 2 commits

Add `examples/language_modeling/run_clm_no_trainer.py` (#11026) · b51b87c4

Hemil Desai authored Apr 05, 2021



* Initial draft for clm no trainer

* Remove unwanted args

* Fix bug

* Update examples/language-modeling/run_clm_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b51b87c4

s|Pretrained|PreTrained| (#11048) · 3d39226a
Stas Bekman authored Apr 04, 2021

3d39226a

02 Apr, 2021 1 commit
- fixed typo: logging instead of logger (#11025) · 335c0ca3
  versis authored Apr 02, 2021
  
  335c0ca3
31 Mar, 2021 3 commits

Add `examples/language_modeling/run_mlm_no_trainer.py` (#11001) · 838f83d8

Hemil Desai authored Apr 01, 2021



* Add initial script for finetuning MLM models with accelerate

* Add evaluation metric calculation

* Fix bugs

* Use no_grad on evaluation

* update script docstring

* Update examples/language-modeling/run_mlm_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* PR feedback

* Fix CI failure

* Update examples/language-modeling/run_mlm_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

838f83d8

Enforce string-formatting with f-strings (#10980) · acc3bd9d

Sylvain Gugger authored Mar 31, 2021



* First third

* Styling and fix mistake

* Quality

* All the rest

* Treat %s and %d

* typo

* Missing )

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

acc3bd9d

Fixed some typos and removed legacy url (#10989) · 645f45c4

WybeKoper authored Mar 31, 2021



* Fixed typos

* Removed legacy colab notebook from readme
Co-authored-by: WybeKoper <WybeKoper@users.noreply.github.com>

645f45c4

30 Mar, 2021 2 commits
- fix md file to avoid evaluation crash (#10962) · e031162a
  Yih-Dar authored Mar 30, 2021
  
  e031162a
- [examples/s2s] added py7zr dep (#10971) · 3e09d813
  Philipp Schmid authored Mar 30, 2021
```
* added py7zr

* comment out check_min for sagemaker test

* added min version again
```
  3e09d813
29 Mar, 2021 5 commits

[vulnerability] dep fix (#10954) · 05c966f2

Stas Bekman authored Mar 29, 2021

Fixes https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/Pygments/open

@LysandreJik

05c966f2

Add `examples/multiple-choice/run_swag_no_trainer.py` (#10934) · 5057213b

Daniel Stancl authored Mar 29, 2021

* Initial commit

* Another bunch of updates

* make style quliaty + delete debug arg from bash script

* Use compue_metrics func

* Do a few fixes

* Add copyright

* Fix typos

5057213b

Remove duplicate code · 4002f95e
Sylvain Gugger authored Mar 29, 2021

4002f95e

Add `examples/run_ner_no_trainer.py` (#10902) · d7b50ce4

Daniel Stancl authored Mar 29, 2021

* Add NER example with accelerate library

* This commit contains the first (yet really unfinished)
version of a script for showing how to train HuggingFace model
with their new accelerate library.

* Fix metric calculation

* make style quality

* mv ner_no_trainer to token-classification dir

* Delete --debug flag from running script

* hf_datasets -> raw_datasets

* Make a few slight adjustments

* Add an informative comment + rewrite a help comment

* Change header

* Fix a few things

* Enforce to use fast tokenizers only

* DataCollatorWithPadding -> DataCollatorForTokenClassification

* Change bash script: python3 -> accelerate launch

* make style

* Add a few missing things (see below)

* Add a max-lenghth padding to predictions and labels to
enable accelerate gather functionality

* Add PyTorch no trainer example to the example README.md

* Remove --do-train from args as being redundant for now

* DataCollatorWithPadding -> DataCollatorForTokenClassification

* Remove some obsolete args.do_train conditions from the script

* Delete --do_train from bash running script

* Delete use_slow_tokenizer from args

* Add unintentionally removed flag --label_all_tokens

* Delete --debug flag from running script

d7b50ce4

Updated colab links in readme of examples (#10932) · ddea8771
WybeKoper authored Mar 29, 2021
```
Co-authored-by: WybeKoper <WybeKoper@users.noreply.github.com>
```
ddea8771

28 Mar, 2021 1 commit
- fixed finename (#10939) · 4f21e1dd
  Bhadresh Savani authored Mar 28, 2021
  
  4f21e1dd
26 Mar, 2021 1 commit

[vulnerability] fix dependency (#10914) · 3c27d246

Stas Bekman authored Mar 26, 2021

this PR fixes https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/PyYAML/open

3c27d246

25 Mar, 2021 1 commit
- run_glue_no_trainer: datasets -> raw_datasets (#10898) · 5f1491d3
  Jethro Kuan authored Mar 25, 2021
```
Use the correct variable (raw_datasets) instead of the module (datasets)
where appropriate.
```
  5f1491d3
23 Mar, 2021 1 commit

[Examples] Added predict stage and Updated Example Template (#10868) · 7ef40120

Bhadresh Savani authored Mar 23, 2021



* added predict stage

* added test keyword in exception message

* removed example specific saving predictions

* fixed f-string error

* removed extra line
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

7ef40120

22 Mar, 2021 6 commits

Use DataCollatorForSeq2Seq in run_summarization in all cases (#10856) · 9f8fa4e9
Eliza Szczechla authored Mar 22, 2021
```
Co-authored-by: Eliza <eliza@habanero.tiger.com.pl>
```
9f8fa4e9

feat(wandb): logging and configuration improvements (#10826) · 125ccead

Boris Dayma authored Mar 22, 2021

* feat: ensure unique artifact id

* feat: allow manual init

* fix: simplify reinit logic

* fix: no dropped value + immediate commits

* fix: wandb use in sagemaker

* docs: improve documenation and formatting

* fix: typos

* docs: improve formatting

125ccead

[vulnerability] in example deps fix (#10817) · 8fb46718

Stas Bekman authored Mar 22, 2021

Takes care of:
https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/jinja2/open



@LysandreJik
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

8fb46718

Bump jinja2 from 2.11.2 to 2.11.3 in /examples/research_projects/lxmert (#10818) · dbfe3795

dependabot[bot] authored Mar 22, 2021

Bumps [jinja2](https://github.com/pallets/jinja) from 2.11.2 to 2.11.3.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/master/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/2.11.2...2.11.3

)
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

dbfe3795

Update FINE_TUNE_XLSR_WAV2VEC2.md (#10849) · 29904a96
Qiushi Pan authored Mar 22, 2021
```
Fix typo.
```
29904a96
push (#10846) · 0f226f78
Patrick von Platen authored Mar 22, 2021

0f226f78

21 Mar, 2021 1 commit
- Update FINE_TUNE_XLSR_WAV2VEC2.md · 82b8d8c7
  Suraj Patil authored Mar 21, 2021
  
  82b8d8c7