Commits · 8c9b5fcbaf27cbf1aa781670d598cf74c07b7e88 · chenpangpang / transformers

22 Apr, 2021 2 commits
- Correctly cast num_train_epochs to int (#11379) · 26173960
  Matt authored Apr 22, 2021
  
  26173960
- [run_translation.py] fix typo (#11372) · 5b5e4ca3
  johnson7788 authored Apr 22, 2021
```
fix typo
Co-authored-by: johnson <johnson@github.com>
```
  5b5e4ca3
21 Apr, 2021 3 commits

Move old TF text classification script to legacy (#11361) · 6fe79e57
Matt authored Apr 21, 2021
```
And update README to explain the work-in-progress!
```
6fe79e57
Merge new TF example script (#11360) · ac588594
Matt authored Apr 21, 2021
```
First of the new and more idiomatic TF examples!
```
ac588594

Sylvain Gugger authored Apr 21, 2021



* Base move

* Examples reorganization

* Update references

* Put back test data

* Move conftest

* More fixes

* Move test data to test fixtures

* Update path

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments and clean
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

dabeb152

20 Apr, 2021 2 commits

Update to use datasets remove_cloumns method (#11343) · f1b938fd
Sylvain Gugger authored Apr 20, 2021
```
* Update to use datasets remove_cloumns method

* Quality
```
f1b938fd

Added translation example script (#11196) · bfd83c17

rajvi-k authored Apr 20, 2021

* initial changes

* modified evaluation

* updated evaluation

* updated evaluation on text translation example script

* added translation example script

* Formatted translation example script

* Reformatted translation example

* Fixed evaluation bug and added support for other tokenisers

* Fixed evaluation bug and added support for other tokenisers

* Added translation example script

* Formatted summarization example script

* Removed typos from summarization example script

bfd83c17

14 Apr, 2021 2 commits
- Close open files to suppress ResourceWarning (#11240) · f25444cb
  Sudharsan S T authored Apr 14, 2021
```
Co-authored-by: Sudharsan Thirumalai <sudharsan.t@sprinklr.com>
```
  f25444cb
- Save the Wav2Vec2 processor before training starts (#10910) · 653076ca
  Nithin Holla authored Apr 14, 2021
```
Co-authored-by: nithin19 <nithin@amberscript.com>
```
  653076ca
13 Apr, 2021 1 commit
- added cache_dir=model_args.cache_dir to all example with cache_dir arg (#11220) · 9fa29959
  Philipp Schmid authored Apr 13, 2021
  
  9fa29959
12 Apr, 2021 2 commits
- Fix typo (#11188) · cb251ba6
  Takuya Makino authored Apr 13, 2021
  
  cb251ba6
- model_path should be ignored as the checkpoint path (#11157) · ef102c48
  Masatoshi TSUCHIYA authored Apr 12, 2021
```
* model_path is refered as the path of the trainer, and should be ignored as the checkpoint path.

* Improved according to Sgugger's comment.
```
  ef102c48
09 Apr, 2021 3 commits
- [examples run_clm] fix _LazyModule hasher error (#11168) · 07f0bb69
  Stas Bekman authored Apr 09, 2021
```
* fix _LazyModule hasher error

* reword
```
  07f0bb69
- [examples/translation] support mBART-50 and M2M100 fine-tuning (#11170) · c161dd56
  Suraj Patil authored Apr 09, 2021
```
* keep a list of multilingual tokenizers

* add forced_bos_token argument
```
  c161dd56
- Update README.md (#11161) · 60607465
  Saviour Owolabi authored Apr 09, 2021
```
Corrected a typo ('Downlowd' to 'Download')
```
  60607465
08 Apr, 2021 4 commits

[tests] relocate core integration tests (#11146) · 66446909

Stas Bekman authored Apr 08, 2021

* relocate core integration tests

* add sys.path context manager

* cleanup

* try

* try2

* fix path

* doc

* style

* add dep

* add 2 more deps

66446909

Run mlm pad to multiple for fp16 (#11128) · 6c40e497
Andrea Cappelli authored Apr 08, 2021
```
* Add mlm collator pad to multiple option (#10627)

* Use padding to 8x in run mlm (#10627)
```
6c40e497

[DeepSpeed] ZeRO Stage 3 (#10753) · c6d66484

Stas Bekman authored Apr 08, 2021



* synced gpus

* fix

* fix

* need to use t5-small for quality tests

* notes

* complete merge

* fix a disappearing std stream problem

* start zero3 tests

* wip

* tune params

* sorting out the pre-trained model loading

* reworking generate loop wip

* wip

* style

* fix tests

* split the tests

* refactor tests

* wip

* parameterized

* fix

* workout the resume from non-ds checkpoint pass + test

* cleanup

* remove no longer needed code

* split getter/setter functions

* complete the docs

* suggestions

* gpus and their compute capabilities link

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* style

* remove invalid paramgd

* automatically configure zero3 params that rely on hidden size

* make _get_resized_embeddings zero3-aware

* add test exercising resize_token_embeddings()

* add docstring
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

c6d66484

[run_clm] clarify why we get the tokenizer warning on long input (#11145) · acc851e1

Stas Bekman authored Apr 08, 2021



* clarify why we get the warning here

* Update examples/language-modeling/run_clm.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* wording

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

acc851e1

07 Apr, 2021 2 commits
- [examples] fix white space (#11099) · 424419f5
  Stas Bekman authored Apr 07, 2021
```
these get concatenated without whitespace, so fix it
```
  424419f5
- fix: The 'warn' method is deprecated (#11105) · c9035e45
  Stas Bekman authored Apr 07, 2021
```
* The 'warn' method is deprecated

* fix test
```
  c9035e45
06 Apr, 2021 5 commits

Style · fd338abd
Sylvain Gugger authored Apr 06, 2021

fd338abd

accelerate question answering examples with no trainer (#11091) · aef4cf8c

SHYAM SUNDER KUMAR authored Apr 07, 2021



* accelerate question answering examples with no trainer

* removed train and eval flags also fixed fill np array function

* Update examples/question-answering/run_qa_beam_search_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/question-answering/run_qa_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

aef4cf8c

Development on v4.6.0dev0 · 9853c5dd
Lysandre authored Apr 06, 2021

9853c5dd
Release v4.5.0 · 4906a29f
Lysandre authored Apr 06, 2021

4906a29f
Add Readme for language modeling scripts with accelerate (#11073) · 6ab7d1a4
Hemil Desai authored Apr 06, 2021

6ab7d1a4

05 Apr, 2021 2 commits

Add `examples/language_modeling/run_clm_no_trainer.py` (#11026) · b51b87c4

Hemil Desai authored Apr 05, 2021



* Initial draft for clm no trainer

* Remove unwanted args

* Fix bug

* Update examples/language-modeling/run_clm_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b51b87c4

s|Pretrained|PreTrained| (#11048) · 3d39226a
Stas Bekman authored Apr 04, 2021

3d39226a

02 Apr, 2021 1 commit
- fixed typo: logging instead of logger (#11025) · 335c0ca3
  versis authored Apr 02, 2021
  
  335c0ca3
31 Mar, 2021 3 commits

Add `examples/language_modeling/run_mlm_no_trainer.py` (#11001) · 838f83d8

Hemil Desai authored Apr 01, 2021



* Add initial script for finetuning MLM models with accelerate

* Add evaluation metric calculation

* Fix bugs

* Use no_grad on evaluation

* update script docstring

* Update examples/language-modeling/run_mlm_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* PR feedback

* Fix CI failure

* Update examples/language-modeling/run_mlm_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

838f83d8

Enforce string-formatting with f-strings (#10980) · acc3bd9d

Sylvain Gugger authored Mar 31, 2021



* First third

* Styling and fix mistake

* Quality

* All the rest

* Treat %s and %d

* typo

* Missing )

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

acc3bd9d

Fixed some typos and removed legacy url (#10989) · 645f45c4

WybeKoper authored Mar 31, 2021



* Fixed typos

* Removed legacy colab notebook from readme
Co-authored-by: WybeKoper <WybeKoper@users.noreply.github.com>

645f45c4

30 Mar, 2021 2 commits
- fix md file to avoid evaluation crash (#10962) · e031162a
  Yih-Dar authored Mar 30, 2021
  
  e031162a
- [examples/s2s] added py7zr dep (#10971) · 3e09d813
  Philipp Schmid authored Mar 30, 2021
```
* added py7zr

* comment out check_min for sagemaker test

* added min version again
```
  3e09d813
29 Mar, 2021 5 commits

[vulnerability] dep fix (#10954) · 05c966f2

Stas Bekman authored Mar 29, 2021

Fixes https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/Pygments/open

@LysandreJik

05c966f2

Add `examples/multiple-choice/run_swag_no_trainer.py` (#10934) · 5057213b

Daniel Stancl authored Mar 29, 2021

* Initial commit

* Another bunch of updates

* make style quliaty + delete debug arg from bash script

* Use compue_metrics func

* Do a few fixes

* Add copyright

* Fix typos

5057213b

Remove duplicate code · 4002f95e
Sylvain Gugger authored Mar 29, 2021

4002f95e

Add `examples/run_ner_no_trainer.py` (#10902) · d7b50ce4

Daniel Stancl authored Mar 29, 2021

* Add NER example with accelerate library

* This commit contains the first (yet really unfinished)
version of a script for showing how to train HuggingFace model
with their new accelerate library.

* Fix metric calculation

* make style quality

* mv ner_no_trainer to token-classification dir

* Delete --debug flag from running script

* hf_datasets -> raw_datasets

* Make a few slight adjustments

* Add an informative comment + rewrite a help comment

* Change header

* Fix a few things

* Enforce to use fast tokenizers only

* DataCollatorWithPadding -> DataCollatorForTokenClassification

* Change bash script: python3 -> accelerate launch

* make style

* Add a few missing things (see below)

* Add a max-lenghth padding to predictions and labels to
enable accelerate gather functionality

* Add PyTorch no trainer example to the example README.md

* Remove --do-train from args as being redundant for now

* DataCollatorWithPadding -> DataCollatorForTokenClassification

* Remove some obsolete args.do_train conditions from the script

* Delete --do_train from bash running script

* Delete use_slow_tokenizer from args

* Add unintentionally removed flag --label_all_tokens

* Delete --debug flag from running script

d7b50ce4

Updated colab links in readme of examples (#10932) · ddea8771
WybeKoper authored Mar 29, 2021
```
Co-authored-by: WybeKoper <WybeKoper@users.noreply.github.com>
```
ddea8771

28 Mar, 2021 1 commit
- fixed finename (#10939) · 4f21e1dd
  Bhadresh Savani authored Mar 28, 2021
  
  4f21e1dd