Commits · 8b945ef03eae3d234d07f1724229902cfecf38b8 · chenpangpang / transformers

30 Apr, 2021 1 commit
- Update README.md (#11489) · 58c789e3
  Manuel Romero authored Apr 30, 2021
```
Add link to code
```
  58c789e3
29 Apr, 2021 1 commit
- Split checkpoint from model_name_or_path in examples (#11492) · b29eb247
  Sylvain Gugger authored Apr 29, 2021
```
* Split checkpoint from model_name_or_path in examples

* Address review comments

* Address review comments
```
  b29eb247
26 Apr, 2021 4 commits

Variable Correction for Consistency in Distillation Example (#11444) · 0661abc5

Jaimeen Ahn authored Apr 27, 2021

As the error comes from the inconsistency of variable meaning number of gpus in parser and its actual usage in the train.py script, 'gpus' and 'n_gpu' respectively,  the correction makes the example work

0661abc5

[Examples] Fixes inconsistency around eval vs val and predict vs test (#11380) · 1d30ec95

Bhadresh Savani authored Apr 26, 2021

* added changes for uniformity

* modified files

* corrected typo

* fixed qa scripts

* fix typos

* fixed predict typo in qa no trainer

* fixed test file

* reverted trainer changes

* reverted trainer changes in custom exmaples

* updated readme

* added changes in deepspeed test

* added changes for predict and eval

1d30ec95

docs(examples): fix link to TPU launcher script (#11427) · e3e70f95
Amine Abdaoui authored Apr 26, 2021

e3e70f95
make style (#11442) · 32dbb2d9
Patrick von Platen authored Apr 26, 2021

32dbb2d9

23 Apr, 2021 5 commits

Default to accuracy metric (#11405) · 1ef152eb
Sylvain Gugger authored Apr 23, 2021

1ef152eb

Trainer push to hub (#11328) · bf2e0cf7

Sylvain Gugger authored Apr 23, 2021



* Initial support for upload to hub

* push -> upload

* Fixes + examples

* Fix torchhub test

* Torchhub test I hate you

* push_model_to_hub -> push_to_hub

* Apply mixin to other pretrained models

* Remove ABC inheritance

* Add tests

* Typo

* Run tests

* Install git-lfs

* Change approach

* Add push_to_hub to all

* Staging test suite

* Typo

* Maybe like this?

* More deps

* Cache

* Adapt name

* Quality

* MOAR tests

* Put it in testing_utils

* Docs + torchhub last hope

* Styling

* Wrong method

* Typos

* Update src/transformers/file_utils.py
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Address review comments

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Julien Chaumond <julien@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

bf2e0cf7

fixed typos (#11391) · c3d6f339
Yoshitomo Matsubara authored Apr 23, 2021

c3d6f339
Fix typo in text (#11396) · a90d3f18
Max Del authored Apr 23, 2021

a90d3f18
correct typo (#11393) · b48cf712
Patrick von Platen authored Apr 23, 2021

b48cf712

22 Apr, 2021 2 commits
- Correctly cast num_train_epochs to int (#11379) · 26173960
  Matt authored Apr 22, 2021
  
  26173960
- [run_translation.py] fix typo (#11372) · 5b5e4ca3
  johnson7788 authored Apr 22, 2021
```
fix typo
Co-authored-by: johnson <johnson@github.com>
```
  5b5e4ca3
21 Apr, 2021 3 commits

Move old TF text classification script to legacy (#11361) · 6fe79e57
Matt authored Apr 21, 2021
```
And update README to explain the work-in-progress!
```
6fe79e57
Merge new TF example script (#11360) · ac588594
Matt authored Apr 21, 2021
```
First of the new and more idiomatic TF examples!
```
ac588594

Examples reorg (#11350) · dabeb152

Sylvain Gugger authored Apr 21, 2021



* Base move

* Examples reorganization

* Update references

* Put back test data

* Move conftest

* More fixes

* Move test data to test fixtures

* Update path

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments and clean
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

dabeb152

20 Apr, 2021 2 commits

Update to use datasets remove_cloumns method (#11343) · f1b938fd
Sylvain Gugger authored Apr 20, 2021
```
* Update to use datasets remove_cloumns method

* Quality
```
f1b938fd

Added translation example script (#11196) · bfd83c17

rajvi-k authored Apr 20, 2021

* initial changes

* modified evaluation

* updated evaluation

* updated evaluation on text translation example script

* added translation example script

* Formatted translation example script

* Reformatted translation example

* Fixed evaluation bug and added support for other tokenisers

* Fixed evaluation bug and added support for other tokenisers

* Added translation example script

* Formatted summarization example script

* Removed typos from summarization example script

bfd83c17

14 Apr, 2021 2 commits
- Close open files to suppress ResourceWarning (#11240) · f25444cb
  Sudharsan S T authored Apr 14, 2021
```
Co-authored-by: Sudharsan Thirumalai <sudharsan.t@sprinklr.com>
```
  f25444cb
- Save the Wav2Vec2 processor before training starts (#10910) · 653076ca
  Nithin Holla authored Apr 14, 2021
```
Co-authored-by: nithin19 <nithin@amberscript.com>
```
  653076ca
13 Apr, 2021 1 commit
- added cache_dir=model_args.cache_dir to all example with cache_dir arg (#11220) · 9fa29959
  Philipp Schmid authored Apr 13, 2021
  
  9fa29959
12 Apr, 2021 2 commits
- Fix typo (#11188) · cb251ba6
  Takuya Makino authored Apr 13, 2021
  
  cb251ba6
- model_path should be ignored as the checkpoint path (#11157) · ef102c48
  Masatoshi TSUCHIYA authored Apr 12, 2021
```
* model_path is refered as the path of the trainer, and should be ignored as the checkpoint path.

* Improved according to Sgugger's comment.
```
  ef102c48
09 Apr, 2021 3 commits
- [examples run_clm] fix _LazyModule hasher error (#11168) · 07f0bb69
  Stas Bekman authored Apr 09, 2021
```
* fix _LazyModule hasher error

* reword
```
  07f0bb69
- [examples/translation] support mBART-50 and M2M100 fine-tuning (#11170) · c161dd56
  Suraj Patil authored Apr 09, 2021
```
* keep a list of multilingual tokenizers

* add forced_bos_token argument
```
  c161dd56
- Update README.md (#11161) · 60607465
  Saviour Owolabi authored Apr 09, 2021
```
Corrected a typo ('Downlowd' to 'Download')
```
  60607465
08 Apr, 2021 4 commits

[tests] relocate core integration tests (#11146) · 66446909

Stas Bekman authored Apr 08, 2021

* relocate core integration tests

* add sys.path context manager

* cleanup

* try

* try2

* fix path

* doc

* style

* add dep

* add 2 more deps

66446909

Run mlm pad to multiple for fp16 (#11128) · 6c40e497
Andrea Cappelli authored Apr 08, 2021
```
* Add mlm collator pad to multiple option (#10627)

* Use padding to 8x in run mlm (#10627)
```
6c40e497

[DeepSpeed] ZeRO Stage 3 (#10753) · c6d66484

Stas Bekman authored Apr 08, 2021



* synced gpus

* fix

* fix

* need to use t5-small for quality tests

* notes

* complete merge

* fix a disappearing std stream problem

* start zero3 tests

* wip

* tune params

* sorting out the pre-trained model loading

* reworking generate loop wip

* wip

* style

* fix tests

* split the tests

* refactor tests

* wip

* parameterized

* fix

* workout the resume from non-ds checkpoint pass + test

* cleanup

* remove no longer needed code

* split getter/setter functions

* complete the docs

* suggestions

* gpus and their compute capabilities link

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* style

* remove invalid paramgd

* automatically configure zero3 params that rely on hidden size

* make _get_resized_embeddings zero3-aware

* add test exercising resize_token_embeddings()

* add docstring
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

c6d66484

[run_clm] clarify why we get the tokenizer warning on long input (#11145) · acc851e1

Stas Bekman authored Apr 08, 2021



* clarify why we get the warning here

* Update examples/language-modeling/run_clm.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* wording

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

acc851e1

07 Apr, 2021 2 commits
- [examples] fix white space (#11099) · 424419f5
  Stas Bekman authored Apr 07, 2021
```
these get concatenated without whitespace, so fix it
```
  424419f5
- fix: The 'warn' method is deprecated (#11105) · c9035e45
  Stas Bekman authored Apr 07, 2021
```
* The 'warn' method is deprecated

* fix test
```
  c9035e45
06 Apr, 2021 5 commits

Style · fd338abd
Sylvain Gugger authored Apr 06, 2021

fd338abd

accelerate question answering examples with no trainer (#11091) · aef4cf8c

SHYAM SUNDER KUMAR authored Apr 07, 2021



* accelerate question answering examples with no trainer

* removed train and eval flags also fixed fill np array function

* Update examples/question-answering/run_qa_beam_search_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/question-answering/run_qa_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

aef4cf8c

Development on v4.6.0dev0 · 9853c5dd
Lysandre authored Apr 06, 2021

9853c5dd
Release v4.5.0 · 4906a29f
Lysandre authored Apr 06, 2021

4906a29f
Add Readme for language modeling scripts with accelerate (#11073) · 6ab7d1a4
Hemil Desai authored Apr 06, 2021

6ab7d1a4

05 Apr, 2021 2 commits

Add `examples/language_modeling/run_clm_no_trainer.py` (#11026) · b51b87c4

Hemil Desai authored Apr 05, 2021



* Initial draft for clm no trainer

* Remove unwanted args

* Fix bug

* Update examples/language-modeling/run_clm_no_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b51b87c4

s|Pretrained|PreTrained| (#11048) · 3d39226a
Stas Bekman authored Apr 04, 2021

3d39226a

02 Apr, 2021 1 commit
- fixed typo: logging instead of logger (#11025) · 335c0ca3
  versis authored Apr 02, 2021
  
  335c0ca3