Commits · 00440e350f58e33435f823ec8a940bd3861fe7ba · chenpangpang / transformers

19 May, 2021 1 commit

- [Flax MLM] Refactor run mlm with optax (#11745) · 00440e35
  Patrick von Platen authored May 19, 2021

* refactor

* update

* update

* update

* refactor run mlm

* finalize

* refactor more

* fix typo

* update

* finish refactor

* modify run mlm

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

* small fixes

* upload

* upload

* finish run mlm script
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

00440e35

18 May, 2021 5 commits
- Fix a small error in summarization example (#11762) · eb3e072a
  Tomy Hsieh authored May 19, 2021
  
  eb3e072a
- Add Flax Examples and Cloud TPU README (#11753) · 77f9bd18
  Avital Oliver authored May 18, 2021
```
* Add Flax Examples README

* Apply suggestions from code review

* Update examples/flax/README.md

* add nice table

* fix

* fix

* apply suggestions

* upload

* finish flax readme.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  77f9bd18
- add `dataset_name` to data_args and added accuracy metric (#11760) · 04e25c62
  Philipp Schmid authored May 18, 2021
```
* add `dataset_name` to data_args and added accuracy metric

* added documentation for dataset_name

* spelling correction
```
  04e25c62
- Add more subsections to main doc (#11758) · cebb96f5
  Patrick von Platen authored May 18, 2021
```
* add headers to main doc

* Apply suggestions from code review

* update

* upload
```
  cebb96f5
- Fix incorrect newline in #11650 (#11757) · da7e73b7
  Tommy Chiang authored May 18, 2021
  
  da7e73b7
17 May, 2021 2 commits
- Use new evaluation loop in TrainerQA (#11746) · 936b5715
  Sylvain Gugger authored May 17, 2021
  
  936b5715
- Improvements to Flax finetuning script (#11727) · 726e953d
  Marc van Zee authored May 17, 2021
```
* Add Cloud details to README

* Flax script and readme updates

* Some simplifications of Flax script
```
  726e953d
14 May, 2021 2 commits
- Add Cloud details to README (#11706) · 94a23487
  Marc van Zee authored May 14, 2021
```
* Add Cloud details to README

* Flax script and readme updates
```
  94a23487
- correct example script (#11726) · 113eaa75
  Patrick von Platen authored May 14, 2021
  
  113eaa75
12 May, 2021 4 commits
- Docs for v4.7.0.dev0 · d77eb0cf
  Lysandre authored May 12, 2021
  
  d77eb0cf
- Release: v4.6.0 · 64e78564
  Lysandre authored May 12, 2021
  
  64e78564
- remove defaults to None if optional (#11703) · 77f4c46b
  Philip May authored May 12, 2021
  
  77f4c46b
- Updates README and fixes bug (#11701) · 6797cdc0
  Marc van Zee authored May 12, 2021
  
  6797cdc0
11 May, 2021 3 commits

- Adds Flax BERT finetuning example on GLUE (#11564) · 4ce6bcc3
  Marc van Zee authored May 11, 2021

* Adds Flax BERT finetuning example

* fix traced jax tensor type

* Use Optax losses and learning schedulers

* Add 1GPU training results

* merge into master & make style

* fix input

* del file

* Fix bug in loss and add torch runs

* finish bert flax fine-tune

* Update examples/flax/text-classification/README.md

* Update examples/flax/text-classification/run_flax_glue.py

* add requirements

* finalize

* finalize
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

4ce6bcc3

- Auto modelcard (#11599) · a135f595
  Sylvain Gugger authored May 11, 2021

* Autogenerate model cards from the Trainer

* ModelCard deprecated

* Fix test

* Style

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Address review comments

* Quality

* With all metadata

* Metadata

* Post-merge conflict mess

* Data args and all examples

* Default license and languages when possible
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a135f595

- Add --text_column to run_summarization_no_trainer (#11673) · 64232bc0
  Jonathan Chang authored May 11, 2021
  
  64232bc0

10 May, 2021 3 commits
- Fix suggested by @bhadreshpsavani (#11660) · ef8d32c5
  Matt authored May 10, 2021
  
  ef8d32c5
- Update requirements.txt (#11634) · 1a0b4178
  Quentin Lhoest authored May 10, 2021
  
  1a0b4178
- [Examples] Fix invalid links after reorg (#11650) · 7e406f4a
  Tommy Chiang authored May 10, 2021
  
  7e406f4a
09 May, 2021 1 commit
- [Examples] Check key exists in datasets first (#11503) · f2ffcaf4
  Tommy Chiang authored May 10, 2021
  
  f2ffcaf4
07 May, 2021 2 commits
- [examples] fix sys.path in conftest.py (#11636) · ba0d50f2
  Stas Bekman authored May 07, 2021
```
* restore conftest.py

* fix conftest and make copies

* remove unneeded parts

* remove unwanted files
```
  ba0d50f2
- Fix comment in run_clm_no_trainer.py (#11624) · 6f40e317
  Jonathan Chang authored May 07, 2021
  
  6f40e317
06 May, 2021 1 commit
- fix typo in command (#11605) · f594090a
  Vipul Raheja authored May 06, 2021
  
  f594090a
05 May, 2021 1 commit

- Pytorch - Lazy initialization of models (#11471) · 3e3e41ae
  Patrick von Platen authored May 05, 2021

* lazy_init_weights

* remove ipdb

* save int

* add necessary code

* remove unnecessary utils

* Update src/transformers/models/t5/modeling_t5.py

* clean

* add tests

* correct

* finish tests

* finish tests

* fix some more tests

* fix xlnet & transfo-xl

* fix more tests

* make sure tests are independent

* fix tests more

* finish tests

* final touches

* Update src/transformers/modeling_utils.py

* Apply suggestions from code review

* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* clean tests

* give arg positive name

* add more mock weights to xlnet
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

3e3e41ae

04 May, 2021 2 commits

- Reproducible checkpoint (#11582) · 6b241e0e
  Sylvain Gugger authored May 04, 2021

* Set generator in dataloader

* Use generator in all random samplers

* Checkpoint all RNG states

* Final version

* Quality

* Test

* Address review comments

* Quality

* Remove debug util

* Add python and numpy RNGs

* Split states in different files in distributed

* Quality

* local_rank for TPUs

* Only use generator when accepted

* Add test

* Set seed to avoid flakiness

* Make test less flaky

* Quality

6b241e0e

- [FlaxRoberta] Add FlaxRobertaModels & adapt run_mlm_flax.py (#11470) · 084a187d
  Patrick von Platen authored May 04, 2021

* add flax roberta

* make style

* correct initialization

* modify model to save weights

* fix copied from

* fix copied from

* correct some more code

* add more roberta models

* Apply suggestions from code review

* merge from master

* finish

* finish docs
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

084a187d

03 May, 2021 1 commit
- Fix metric computation in `run_glue_no_trainer` (#11569) · 87dd1a00
  Sylvain Gugger authored May 03, 2021
  
  87dd1a00
30 Apr, 2021 4 commits
- [Examples] Added support for test-file in QA examples with no trainer (#11510) · 84326a28
  Bhadresh Savani authored Apr 30, 2021
```
* added support for test-file

* fixed typo

* added suggested changes

* reformatted code

* modified files

* fix post processing error

* Trigger CI

* removed extra lines
```
  84326a28
- reszie token embeds (#11524) · 57c8e822
  Suraj Patil authored Apr 30, 2021
  
  57c8e822
- Update TF text classification example (#11496) · 20d6931e
  Matt authored Apr 30, 2021
```
Big refactor, fixes and multi-GPU/TPU support
```
  20d6931e
- Update README.md (#11489) · 58c789e3
  Manuel Romero authored Apr 30, 2021
```
Add link to code
```
  58c789e3
29 Apr, 2021 1 commit
- Split checkpoint from model_name_or_path in examples (#11492) · b29eb247
  Sylvain Gugger authored Apr 29, 2021
```
* Split checkpoint from model_name_or_path in examples

* Address review comments

* Address review comments
```
  b29eb247
26 Apr, 2021 4 commits

- Variable Correction for Consistency in Distillation Example (#11444) · 0661abc5
  Jaimeen Ahn authored Apr 27, 2021

The parser names the GPU-count argument 'gpus', but the train.py script reads it as 'n_gpu', and this mismatch caused the error. The correction makes the two names consistent so that the example works.

0661abc5

- [Examples] Fixes inconsistency around eval vs val and predict vs test (#11380) · 1d30ec95
  Bhadresh Savani authored Apr 26, 2021

* added changes for uniformity

* modified files

* corrected typo

* fixed qa scripts

* fix typos

* fixed predict typo in qa no trainer

* fixed test file

* reverted trainer changes

* reverted trainer changes in custom examples

* updated readme

* added changes in deepspeed test

* added changes for predict and eval

1d30ec95

- docs(examples): fix link to TPU launcher script (#11427) · e3e70f95
  Amine Abdaoui authored Apr 26, 2021
  
  e3e70f95
- make style (#11442) · 32dbb2d9
  Patrick von Platen authored Apr 26, 2021
  
  32dbb2d9

23 Apr, 2021 3 commits

- Default to accuracy metric (#11405) · 1ef152eb
  Sylvain Gugger authored Apr 23, 2021
  
  1ef152eb

- Trainer push to hub (#11328) · bf2e0cf7
  Sylvain Gugger authored Apr 23, 2021

* Initial support for upload to hub

* push -> upload

* Fixes + examples

* Fix torchhub test

* Torchhub test I hate you

* push_model_to_hub -> push_to_hub

* Apply mixin to other pretrained models

* Remove ABC inheritance

* Add tests

* Typo

* Run tests

* Install git-lfs

* Change approach

* Add push_to_hub to all

* Staging test suite

* Typo

* Maybe like this?

* More deps

* Cache

* Adapt name

* Quality

* MOAR tests

* Put it in testing_utils

* Docs + torchhub last hope

* Styling

* Wrong method

* Typos

* Update src/transformers/file_utils.py
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Address review comments

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Julien Chaumond <julien@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

bf2e0cf7

- fixed typos (#11391) · c3d6f339
  Yoshitomo Matsubara authored Apr 23, 2021
  
  c3d6f339