Commits · cfca638acbc9361fca43a6fcff33fa512ab5df93 · chenpangpang / transformers

31 May, 2021 2 commits

Add MT5ForConditionalGeneration as supported arch. to summarization README (#11961) · cfca638a
Philip May authored May 31, 2021
```
* Add MT5ForConditionalGeneration as supported arch.

* Update README.md
```
cfca638a

Remove redundant `nn.log_softmax` in `run_flax_glue.py` (#11920) · 1ab147d6

Nicholas Vadivelu authored May 31, 2021

* Remove redundant `nn.log_softmax` in `run_flax_glue.py`

`optax.softmax_cross_entropy` expects unnormalized logits, and so it already calls `nn.log_softmax`, so I believe it is not needed here. `nn.log_softmax` is idempotent so mathematically it shouldn't have made a difference.

* Remove unused 'flax.linen' import

1ab147d6

26 May, 2021 1 commit
- Link official Cloud TPU JAX docs (#11892) · 2df54691
  Avital Oliver authored May 26, 2021
  
  2df54691
25 May, 2021 4 commits

[Examples] create model with custom config on the fly (#11798) · 1b653010

Stas Bekman authored May 25, 2021



* create custom model on the flight

* better wording

* add update_from_string

* cleanup

* cleanup

* Update src/transformers/configuration_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* more bool options

* style

* fix logger

* add test

* add the doc

* assert on conflict of options
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

1b653010

[lm examples] fix overflow in perplexity calc (#11855) · 6287c929
Stas Bekman authored May 25, 2021
```
* fix overflow in perplexity calc

* use inf

* fix
```
6287c929
Add option to log only once in multinode training (#11819) · f086652b
Sylvain Gugger authored May 25, 2021
```
* Add option to long only once in multinode training

* Use an alternate property
```
f086652b
typo (#11858) · b8344a27
Wang Ran (汪然) authored May 25, 2021

b8344a27

24 May, 2021 1 commit
- [Flax] Fix PyTorch import error (#11839) · f5806041
  Patrick von Platen authored May 24, 2021
```
* fix_torch_device_generate_test

* remove @

* change pytorch import to flax import
```
  f5806041
21 May, 2021 3 commits

Add flax text class colab (#11824) · da22245e
Patrick von Platen authored May 21, 2021
```
* fix_torch_device_generate_test

* remove @

* add flax glue link
```
da22245e

[Flax] Small fixes in `run_flax_glue.py` (#11820) · 82335185

Patrick von Platen authored May 21, 2021



* fix_torch_device_generate_test

* remove @

* correct best seed for flax fine-tuning
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

82335185

[Flax] Align GLUE training script with mlm training script (#11778) · bd987165

Patrick von Platen authored May 21, 2021



* speed up flax glue

* remove unnecessary line

* remove folder

* remove run in loop
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

bd987165

20 May, 2021 1 commit

Fix failing test on Windows Platform (#11589) · 22394387

Keren Fuentes authored May 20, 2021

* add separator for windows

* fixes test_is_copy_consistent on Windows

* fixing writing encoding issue on extended test (for Windows)

* resolving comments

22394387

19 May, 2021 1 commit

[Flax MLM] Refactor run mlm with optax (#11745) · 00440e35

Patrick von Platen authored May 19, 2021



* refactor

* update

* update

* update

* refactor run mlm

* finalize

* refactor more

* fix typo

* update

* finish refactor

* modify run mlm

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

* small fixes

* upload

* upload

* finish run mlm script
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

00440e35

18 May, 2021 5 commits
- Fix a small error in summarization example (#11762) · eb3e072a
  Tomy Hsieh authored May 19, 2021
  
  eb3e072a
- Add Flax Examples and Cloud TPU README (#11753) · 77f9bd18
  Avital Oliver authored May 18, 2021
```
* Add Flax Examples README

* Apply suggestions from code review

* Update examples/flax/README.md

* add nice table

* fix

* fix

* apply suggestions

* upload

* finish flax readme.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  77f9bd18
- add `dataset_name` to data_args and added accuracy metric (#11760) · 04e25c62
  Philipp Schmid authored May 18, 2021
```
* add `dataset_name` to data_args and added accuracy metric

* added documentation for dataset_name

* spelling correction
```
  04e25c62
- Add more subsections to main doc (#11758) · cebb96f5
  Patrick von Platen authored May 18, 2021
```
* add headers to main doc

* Apply suggestions from code review

* update

* upload
```
  cebb96f5
- Fix incorrect newline in #11650 (#11757) · da7e73b7
  Tommy Chiang authored May 18, 2021
  
  da7e73b7
17 May, 2021 2 commits
- Use new evaluation loop in TrainerQA (#11746) · 936b5715
  Sylvain Gugger authored May 17, 2021
  
  936b5715
- Improvements to Flax finetuning script (#11727) · 726e953d
  Marc van Zee authored May 17, 2021
```
* Add Cloud details to README

* Flax script and readme updates

* Some simplifications of Flax script
```
  726e953d
14 May, 2021 2 commits
- Add Cloud details to README (#11706) · 94a23487
  Marc van Zee authored May 14, 2021
```
* Add Cloud details to README

* Flax script and readme updates
```
  94a23487
- correct example script (#11726) · 113eaa75
  Patrick von Platen authored May 14, 2021
  
  113eaa75
12 May, 2021 4 commits
- Docs for v4.7.0.dev0 · d77eb0cf
  Lysandre authored May 12, 2021
  
  d77eb0cf
- Release: v4.6.0 · 64e78564
  Lysandre authored May 12, 2021
  
  64e78564
- remove defaults to None if optional (#11703) · 77f4c46b
  Philip May authored May 12, 2021
  
  77f4c46b
- Updates README and fixes bug (#11701) · 6797cdc0
  Marc van Zee authored May 12, 2021
  
  6797cdc0
11 May, 2021 3 commits

Adds Flax BERT finetuning example on GLUE (#11564) · 4ce6bcc3

Marc van Zee authored May 11, 2021



* Adds Flax BERT finetuning example

* fix traced jax tensor type

* Use Optax losses and learning schedulers

* Add 1GPU training results

* merge into master & make style

* fix input

* del file

* Fix bug in loss and add torch runs

* finish bert flax fine-tune

* Update examples/flax/text-classification/README.md

* Update examples/flax/text-classification/run_flax_glue.py

* add requirements

* finalize

* finalize
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

4ce6bcc3

Auto modelcard (#11599) · a135f595

Sylvain Gugger authored May 11, 2021



* Autogenerate model cards from the Trainer

* ModelCard deprecated

* Fix test

* Style

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Address review comments

* Quality

* With all metadata

* Metadata

* Post-merge conflict mess

* Data args and all examples

* Default license and languages when possible
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a135f595

Add --text_column to run_summarization_no_trainer (#11673) · 64232bc0
Jonathan Chang authored May 11, 2021

64232bc0

10 May, 2021 3 commits
- Fix suggested by @bhadreshpsavani (#11660) · ef8d32c5
  Matt authored May 10, 2021
  
  ef8d32c5
- Update requirements.txt (#11634) · 1a0b4178
  Quentin Lhoest authored May 10, 2021
  
  1a0b4178
- [Examples] Fix invalid links after reorg (#11650) · 7e406f4a
  Tommy Chiang authored May 10, 2021
  
  7e406f4a
09 May, 2021 1 commit
- [Examples] Check key exists in datasets first (#11503) · f2ffcaf4
  Tommy Chiang authored May 10, 2021
  
  f2ffcaf4
07 May, 2021 2 commits
- [examples] fix sys.path in conftest.py (#11636) · ba0d50f2
  Stas Bekman authored May 07, 2021
```
* restore conftest.py

* fix conftest and make copies

* remove unneeded parts

* remove unwanted files
```
  ba0d50f2
- Fix comment in run_clm_no_trainer.py (#11624) · 6f40e317
  Jonathan Chang authored May 07, 2021
  
  6f40e317
06 May, 2021 1 commit
- fix typo in command (#11605) · f594090a
  Vipul Raheja authored May 06, 2021
  
  f594090a
05 May, 2021 1 commit

Pytorch - Lazy initialization of models (#11471) · 3e3e41ae

Patrick von Platen authored May 05, 2021



* lazy_init_weights

* remove ipdb

* save int

* add necessary code

* remove unnecessary utils

* Update src/transformers/models/t5/modeling_t5.py

* clean

* add tests

* correct

* finish tests

* finish tests

* fix some more tests

* fix xlnet & transfo-xl

* fix more tests

* make sure tests are independent

* fix tests more

* finist tests

* final touches

* Update src/transformers/modeling_utils.py

* Apply suggestions from code review

* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/modeling_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* clean tests

* give arg positive name

* add more mock weights to xlnet
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

3e3e41ae

04 May, 2021 2 commits

Reproducible checkpoint (#11582) · 6b241e0e

Sylvain Gugger authored May 04, 2021

* Set generator in dataloader

* Use generator in all random samplers

* Checkpoint all RNG states

* Final version

* Quality

* Test

* Address review comments

* Quality

* Remove debug util

* Add python and numpy RNGs

* Split states in different files in distributed

* Quality

* local_rank for TPUs

* Only use generator when accepted

* Add test

* Set seed to avoid flakiness

* Make test less flaky

* Quality

6b241e0e

[FlaxRoberta] Add FlaxRobertaModels & adapt run_mlm_flax.py (#11470) · 084a187d

Patrick von Platen authored May 04, 2021



* add flax roberta

* make style

* correct initialiazation

* modify model to save weights

* fix copied from

* fix copied from

* correct some more code

* add more roberta models

* Apply suggestions from code review

* merge from master

* finish

* finish docs
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

084a187d

03 May, 2021 1 commit
- Fix metric computation in `run_glue_no_trainer` (#11569) · 87dd1a00
  Sylvain Gugger authored May 03, 2021
  
  87dd1a00