Commits · ea5567502441135fc1cdab4abdf77fa710461ec3 · chenpangpang / transformers

05 Jul, 2021 1 commit

[Flax] ViT training example (#12300) · f1c81d6b

Suraj Patil authored Jul 05, 2021

* begin script

* clean example, add readme

* update readme

* remove decay mask

* remove masking

* update readme & make flake happy

f1c81d6b

29 Jun, 2021 2 commits

[Flax] Example scripts - correct weight decay (#12409) · 81332868
Patrick von Platen authored Jun 29, 2021
```
* fix_torch_device_generate_test

* remove @

* finish

* finish

* correct style
```
81332868

[example/flax] add summarization readme (#12393) · aecae533

Suraj Patil authored Jun 29, 2021



* add readme

* update readme and add requirements

* Update examples/flax/summarization/README.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

aecae533

28 Jun, 2021 2 commits

[Flax] Add T5 pretraining script (#12355) · 31c3e7e7

Patrick von Platen authored Jun 28, 2021



* fix_torch_device_generate_test

* remove @

* add length computatan

* finish masking

* finish

* upload

* fix some bugs

* finish

* fix dependency table

* correct tensorboard

* Apply suggestions from code review

* correct processing

* slight change init

* correct some more mistakes

* apply suggestions

* improve readme

* fix indent

* Apply suggestions from code review
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

* correct tokenizer

* finish

* finish

* finish

* finish
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

31c3e7e7

[Flax] Adapt flax examples to include `push_to_hub` (#12391) · 2d70c912

Patrick von Platen authored Jun 28, 2021



* fix_torch_device_generate_test

* remove @

* finish

* correct summary writer

* correct push to hub

* fix indent

* finish

* finish

* finish

* finish

* finish
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

2d70c912

25 Jun, 2021 1 commit
- remove extra white space from log format (#12360) · 4a872cae
  Stas Bekman authored Jun 25, 2021
  
  4a872cae
24 Jun, 2021 1 commit
- [examples/Flax] move the examples table up (#12341) · aef3823e
  Suraj Patil authored Jun 24, 2021
  
  aef3823e
23 Jun, 2021 1 commit

Flax summarization script (#12230) · c0fe3c9a

Suraj Patil authored Jun 23, 2021

* add summrization script

* fix arguments, preprocessing, metrics

* add generation and metrics

* auto model, prediction loop

* prettify

* label smoothing

* adress Sylvain and Patricks suggestions

* dynamically import shift_tokens_right

* fix shift_tokens_right_fn call

c0fe3c9a

15 Jun, 2021 1 commit
- Use a released version of optax rather than installing from Git. (#12173) · 9b393240
  Avital Oliver authored Jun 15, 2021
```
Use a released version of optax rather than installing from Git
```
  9b393240
14 Jun, 2021 3 commits

[Flax] Add links to google colabs (#12146) · 7566fefa
Patrick von Platen authored Jun 14, 2021
```
* fix_torch_device_generate_test

* remove @

* add colab links
```
7566fefa

add readme for flax clm (#12111) · d36fce82

Suraj Patil authored Jun 14, 2021



* add readme for flax clm

* use section link for tokenizer

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update metrics
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d36fce82

Add mlm pretraining xla torch readme (#12011) · 16c0efca

Patrick von Platen authored Jun 14, 2021



* fix_torch_device_generate_test

* remove @

* upload

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

* Update examples/flax/language-modeling/README.md

* add more info

* finish

* fix
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

16c0efca

11 Jun, 2021 1 commit

Flax CLM script (#12023) · 15b498f3

Suraj Patil authored Jun 11, 2021

* first draft

* max_seq_length => block_size

* fix arg names

* fix typos

* fix loss calculation

* add max examples, fix  train eval steps, metrics

* optimizer mask

* fix perpelexity, metric logging

* fix logging

* data_collator = > data_loader

* refactor loss_fn

* support single GPU

* pass distributed to write_metric

* fix jitting

* fix single device training

* fix single device metrics

* close inner progress bars once finished

* add overwrite_cache arg

* ifx dataset caching issue

* add more logs

* few small fixes,

* address nicholas suggestions

* fix docstr

* address patricks suggestions

* make flake happy

* pass new new_dropout_rng to apply_gradients

* reset train metrics after every epoc

* remove distributed logis, small fixes

15b498f3

09 Jun, 2021 1 commit
- pass decay_mask fn to optimizer (#12087) · d1500d91
  Suraj Patil authored Jun 09, 2021
  
  d1500d91
03 Jun, 2021 2 commits

[Flax] Refactor MLM (#12013) · 242ec31a

Patrick von Platen authored Jun 03, 2021



* fix_torch_device_generate_test

* remove @

* finish refactor
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

242ec31a

Fix weight decay masking in `run_flax_glue.py` (#11964) · 4674061b

Nicholas Vadivelu authored Jun 03, 2021



* Fix weight decay masking in `run_flax_glue.py`

Issues with the previous implementation:
- The `dict` from `traverse_util.flatten_dict` has keys which are tuples of strings, not one long string with the path separated by periods.
- `optax.masked` applies the transformation wherever the mask is True, so the masks are flipped.
- Flax's LayerNorm calls the scale parameter `scale` not `weight`

* Fix formatting with black

* adapt results
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

4674061b

31 May, 2021 1 commit

Remove redundant `nn.log_softmax` in `run_flax_glue.py` (#11920) · 1ab147d6

Nicholas Vadivelu authored May 31, 2021

* Remove redundant `nn.log_softmax` in `run_flax_glue.py`

`optax.softmax_cross_entropy` expects unnormalized logits, and so it already calls `nn.log_softmax`, so I believe it is not needed here. `nn.log_softmax` is idempotent so mathematically it shouldn't have made a difference.

* Remove unused 'flax.linen' import

1ab147d6

26 May, 2021 1 commit
- Link official Cloud TPU JAX docs (#11892) · 2df54691
  Avital Oliver authored May 26, 2021
  
  2df54691
24 May, 2021 1 commit
- [Flax] Fix PyTorch import error (#11839) · f5806041
  Patrick von Platen authored May 24, 2021
```
* fix_torch_device_generate_test

* remove @

* change pytorch import to flax import
```
  f5806041
21 May, 2021 3 commits

Add flax text class colab (#11824) · da22245e
Patrick von Platen authored May 21, 2021
```
* fix_torch_device_generate_test

* remove @

* add flax glue link
```
da22245e

[Flax] Small fixes in `run_flax_glue.py` (#11820) · 82335185

Patrick von Platen authored May 21, 2021



* fix_torch_device_generate_test

* remove @

* correct best seed for flax fine-tuning
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

82335185

[Flax] Align GLUE training script with mlm training script (#11778) · bd987165

Patrick von Platen authored May 21, 2021



* speed up flax glue

* remove unnecessary line

* remove folder

* remove run in loop
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

bd987165

19 May, 2021 1 commit

[Flax MLM] Refactor run mlm with optax (#11745) · 00440e35

Patrick von Platen authored May 19, 2021



* refactor

* update

* update

* update

* refactor run mlm

* finalize

* refactor more

* fix typo

* update

* finish refactor

* modify run mlm

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

* small fixes

* upload

* upload

* finish run mlm script
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

00440e35

18 May, 2021 1 commit

Add Flax Examples and Cloud TPU README (#11753) · 77f9bd18

Avital Oliver authored May 18, 2021



* Add Flax Examples README

* Apply suggestions from code review

* Update examples/flax/README.md

* add nice table

* fix

* fix

* apply suggestions

* upload

* finish flax readme.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

77f9bd18

17 May, 2021 1 commit

Improvements to Flax finetuning script (#11727) · 726e953d

Marc van Zee authored May 17, 2021

* Add Cloud details to README

* Flax script and readme updates

* Some simplifications of Flax script

726e953d

14 May, 2021 2 commits
- Add Cloud details to README (#11706) · 94a23487
  Marc van Zee authored May 14, 2021
```
* Add Cloud details to README

* Flax script and readme updates
```
  94a23487
- correct example script (#11726) · 113eaa75
  Patrick von Platen authored May 14, 2021
  
  113eaa75
12 May, 2021 1 commit
- Updates README and fixes bug (#11701) · 6797cdc0
  Marc van Zee authored May 12, 2021
  
  6797cdc0
11 May, 2021 1 commit

Adds Flax BERT finetuning example on GLUE (#11564) · 4ce6bcc3

Marc van Zee authored May 11, 2021



* Adds Flax BERT finetuning example

* fix traced jax tensor type

* Use Optax losses and learning schedulers

* Add 1GPU training results

* merge into master & make style

* fix input

* del file

* Fix bug in loss and add torch runs

* finish bert flax fine-tune

* Update examples/flax/text-classification/README.md

* Update examples/flax/text-classification/run_flax_glue.py

* add requirements

* finalize

* finalize
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

4ce6bcc3

04 May, 2021 1 commit

[FlaxRoberta] Add FlaxRobertaModels & adapt run_mlm_flax.py (#11470) · 084a187d

Patrick von Platen authored May 04, 2021



* add flax roberta

* make style

* correct initialiazation

* modify model to save weights

* fix copied from

* fix copied from

* correct some more code

* add more roberta models

* Apply suggestions from code review

* merge from master

* finish

* finish docs
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

084a187d

23 Apr, 2021 1 commit
- correct typo (#11393) · b48cf712
  Patrick von Platen authored Apr 23, 2021
  
  b48cf712
21 Apr, 2021 1 commit

Examples reorg (#11350) · dabeb152

Sylvain Gugger authored Apr 21, 2021



* Base move

* Examples reorganization

* Update references

* Put back test data

* Move conftest

* More fixes

* Move test data to test fixtures

* Update path

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments and clean
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

dabeb152