Commits · 2e4082364e4bd001f7933d81b3f75548704f79d7 · chenpangpang / transformers

06 Aug, 2021 1 commit

[Flax T5] Speed up t5 training (#13012) · 2e408236

Patrick von Platen authored Aug 06, 2021



* fix_torch_device_generate_test

* remove @

* update

* up

* fix

* remove f-stings

* correct readme

* up
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

2e408236

30 Jul, 2021 1 commit
- examples: use correct way to get vocab size in flax lm readme (#12947) · 3d4b3bc3
  Stefan Schweter authored Jul 30, 2021
  
  3d4b3bc3
27 Jul, 2021 1 commit

[FLAX] Minor fixes in CLM example (#12914) · d3c3e722

Stefan Schweter authored Jul 27, 2021

* readme: fix retrieval of vocab size for flax clm example

* examples: fix flax clm example when using training/evaluation files

d3c3e722

20 Jul, 2021 2 commits
- Update README.md · 13fefdf3
  Patrick von Platen authored Jul 20, 2021
```
cc @patil-suraj
```
  13fefdf3
- Flax MLM: Allow validation split when loading dataset from local file (#12689) · 66197adc
  fgaim authored Jul 20, 2021
```
* Allow validation split when loading dataset from local file

* Flax clm & t5, enable validation split for datasets loaded from local file
```
  66197adc
14 Jul, 2021 1 commit
- Update README.md · f4399ec5
  Patrick von Platen authored Jul 14, 2021
  
  f4399ec5
13 Jul, 2021 1 commit

Add ByT5 option to example run_t5_mlm_flax.py (#12634) · 5803a2a7

Nick Doiron authored Jul 13, 2021

* Allow ByT5 type in Flax T5 script

* use T5TokenizerFast

* change up tokenizer config

* model_args

* reorder imports

* Update run_t5_mlm_flax.py

5803a2a7

09 Jul, 2021 1 commit
- [Flax] Fix cur step flax examples (#12608) · deecdd49
  Patrick von Platen authored Jul 09, 2021
```
* fix_torch_device_generate_test

* remove @

* fix save problem
```
  deecdd49
08 Jul, 2021 1 commit
- Fix group_lengths for short datasets (#12558) · 6f1adc43
  Sylvain Gugger authored Jul 08, 2021
  
  6f1adc43
07 Jul, 2021 3 commits
- Remove logging of GPU count etc logging. (#12569) · 122d7dc3
  Ibraheem Moosa authored Jul 08, 2021
```
Successfully logging this requires Pytorch. For the purposes of this script we are not using Pytorch.
```
  122d7dc3
- [Flax] Allow retraining from save checkpoint (#12559) · 7d321b76
  Patrick von Platen authored Jul 07, 2021
```
* fix_torch_device_generate_test

* remove @

* finish
```
  7d321b76
- [examples/flax] add adafactor optimizer (#12544) · 2d42915a
  Suraj Patil authored Jul 07, 2021
```
* add adafactor

* Update examples/flax/language-modeling/run_mlm_flax.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  2d42915a
06 Jul, 2021 1 commit

[Flax] Adapt examples to be able to use eval_steps and save_steps (#12543) · 208df208

Patrick von Platen authored Jul 06, 2021



* fix_torch_device_generate_test

* remove @

* up

* up

* correct

* upload
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

208df208

05 Jul, 2021 3 commits
- [Flax] Fix another bug in logging steps (#12516) · 4605b2b8
  Patrick von Platen authored Jul 05, 2021
```
* fix_torch_device_generate_test

* remove @

* up
```
  4605b2b8
- [Flax] Correct logging steps flax (#12515) · d0f7508a
  Patrick von Platen authored Jul 05, 2021
```
* fix_torch_device_generate_test

* remove @

* push
```
  d0f7508a
- [Flax] Correct flax training scripts (#12514) · bb4ac2b5
  Patrick von Platen authored Jul 05, 2021
```
* fix_torch_device_generate_test

* remove @

* add logging steps

* correct training scripts

* correct readme

* correct
```
  bb4ac2b5
29 Jun, 2021 1 commit
- [Flax] Example scripts - correct weight decay (#12409) · 81332868
  Patrick von Platen authored Jun 29, 2021
```
* fix_torch_device_generate_test

* remove @

* finish

* finish

* correct style
```
  81332868
28 Jun, 2021 2 commits

[Flax] Add T5 pretraining script (#12355) · 31c3e7e7

Patrick von Platen authored Jun 28, 2021



* fix_torch_device_generate_test

* remove @

* add length computatan

* finish masking

* finish

* upload

* fix some bugs

* finish

* fix dependency table

* correct tensorboard

* Apply suggestions from code review

* correct processing

* slight change init

* correct some more mistakes

* apply suggestions

* improve readme

* fix indent

* Apply suggestions from code review
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

* correct tokenizer

* finish

* finish

* finish

* finish
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

31c3e7e7

[Flax] Adapt flax examples to include `push_to_hub` (#12391) · 2d70c912

Patrick von Platen authored Jun 28, 2021



* fix_torch_device_generate_test

* remove @

* finish

* correct summary writer

* correct push to hub

* fix indent

* finish

* finish

* finish

* finish

* finish
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

2d70c912

25 Jun, 2021 1 commit
- remove extra white space from log format (#12360) · 4a872cae
  Stas Bekman authored Jun 25, 2021
  
  4a872cae
15 Jun, 2021 1 commit
- Use a released version of optax rather than installing from Git. (#12173) · 9b393240
  Avital Oliver authored Jun 15, 2021
```
Use a released version of optax rather than installing from Git
```
  9b393240
14 Jun, 2021 3 commits

[Flax] Add links to google colabs (#12146) · 7566fefa
Patrick von Platen authored Jun 14, 2021
```
* fix_torch_device_generate_test

* remove @

* add colab links
```
7566fefa

add readme for flax clm (#12111) · d36fce82

Suraj Patil authored Jun 14, 2021



* add readme for flax clm

* use section link for tokenizer

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update metrics
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d36fce82

Add mlm pretraining xla torch readme (#12011) · 16c0efca

Patrick von Platen authored Jun 14, 2021



* fix_torch_device_generate_test

* remove @

* upload

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

* Update examples/flax/language-modeling/README.md

* add more info

* finish

* fix
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

16c0efca

11 Jun, 2021 1 commit

Flax CLM script (#12023) · 15b498f3

Suraj Patil authored Jun 11, 2021

* first draft

* max_seq_length => block_size

* fix arg names

* fix typos

* fix loss calculation

* add max examples, fix  train eval steps, metrics

* optimizer mask

* fix perpelexity, metric logging

* fix logging

* data_collator = > data_loader

* refactor loss_fn

* support single GPU

* pass distributed to write_metric

* fix jitting

* fix single device training

* fix single device metrics

* close inner progress bars once finished

* add overwrite_cache arg

* ifx dataset caching issue

* add more logs

* few small fixes,

* address nicholas suggestions

* fix docstr

* address patricks suggestions

* make flake happy

* pass new new_dropout_rng to apply_gradients

* reset train metrics after every epoc

* remove distributed logis, small fixes

15b498f3

09 Jun, 2021 1 commit
- pass decay_mask fn to optimizer (#12087) · d1500d91
  Suraj Patil authored Jun 09, 2021
  
  d1500d91
03 Jun, 2021 1 commit

[Flax] Refactor MLM (#12013) · 242ec31a

Patrick von Platen authored Jun 03, 2021



* fix_torch_device_generate_test

* remove @

* finish refactor
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

242ec31a

24 May, 2021 1 commit
- [Flax] Fix PyTorch import error (#11839) · f5806041
  Patrick von Platen authored May 24, 2021
```
* fix_torch_device_generate_test

* remove @

* change pytorch import to flax import
```
  f5806041
19 May, 2021 1 commit

[Flax MLM] Refactor run mlm with optax (#11745) · 00440e35

Patrick von Platen authored May 19, 2021



* refactor

* update

* update

* update

* refactor run mlm

* finalize

* refactor more

* fix typo

* update

* finish refactor

* modify run mlm

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

* small fixes

* upload

* upload

* finish run mlm script
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

00440e35

04 May, 2021 1 commit

[FlaxRoberta] Add FlaxRobertaModels & adapt run_mlm_flax.py (#11470) · 084a187d

Patrick von Platen authored May 04, 2021



* add flax roberta

* make style

* correct initialiazation

* modify model to save weights

* fix copied from

* fix copied from

* correct some more code

* add more roberta models

* Apply suggestions from code review

* merge from master

* finish

* finish docs
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

084a187d

23 Apr, 2021 1 commit
- correct typo (#11393) · b48cf712
  Patrick von Platen authored Apr 23, 2021
  
  b48cf712
21 Apr, 2021 1 commit

Examples reorg (#11350) · dabeb152

Sylvain Gugger authored Apr 21, 2021



* Base move

* Examples reorganization

* Update references

* Put back test data

* Move conftest

* More fixes

* Move test data to test fixtures

* Update path

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments and clean
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

dabeb152