Commits · 1e8140caad248ba7ea8797f0575f7931bd4b12a6 · chenpangpang / transformers

11 Jul, 2022 1 commit

Fix RESOURCE_EXHAUSTED error when dealing with large datasets in Flax example scripts (#18069) · 1e8140ca

Duong A. Nguyen authored Jul 11, 2022

* Fix RESOURCE_EXHAUSTED error for large datasets on Flax example scripts

* using np.permutation for creating batch_idx

* train_samples_idx -> training_samples_idx

* fix type hints

1e8140ca

07 Jun, 2022 1 commit

Add examples telemetry (#17552) · 3cab9027

Sylvain Gugger authored Jun 07, 2022

* Add examples telemetry

* Alternative approach

* Add to all other examples

* Add to templates as well

* Put framework separately

* Same for TensorFlow

3cab9027

12 May, 2022 1 commit

Black preview (#17217) · afe5d42d

Sylvain Gugger authored May 12, 2022

* Black preview

* Fixup too!

* Fix check copies

* Use the same version as the CI

* Bump black

afe5d42d

04 Apr, 2022 1 commit
- Add use_auth to load_datasets for private datasets to PT and TF examples (#16521) · 24a85cca
  Karim Foda authored Apr 04, 2022
```
* fix formatting and remove use_auth

* Add use_auth_token to Flax examples
```
  24a85cca
23 Mar, 2022 1 commit

Reorganize file utils (#16264) · 4975002d

Sylvain Gugger authored Mar 23, 2022

* Split file_utils in several submodules

* Fixes

* Add back more objects

* More fixes

* Who exactly decided to import that from there?

* Second suggestion to code with code review

* Revert wront move

* Fix imports

* Adapt all imports

* Adapt all imports everywhere

* Revert this import, will fix in a separate commit

4975002d

13 Jan, 2022 1 commit
- [examples/flax/language-modeling] set loglevel (#15129) · 762416ff
  Stas Bekman authored Jan 13, 2022
  
  762416ff
12 Dec, 2021 1 commit
- [Flax examples] remove dependancy on pytorch training args (#14636) · 6a025487
  Suraj Patil authored Dec 12, 2021
```
* use custom training arguments

* update tests
```
  6a025487
06 Dec, 2021 2 commits

[urls to hub] Replace outdated model tags with their now-canonical pipeline types (#14617) · 6cdc3a78
Julien Chaumond authored Dec 06, 2021
```
* Replace outdated model tags with their now-canonical pipeline types

* spam the CI till it's green
```
6cdc3a78

Add Flax example tests (#14599) · c5bd732a

Suraj Patil authored Dec 06, 2021

* add test for glue

* add tests for clm

* fix clm test

* add summrization tests

* more tests

* fix few tests

* add test for t5 mlm

* fix t5 mlm test

* fix tests for multi device

* cleanup

* ci job

* fix metric file name

* make t5 more robust

c5bd732a

22 Nov, 2021 1 commit

Switch from using sum for flattening lists of lists in group_texts (#14472) · 69e16abf

Nicholas Broad authored Nov 22, 2021



* remove sum for list flattening

* change to chain(*)

* make chain object a list

* delete empty lines

per sgugger's suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Nicholas Broad <nicholas@nmbroad.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

69e16abf

30 Sep, 2021 1 commit

[examples/flax] use Repository API for push_to_hub (#13672) · 7db2a79b

Suraj Patil authored Sep 30, 2021

* use Repository for push_to_hub

* update readme

* update other flax scripts

* update readme

* update qa example

* fix push_to_hub call

* fix typo

* fix more typos

* update readme

* use abosolute path to get repo name

* fix glue script

7db2a79b

09 Aug, 2021 1 commit

[Flax] Refactor gpt2 & bert example docs (#13024) · 13a9c9a3

Patrick von Platen authored Aug 09, 2021



* fix_torch_device_generate_test

* remove @

* improve docs for clm

* speed-ups

* correct t5 example as well

* push final touches

* Update examples/flax/language-modeling/README.md

* correct docs for mlm

* Update examples/flax/language-modeling/README.md
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

13a9c9a3

20 Jul, 2021 1 commit

Flax MLM: Allow validation split when loading dataset from local file (#12689) · 66197adc

fgaim authored Jul 20, 2021

* Allow validation split when loading dataset from local file

* Flax clm & t5, enable validation split for datasets loaded from local file

66197adc

09 Jul, 2021 1 commit
- [Flax] Fix cur step flax examples (#12608) · deecdd49
  Patrick von Platen authored Jul 09, 2021
```
* fix_torch_device_generate_test

* remove @

* fix save problem
```
  deecdd49
08 Jul, 2021 1 commit
- Fix group_lengths for short datasets (#12558) · 6f1adc43
  Sylvain Gugger authored Jul 08, 2021
  
  6f1adc43
07 Jul, 2021 2 commits
- [Flax] Allow retraining from save checkpoint (#12559) · 7d321b76
  Patrick von Platen authored Jul 07, 2021
```
* fix_torch_device_generate_test

* remove @

* finish
```
  7d321b76
- [examples/flax] add adafactor optimizer (#12544) · 2d42915a
  Suraj Patil authored Jul 07, 2021
```
* add adafactor

* Update examples/flax/language-modeling/run_mlm_flax.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  2d42915a
06 Jul, 2021 1 commit

[Flax] Adapt examples to be able to use eval_steps and save_steps (#12543) · 208df208

Patrick von Platen authored Jul 06, 2021



* fix_torch_device_generate_test

* remove @

* up

* up

* correct

* upload
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

208df208

05 Jul, 2021 3 commits
- [Flax] Fix another bug in logging steps (#12516) · 4605b2b8
  Patrick von Platen authored Jul 05, 2021
```
* fix_torch_device_generate_test

* remove @

* up
```
  4605b2b8
- [Flax] Correct logging steps flax (#12515) · d0f7508a
  Patrick von Platen authored Jul 05, 2021
```
* fix_torch_device_generate_test

* remove @

* push
```
  d0f7508a
- [Flax] Correct flax training scripts (#12514) · bb4ac2b5
  Patrick von Platen authored Jul 05, 2021
```
* fix_torch_device_generate_test

* remove @

* add logging steps

* correct training scripts

* correct readme

* correct
```
  bb4ac2b5
29 Jun, 2021 1 commit
- [Flax] Example scripts - correct weight decay (#12409) · 81332868
  Patrick von Platen authored Jun 29, 2021
```
* fix_torch_device_generate_test

* remove @

* finish

* finish

* correct style
```
  81332868
28 Jun, 2021 2 commits

[Flax] Add T5 pretraining script (#12355) · 31c3e7e7

Patrick von Platen authored Jun 28, 2021



* fix_torch_device_generate_test

* remove @

* add length computatan

* finish masking

* finish

* upload

* fix some bugs

* finish

* fix dependency table

* correct tensorboard

* Apply suggestions from code review

* correct processing

* slight change init

* correct some more mistakes

* apply suggestions

* improve readme

* fix indent

* Apply suggestions from code review
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

* correct tokenizer

* finish

* finish

* finish

* finish
Co-authored-by: Patrick von Platen <patrick@huggingface.co>
Co-authored-by: SaulLu <55560583+SaulLu@users.noreply.github.com>

31c3e7e7

[Flax] Adapt flax examples to include `push_to_hub` (#12391) · 2d70c912

Patrick von Platen authored Jun 28, 2021



* fix_torch_device_generate_test

* remove @

* finish

* correct summary writer

* correct push to hub

* fix indent

* finish

* finish

* finish

* finish

* finish
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

2d70c912

25 Jun, 2021 1 commit
- remove extra white space from log format (#12360) · 4a872cae
  Stas Bekman authored Jun 25, 2021
  
  4a872cae
09 Jun, 2021 1 commit
- pass decay_mask fn to optimizer (#12087) · d1500d91
  Suraj Patil authored Jun 09, 2021
  
  d1500d91
03 Jun, 2021 1 commit

[Flax] Refactor MLM (#12013) · 242ec31a

Patrick von Platen authored Jun 03, 2021



* fix_torch_device_generate_test

* remove @

* finish refactor
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

242ec31a

24 May, 2021 1 commit
- [Flax] Fix PyTorch import error (#11839) · f5806041
  Patrick von Platen authored May 24, 2021
```
* fix_torch_device_generate_test

* remove @

* change pytorch import to flax import
```
  f5806041
19 May, 2021 1 commit

[Flax MLM] Refactor run mlm with optax (#11745) · 00440e35

Patrick von Platen authored May 19, 2021



* refactor

* update

* update

* update

* refactor run mlm

* finalize

* refactor more

* fix typo

* update

* finish refactor

* modify run mlm

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

* small fixes

* upload

* upload

* finish run mlm script
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

00440e35

04 May, 2021 1 commit

[FlaxRoberta] Add FlaxRobertaModels & adapt run_mlm_flax.py (#11470) · 084a187d

Patrick von Platen authored May 04, 2021



* add flax roberta

* make style

* correct initialiazation

* modify model to save weights

* fix copied from

* fix copied from

* correct some more code

* add more roberta models

* Apply suggestions from code review

* merge from master

* finish

* finish docs
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

084a187d

23 Apr, 2021 1 commit
- correct typo (#11393) · b48cf712
  Patrick von Platen authored Apr 23, 2021
  
  b48cf712
21 Apr, 2021 1 commit

Examples reorg (#11350) · dabeb152

Sylvain Gugger authored Apr 21, 2021



* Base move

* Examples reorganization

* Update references

* Put back test data

* Move conftest

* More fixes

* Move test data to test fixtures

* Update path

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments and clean
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

dabeb152

13 Apr, 2021 1 commit
- added cache_dir=model_args.cache_dir to all example with cache_dir arg (#11220) · 9fa29959
  Philipp Schmid authored Apr 13, 2021
  
  9fa29959
31 Mar, 2021 1 commit

Enforce string-formatting with f-strings (#10980) · acc3bd9d

Sylvain Gugger authored Mar 31, 2021



* First third

* Styling and fix mistake

* Quality

* All the rest

* Treat %s and %d

* typo

* Missing )

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

acc3bd9d

05 Feb, 2021 1 commit
- [examples] make run scripts executable (#10037) · 8ea412a8
  Stas Bekman authored Feb 05, 2021
```
* make executable

* make executable

* same for the template

* cleanup
```
  8ea412a8
16 Dec, 2020 1 commit

[Flax] Align FlaxBertForMaskedLM with BertForMaskedLM, implement from_pretrained, init (#9054) · 640e6fe1

Patrick von Platen authored Dec 16, 2020



* save intermediate

* save intermediate

* save intermediate

* correct flax bert model file

* new module / model naming

* make style

* almost finish BERT

* finish roberta

* make fix-copies

* delete keys file

* last refactor

* fixes in run_mlm_flax.py

* remove pooled from run_mlm_flax.py`

* fix gelu | gelu_new

* remove Module from inits

* splits

* dirty print

* preventing warmup_steps == 0

* smaller splits

* make fix-copies

* dirty print

* dirty print

* initial_evaluation argument

* declaration order fix

* proper model initialization/loading

* proper initialization

* run_mlm_flax improvements: improper model inputs bugfix + automatic dataset splitting + tokenizers parallelism warning + avoiding warmup_steps=0 bug

* removed tokenizers warning hack, fixed model re-initialization

* reverted training_args.py changes

* fix flax from pretrained

* improve test in flax

* apply sylvains tips

* update init

* make 0.3.0 compatible

* revert tevens changes

* revert tevens changes 2

* finalize revert

* fix bug

* add docs

* add pretrained to init

* Update src/transformers/modeling_flax_utils.py

* fix copies

* final improvements
Co-authored-by: TevenLeScao <teven.lescao@gmail.com>

640e6fe1

15 Dec, 2020 1 commit

[Examples] Add automatic dataset splitting in language-modeling examples (#9133) · 2a7e8e16

Teven authored Dec 15, 2020

* replaced jnp.split + removing textual model inputs + ensuring warmup_steps > 0

* Add automatic dataset splitting in language-modeling examples

2a7e8e16

09 Dec, 2020 1 commit

Flax Masked Language Modeling training example (#8728) · 75627148

Funtowicz Morgan authored Dec 09, 2020



* Remove "Model" suffix from Flax models to look more :hugs:
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Initial working (forward + backward) for Flax MLM training example.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Simply code
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Addressing comments, using module and moving to LM task.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Restore parameter name "module" wrongly renamed model.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Restore correct output ordering...
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Actually commit the example 😅

Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Add FlaxBertModelForMaskedLM after rebasing.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Make it possible to initialize the training from scratch
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Reuse flax linen example of cross entropy loss
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Added specific data collator for flax
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Remove todo for data collator
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Added evaluation step
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Added ability to provide dtype to support bfloat16 on TPU
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Enable flax tensorboard output
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Enable jax.pmap support.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Ensure batches are correctly sized to be dispatched with jax.pmap
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Enable bfloat16 with --fp16 cmdline args
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Correctly export metrics to tensorboard
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Added dropout and ability to use it.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Effectively enable & disable during training and evaluation steps.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Oops.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Enable specifying kernel initializer scale
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Style.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Added warmup step to the learning rate scheduler.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Fix typo.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Print training loss
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Make style
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* fix linter issue (flake8)
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Fix model matching
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Fix dummies
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Fix non default dtype on Flax models
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Use the same create_position_ids_from_input_ids for FlaxRoberta
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Make Roberta attention as Bert
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* fix copy
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Wording.
Co-authored-by: Marc van Zee <marcvanzee@gmail.com>
Co-authored-by: Marc van Zee <marcvanzee@gmail.com>

75627148