Commits · 0903fc80b5fd17ff5b9350fc497a3ed381800656 · chenpangpang / transformers

10 Oct, 2022 2 commits
- Dev version · 10100979
  Lysandre authored Oct 10, 2022
  
  10100979
- Fix the error message in run_t5_mlm_flax.py (#19282) · e150c4e2
  Kaiyu Yang authored Oct 10, 2022
  
  e150c4e2
14 Sep, 2022 1 commit
- Dev version · 16913b3c
  Lysandre authored Sep 14, 2022
  
  16913b3c
09 Sep, 2022 1 commit
- [JAX] Replace all jax.tree_* calls with jax.tree_util.tree_* (#18361) · e6f221c8
  Sanchit Gandhi authored Sep 09, 2022
```
* [JAX] Replace all jax.tree_* calls with jax.tree_util.tree_*

* fix double tree_util
```
  e6f221c8
14 Aug, 2022 1 commit

Flax Remat for LongT5 (#17994) · d6eeb871

Karim Foda authored Aug 14, 2022



* [Flax] Add remat (gradient checkpointing)

* fix variable naming in test

* flip: checkpoint using a method

* fix naming

* fix class naming

* apply PVP's suggestions from code review

* add gradient_checkpointing to examples

* Add gradient_checkpointing to run_mlm_flax

* Add remat to longt5

* Add gradient checkpointing test longt5

* Fix args errors

* Fix remaining tests

* Make fixup & quality fixes

* replace kwargs

* remove unnecessary kwargs

* Make fixup changes

* revert long_t5_flax changes

* Remove return_dict and copy to LongT5

* Remove test_gradient_checkpointing
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>

d6eeb871

06 Aug, 2022 1 commit

`transformers-cli login` => `huggingface-cli login` (#18490) · 9129fd03

Julien Chaumond authored Aug 06, 2022

* zero chance anyone's using that constant no?

* `transformers-cli login` => `huggingface-cli login`

* `transformers-cli repo create` => `huggingface-cli repo create`

* `make style`

9129fd03

01 Aug, 2022 2 commits

Add Flax BART pretraining script (#18297) · 3909d7f1

Duong A. Nguyen authored Aug 01, 2022



* add bart pretraining flax script

* fixup

* add bart pretraining flax script

* add BART to README

* add BART to README

* add BART to README

* add BART to README

* add BART to README

* add bos eos document

* Update README.md

* Update README.md

* Update examples/flax/language-modeling/run_bart_dlm_flax.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* final

* final

* final

* remove use_auth_token in from_config
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

3909d7f1

Fix ROUGE add example check and update README (#18398) · 941d2331
Sylvain Gugger authored Aug 01, 2022
```
* Fix ROUGE add example check and update README

* Stay consistent in values
```
941d2331

29 Jul, 2022 1 commit

Replace `as_target` context managers by direct calls (#18325) · 986526a0

Sylvain Gugger authored Jul 29, 2022



* Preliminary work on tokenizers

* Quality + fix tests

* Treat processors

* Fix pad

* Remove all uses of  in tests, docs and examples

* Replace all as_target_tokenizer

* Fix tests

* Fix quality

* Update examples/flax/image-captioning/run_image_captioning_flax.py
Co-authored-by: amyeroberts <amy@huggingface.co>

* Style
Co-authored-by: amyeroberts <amy@huggingface.co>

986526a0

28 Jul, 2022 1 commit

Migrate metrics used in flax examples to Evaluate (#18348) · da503ea0

Vijay S Kalmath authored Jul 28, 2022

Currently, tensorflow examples use the `load_metric` function from the
Datasets library; this commit migrates the call to the `load` function
from the Evaluate library.

da503ea0

27 Jul, 2022 3 commits

Dev version · c89a592e
Lysandre authored Jul 27, 2022

c89a592e

[Flax] Fix incomplete batches in example scripts (#17863) · 7490a97c

Sanchit Gandhi authored Jul 27, 2022

* [Flax] Fix incomplete batches in example scripts

* fix dataloader batching

* convert jnp batch idxs to np array

* add missing `pad_shard_unpad` to final prediction generate step

* only `pad_shard_unpad` at inference time

* merge conflicts

* remove incomplete batch step from eval

* fix run_qa.py

* add `pad_shard_unpad` to run_flax_ner.py

* add `pad_shard_unpad` to run_flax_glue.py

* add `pad_shard_unpad` to run_image_classification.py

* make style

* fix mlm flax eval batches

* remove redundant imports
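
The bullets above describe padding an incomplete final batch, sharding it across devices, and dropping the padding afterwards. A minimal pure-Python sketch of that idea follows. It is not the implementation of the `pad_shard_unpad` helper named in the commit; `pad_run_unpad`, `num_devices`, `pad_value`, and `double` are hypothetical names used only for illustration.

```python
from typing import Callable, List, Sequence


def pad_run_unpad(
    fn: Callable[[List[int]], List[int]],
    batch: Sequence[int],
    num_devices: int,
    pad_value: int = 0,
) -> List[int]:
    """Pad `batch` to a multiple of `num_devices`, run `fn` per shard, drop the padding."""
    n = len(batch)
    remainder = n % num_devices
    padding = 0 if remainder == 0 else num_devices - remainder
    padded = list(batch) + [pad_value] * padding

    # Split the padded batch into one equally sized shard per (simulated) device.
    per_device = len(padded) // num_devices
    shards = [padded[i * per_device:(i + 1) * per_device] for i in range(num_devices)]

    # Run the per-shard function, then keep only the outputs of the real examples.
    outputs = [y for shard in shards for y in fn(shard)]
    return outputs[:n]


def double(shard: List[int]) -> List[int]:
    # Stand-in for an evaluation step that returns one output per example.
    return [2 * x for x in shard]


result = pad_run_unpad(double, [1, 2, 3, 4, 5], num_devices=4)
```

With five examples and four simulated devices, three padding entries are added and then discarded, so every real example still contributes to the result.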

7490a97c

Generalize decay_mask_fn to apply mask to all LayerNorm params (#18273) · 170fcaa6
Duong A. Nguyen authored Jul 27, 2022
```
* generalize decay_mask_fn to find all layernorm params

* fixup

* generalising decay_mask_fn
```
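
A rough sketch of what a generalized weight-decay mask could look like, assuming LayerNorm modules can be recognized by a substring of their module name. The `flatten` and `unflatten` helpers and the `LAYER_NORM_HINTS` tuple are hypothetical stand-ins written here for illustration, not the code from the commit.

```python
from typing import Any, Dict, Tuple

Path = Tuple[str, ...]

# Assumption: LayerNorm modules can be spotted from these name fragments.
LAYER_NORM_HINTS = ("layernorm", "layer_norm")


def flatten(tree: Dict[str, Any], prefix: Path = ()) -> Dict[Path, Any]:
    """Turn a nested dict into {path_tuple: leaf}."""
    flat: Dict[Path, Any] = {}
    for key, value in tree.items():
        path = prefix + (key,)
        if isinstance(value, dict):
            flat.update(flatten(value, path))
        else:
            flat[path] = value
    return flat


def unflatten(flat: Dict[Path, Any]) -> Dict[str, Any]:
    """Inverse of `flatten`."""
    tree: Dict[str, Any] = {}
    for path, value in flat.items():
        node = tree
        for key in path[:-1]:
            node = node.setdefault(key, {})
        node[path[-1]] = value
    return tree


def decay_mask_fn(params: Dict[str, Any]) -> Dict[str, Any]:
    """Return True for parameters that should receive weight decay."""
    def is_layer_norm(path: Path) -> bool:
        # Inspect module names only, not the leaf parameter name.
        return any(hint in name.lower() for name in path[:-1] for hint in LAYER_NORM_HINTS)

    flat = flatten(params)
    mask = {path: path[-1] != "bias" and not is_layer_norm(path) for path in flat}
    return unflatten(mask)


params = {
    "encoder": {
        "dense": {"kernel": 1.0, "bias": 0.0},
        "final_layer_norm": {"scale": 1.0, "bias": 0.0},
        "LayerNorm_0": {"scale": 1.0, "bias": 0.0},
    }
}
mask = decay_mask_fn(params)
```

Matching on name fragments rather than one fixed module name is what lets the mask cover LayerNorm parameters wherever they sit in the tree.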
170fcaa6

19 Jul, 2022 1 commit
- Remove use_auth_token from the from_config method (#18192) · 4bea6584
  Duong A. Nguyen authored Jul 19, 2022
```
* remove use_auth_token from from_config

* restore use_auth_token from_pretrained run_t5_mlm_flax
```
  4bea6584
11 Jul, 2022 1 commit

Fix RESOURCE_EXHAUSTED error when dealing with large datasets in Flax example scripts (#18069) · 1e8140ca

Duong A. Nguyen authored Jul 11, 2022

* Fix RESOURCE_EXHAUSTED error for large datasets on Flax example scripts

* using np.permutation for creating batch_idx

* train_samples_idx -> training_samples_idx

* fix type hints
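
A minimal sketch of building shuffled batch indices on the host with NumPy, in the spirit of the bullets above. It assumes the indices never need to live on the accelerator. The function name `generate_batch_splits`, its parameters, and the use of a NumPy `Generator` are illustrative choices, not necessarily what the scripts do.

```python
import math

import numpy as np


def generate_batch_splits(num_samples, batch_size, rng, drop_last=True):
    """Return a list of index arrays, one per batch, shuffled on the host."""
    samples_idx = rng.permutation(num_samples)
    if drop_last:
        # Keep only as many indices as fill complete batches.
        usable = (num_samples // batch_size) * batch_size
        return list(samples_idx[:usable].reshape(-1, batch_size))
    # Keep every sample; the batches may then differ slightly in size.
    num_batches = math.ceil(num_samples / batch_size)
    return np.array_split(samples_idx, num_batches)


rng = np.random.default_rng(0)
train_batches = generate_batch_splits(10, 4, rng, drop_last=True)
eval_batches = generate_batch_splits(10, 4, rng, drop_last=False)
```

Only the small per-batch index arrays are materialized, and they stay in host memory until a batch is actually assembled.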

1e8140ca

16 Jun, 2022 1 commit
- v4.21.0.dev0 · 7c6ec195
  Sylvain Gugger authored Jun 16, 2022
  
  7c6ec195
07 Jun, 2022 1 commit

Add examples telemetry (#17552) · 3cab9027

Sylvain Gugger authored Jun 07, 2022

* Add examples telemetry

* Alternative approach

* Add to all other examples

* Add to templates as well

* Put framework separately

* Same for TensorFlow

3cab9027

12 May, 2022 2 commits
- Black preview (#17217) · afe5d42d
  Sylvain Gugger authored May 12, 2022
```
* Black preview

* Fixup too!

* Fix check copies

* Use the same version as the CI

* Bump black
```
  afe5d42d
- Dev version · 5294fa12
  Lysandre Debut authored May 12, 2022
  
  5294fa12
27 Apr, 2022 1 commit

Misc. fixes for Pytorch QA examples: (#16958) · c82e017a

Leonid Boytsov authored Apr 27, 2022

1. Fixes evaluation errors that pop up when you train/eval on SQuAD v2 (one newly encountered, and one previously reported in "Running SQuAD 1.0 sample command raises IndexError" (#15401) but not completely fixed).
2. Removes boolean arguments that don't use store_true. Please don't use these: ANY non-empty string is converted to True in this case, which is clearly not the desired behavior (and creates a LOT of confusion).
3. All no-trainer test scripts now save metric values in the same way (with the right prefix, eval_), consistent with the trainer-based versions.
4. Adds the forgotten model.eval() call in the no-trainer versions. This improved some results, but not all (see the discussion at the end). Please see the F1 scores and the discussion below.
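
The pitfall in item 2 can be reproduced with the standard `argparse` module. The flag names `--broken_flag` and `--good_flag` below are hypothetical and chosen only for this illustration.

```python
import argparse

parser = argparse.ArgumentParser()

# Pitfall: argparse calls bool() on the raw string, and any non-empty
# string (including "False") is truthy.
parser.add_argument("--broken_flag", type=bool, default=False)

# Safer: a switch that is False unless the flag is present on the command line.
parser.add_argument("--good_flag", action="store_true")

broken = parser.parse_args(["--broken_flag", "False"])
absent = parser.parse_args([])
present = parser.parse_args(["--good_flag"])
```

Here `broken.broken_flag` ends up `True` even though the user typed `False`, whereas `good_flag` is `True` only when the flag is actually passed.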

c82e017a

19 Apr, 2022 1 commit

Correct Logging of Eval metric to Tensorboard (#16825) · b5c6a63e

Jeevesh Juneja authored Apr 19, 2022

* Correct Logging of Eval metric to Tensorboard

An empty dictionary ``eval_metrics`` was being logged; it is replaced by ``eval_metric``, the output dictionary of ``metric.compute()``.

* Remove unused variable

b5c6a63e

11 Apr, 2022 2 commits

Fix example logs repeating themselves (#16669) · 69233cf0

Zachary Mueller authored Apr 11, 2022

Move declaration of log streams to before tests, so that results won't get compounded on top of each other

69233cf0

Fix t5 shard on TPU Pods (#16527) · 5e686757

Ahmed Elnaggar authored Apr 11, 2022



* Fix t5 shard on TPU Pods

The current script doesn't work properly on a TPU pod because the global batch is not divided correctly per host.
This pull request fixes the issue by dividing the global batch among the hosts before it is sharded on each host.

* fix style
Co-authored-by: ahmed-elnaggar <ahmed.elnaggar@allianz.com>
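
A minimal sketch of dividing a global batch among hosts, as the message above describes. It assumes the batch size is divisible by the number of hosts. `per_host_slice`, `process_index`, and `process_count` are hypothetical names; in a JAX program the last two would typically come from `jax.process_index()` and `jax.process_count()`.

```python
from typing import List, Sequence


def per_host_slice(global_batch: Sequence[int], process_index: int, process_count: int) -> List[int]:
    """Return the contiguous part of the global batch that belongs to one host."""
    if len(global_batch) % process_count != 0:
        raise ValueError("global batch size must be divisible by the number of hosts")
    per_host = len(global_batch) // process_count
    start = process_index * per_host
    return list(global_batch[start:start + per_host])


global_batch = list(range(8))
# Simulate four hosts, each taking its own slice of the same global batch.
host_batches = [per_host_slice(global_batch, i, 4) for i in range(4)]
```

Each host then shards only its own slice across its local devices, so no example is processed twice across the pod.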

5e686757

06 Apr, 2022 1 commit
- Dev version · a180efe7
  Lysandre Debut authored Apr 06, 2022
  
  a180efe7
04 Apr, 2022 1 commit
- Add use_auth to load_datasets for private datasets to PT and TF examples (#16521) · 24a85cca
  Karim Foda authored Apr 04, 2022
```
* fix formatting and remove use_auth

* Add use_auth_token to Flax examples
```
  24a85cca
30 Mar, 2022 1 commit
- [examples] max samples can't be bigger than the len of dataset (#16501) · a73281e3
  Stas Bekman authored Mar 30, 2022
```
* [examples] max samples can't be bigger than the len of dataset

* do tf and flax
```
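
The constraint in the commit title amounts to clamping the requested sample count to the dataset length before selecting. A small sketch with a plain list standing in for a dataset; `select_max_samples` is a hypothetical helper, not a function from the example scripts.

```python
from typing import List, Optional, Sequence


def select_max_samples(dataset: Sequence[int], max_samples: Optional[int]) -> List[int]:
    """Keep at most `max_samples` items, never more than the dataset holds."""
    if max_samples is None:
        return list(dataset)
    # Clamp so a too-large request cannot index past the end of the dataset.
    n = min(len(dataset), max_samples)
    return [dataset[i] for i in range(n)]


small = select_max_samples([10, 20, 30], max_samples=100)
truncated = select_max_samples([10, 20, 30], max_samples=2)
```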
  a73281e3
28 Mar, 2022 1 commit
- Update run_t5_mlm_flax.py (#16421) · 8049dfa4
  Yongrae Jo authored Mar 28, 2022
```
Fix typo in comment: proprocessed -> preprocessed
```
  8049dfa4
25 Mar, 2022 1 commit
- Rename master to main for notebooks links and leftovers (#16397) · 867f3950
  Sylvain Gugger authored Mar 25, 2022
  
  867f3950
23 Mar, 2022 2 commits

Reorganize file utils (#16264) · 4975002d

Sylvain Gugger authored Mar 23, 2022

* Split file_utils in several submodules

* Fixes

* Add back more objects

* More fixes

* Who exactly decided to import that from there?

* Second suggestion to code with code review

* Revert wrong move

* Fix imports

* Adapt all imports

* Adapt all imports everywhere

* Revert this import, will fix in a separate commit

4975002d

Updates the default branch from master to main (#16326) · eca77f47

Lysandre Debut authored Mar 23, 2022



* Updates the default branch from master to main

* Links from `master` to `main`

* Typo

* Update examples/flax/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

eca77f47

08 Mar, 2022 1 commit
- Speedup training by using numpy instead of jnp for batch shuffling (#15963) · 91fb62d0
  Yeb Havinga authored Mar 08, 2022
```
Speedup training by using numpy instead of jnp for batch shuffling
Co-authored-by: Yeb Havinga <y.t.havinga@mgrid.net>
```
  91fb62d0
04 Mar, 2022 1 commit
- [FlaxT5 Example] fix flax t5 example pretraining (#15835) · 10b76987
  Patrick von Platen authored Mar 04, 2022
  
  10b76987
03 Mar, 2022 1 commit
- v4.18.0.dev.0 · 79d28e80
  Sylvain Gugger authored Mar 03, 2022
  
  79d28e80
23 Feb, 2022 1 commit

[Test refactor 1/5] Per-folder tests reorganization (#15725) · 29c10a41

Lysandre Debut authored Feb 23, 2022



* Per-folder tests reorganization
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Stas Bekman <stas@stason.org>

29c10a41

01 Feb, 2022 2 commits
- Harder check for IndexErrors in QA scripts (#15438) · d0b5ed11
  Sylvain Gugger authored Feb 01, 2022
```
* Harder check for IndexErrors in QA scripts

* Make test stronger
```
  d0b5ed11
- Update README.md (#15462) · d2749cf7
  Kamal Raj authored Feb 01, 2022
```
fix typo
```
  d2749cf7
31 Jan, 2022 1 commit

[examples/Flax] add a section about GPUs (#15198) · 87918d32

Suraj Patil authored Jan 31, 2022



* add a section about GPUs

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

87918d32

27 Jan, 2022 2 commits
- Docs for version v4.16.0 · eab33810
  Lysandre authored Jan 27, 2022
  
  eab33810
- Release: v4.16.0 · f87db5e4
  Lysandre authored Jan 27, 2022
  
  f87db5e4
19 Jan, 2022 1 commit

[FLAX] glue training example refactor (#13815) · d1f5ca1a

Kamal Raj authored Jan 19, 2022

* refactor run_flax_glue.py

* updated readme

* rm unused import and args typo fix

* refactor

* make consistent arg name across task

* has_tensorboard check

* argparse -> argument dataclasses

* refactor according to review

* fix

d1f5ca1a