Commits · c89a592e87e1fe67f8c7d675e8029ef997651e8e · chenpangpang / transformers

27 Jul, 2022 3 commits

Dev version · c89a592e
Lysandre authored Jul 27, 2022

c89a592e

[Flax] Fix incomplete batches in example scripts (#17863) · 7490a97c

Sanchit Gandhi authored Jul 27, 2022

* [Flax] Fix incomplete batches in example scripts

* fix dataloader batching

* convert jnp batch idxs to np array

* add missing `pad_shard_unpad` to final prediction generate step

* only `pad_shard_unpad` at inference time

* merge conflicts

* remove incomplete batch step from eval

* fix run_qa.py

* add `pad_shard_unpad` to run_flax_ner.py

* add `pad_shard_unpad` to run_flax_glue.py

* add `pad_shard_unpad` to run_image_classification.py

* make style

* fix mlm flax eval batches

* remove redundant imports
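
The pad, shard, unpad idea named in the bullets above can be sketched in plain Python. This is an illustration only, not the Flax helper's implementation: `pad_shard_unpad_sketch`, `fake_pmapped_double`, and the choice to pad by repeating the last item are all assumptions made for this example, and nested lists stand in for per-device array shards.

```python
from typing import Callable, List, Sequence


def pad_shard_unpad_sketch(
    fn: Callable[[List[List[int]]], List[List[int]]],
    num_devices: int,
) -> Callable[[Sequence[int]], List[int]]:
    """Wrap `fn` so it accepts batches whose size is not a multiple of `num_devices`."""

    def wrapped(batch: Sequence[int]) -> List[int]:
        n = len(batch)
        if n == 0:
            return []
        # Pad: extend the batch to the next multiple of num_devices
        # (hypothetical choice: repeat the last item).
        pad = (-n) % num_devices
        padded = list(batch) + [batch[-1]] * pad
        # Shard: give each device an equally sized slice.
        per_device = len(padded) // num_devices
        shards = [padded[i * per_device:(i + 1) * per_device] for i in range(num_devices)]
        outputs = fn(shards)
        # Unpad: flatten the per-device outputs and drop the padded tail.
        flat = [y for shard in outputs for y in shard]
        return flat[:n]

    return wrapped


def fake_pmapped_double(shards: List[List[int]]) -> List[List[int]]:
    """Stand-in for a per-device eval step: doubles every item in every shard."""
    return [[2 * x for x in shard] for shard in shards]


eval_step = pad_shard_unpad_sketch(fake_pmapped_double, num_devices=4)
result = eval_step([1, 2, 3, 4, 5, 6])  # 6 items is not a multiple of 4 devices
```

With this wrapper the final, incomplete evaluation batch is processed rather than dropped, which is the behavior the commit title describes.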

7490a97c

Generalize decay_mask_fn to apply mask to all LayerNorm params (#18273) · 170fcaa6
Duong A. Nguyen authored Jul 27, 2022
```
* generalize decay_mask_fn to find all layernorm params

* fixup

* generalising decay_mask_fn
```
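
As a rough sketch of what a generalized decay mask might look like, the example below marks every parameter for weight decay except biases and anything under a module whose name looks like a LayerNorm. It is not the code from the commit: the helper names, the name hints in `LAYER_NORM_HINTS`, and the toy parameter tree are assumptions, and a plain nested dict stands in for a Flax parameter tree.

```python
from typing import Any, Dict, Tuple

# Hypothetical substrings used to recognise LayerNorm modules by name.
LAYER_NORM_HINTS = ("layernorm", "layer_norm", "ln")


def flatten(tree: Dict[str, Any], prefix: Tuple[str, ...] = ()) -> Dict[Tuple[str, ...], Any]:
    """Flatten a nested dict into {path tuple: leaf value}."""
    flat: Dict[Tuple[str, ...], Any] = {}
    for key, value in tree.items():
        path = prefix + (key,)
        if isinstance(value, dict):
            flat.update(flatten(value, path))
        else:
            flat[path] = value
    return flat


def decay_mask_sketch(params: Dict[str, Any]) -> Dict[Tuple[str, ...], bool]:
    """Return True for parameters that should receive weight decay."""
    mask = {}
    for path in flatten(params):
        # A parameter belongs to a LayerNorm if any enclosing module name matches a hint.
        in_layer_norm = any(
            hint in part.lower() for part in path[:-1] for hint in LAYER_NORM_HINTS
        )
        mask[path] = path[-1] != "bias" and not in_layer_norm
    return mask


params = {
    "encoder": {
        "dense": {"kernel": 0.0, "bias": 0.0},
        "LayerNorm": {"scale": 0.0, "bias": 0.0},
        "final_layer_norm": {"weight": 0.0},
    }
}
mask = decay_mask_sketch(params)
```

Matching on substrings is crude (a short hint such as `ln` can match unrelated module names), so a real script would likely want a stricter rule.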
170fcaa6

19 Jul, 2022 1 commit
- Remove use_auth_token from the from_config method (#18192) · 4bea6584
  Duong A. Nguyen authored Jul 19, 2022
```
* remove use_auth_token from from_config

* restore use_auth_token from_pretrained run_t5_mlm_flax
```
  4bea6584
11 Jul, 2022 1 commit

Fix RESOURCE_EXHAUSTED error when dealing with large datasets in Flax example scripts (#18069) · 1e8140ca

Duong A. Nguyen authored Jul 11, 2022

* Fix RESOURCE_EXHAUSTED error for large datasets on Flax example scripts

* using np.permutation for creating batch_idx

* train_samples_idx -> training_samples_idx

* fix type hints
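
A minimal sketch of building shuffled batch indices on the host, in the spirit of the `np.permutation` bullet above. The function name and parameters are hypothetical, and the standard-library `random` module stands in for NumPy so the example has no dependencies; the point is only that the index list is created in host memory, presumably to avoid allocating it on the accelerator.

```python
import random
from typing import List


def training_batch_indices(num_samples: int, batch_size: int, seed: int) -> List[List[int]]:
    """Shuffle sample indices on the host and split them into full batches."""
    rng = random.Random(seed)
    samples_idx = list(range(num_samples))
    rng.shuffle(samples_idx)
    # Drop the trailing incomplete batch so every training batch has the same size.
    steps_per_epoch = num_samples // batch_size
    return [
        samples_idx[step * batch_size:(step + 1) * batch_size]
        for step in range(steps_per_epoch)
    ]


batches = training_batch_indices(num_samples=10, batch_size=4, seed=0)
```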

1e8140ca

16 Jun, 2022 1 commit
- v4.21.0.dev0 · 7c6ec195
  Sylvain Gugger authored Jun 16, 2022
  
  7c6ec195
07 Jun, 2022 1 commit

Add examples telemetry (#17552) · 3cab9027

Sylvain Gugger authored Jun 07, 2022

* Add examples telemetry

* Alternative approach

* Add to all other examples

* Add to templates as well

* Put framework separately

* Same for TensorFlow

3cab9027

12 May, 2022 2 commits
- Black preview (#17217) · afe5d42d
  Sylvain Gugger authored May 12, 2022
```
* Black preview

* Fixup too!

* Fix check copies

* Use the same version as the CI

* Bump black
```
  afe5d42d
- Dev version · 5294fa12
  Lysandre Debut authored May 12, 2022
  
  5294fa12
27 Apr, 2022 1 commit

Misc. fixes for Pytorch QA examples: (#16958) · c82e017a

Leonid Boytsov authored Apr 27, 2022

1. Fixes evaluation errors that appear when you train or evaluate on SQuAD v2: one newly encountered, and one previously reported in "Running SQuAD 1.0 sample command raises IndexError" (#15401) but not completely fixed.
2. Removes boolean arguments that don't use store_true. Please don't use these: ANY non-empty string is converted to True in this case, which is clearly not the desired behavior (and it creates a LOT of confusion).
3. All no-trainer test scripts now save metric values in the same way (with the right prefix, eval_), which is consistent with the trainer-based versions.
4. Adds the forgotten model.eval() call in the no-trainer versions. This improved some results, but not all of them (see the discussion at the end). Please see the F1 scores and the discussion below.
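
The pitfall in point 2 can be reproduced with the standard `argparse` module. The flag names below are hypothetical and not taken from the example scripts; the sketch only shows why `type=bool` misbehaves and how `action="store_true"` avoids it.

```python
import argparse

parser = argparse.ArgumentParser()
# Pitfall: type=bool calls bool() on the raw string, and bool("False") is True.
parser.add_argument("--pad_to_max_length_bad", type=bool, default=False)
# Safer: a flag that is False unless it is present on the command line.
parser.add_argument("--pad_to_max_length", action="store_true")

bad = parser.parse_args(["--pad_to_max_length_bad", "False"])
good_off = parser.parse_args([])
good_on = parser.parse_args(["--pad_to_max_length"])

print(bad.pad_to_max_length_bad)    # True, even though the user typed "False"
print(good_off.pad_to_max_length)   # False
print(good_on.pad_to_max_length)    # True
```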

c82e017a

19 Apr, 2022 1 commit

Correct Logging of Eval metric to Tensorboard (#16825) · b5c6a63e

Jeevesh Juneja authored Apr 19, 2022

* Correct Logging of Eval metric to Tensorboard

An empty dictionary ``eval_metrics`` was being logged; it is replaced by ``eval_metric``, which is the output dictionary of ``metric.compute()``.

* Remove unused variable

b5c6a63e

11 Apr, 2022 2 commits

Fix example logs repeating themselves (#16669) · 69233cf0

Zachary Mueller authored Apr 11, 2022

Move declaration of log streams to before tests, so that results won't get compounded on top of each other

69233cf0

Fix t5 shard on TPU Pods (#16527) · 5e686757

Ahmed Elnaggar authored Apr 11, 2022



* Fix t5 shard on TPU Pods

The current script doesn't work properly on a TPU pod because the global batch is not divided correctly per host.
This pull request fixes the issue by dividing the global batch among the hosts before it is sharded on each host.

* fix style
Co-authored-by: ahmed-elnaggar <ahmed.elnaggar@allianz.com>
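
The per-host division described above might look roughly like the following. `host_slice` and its parameters are hypothetical names for this sketch; in a JAX program the host index and host count would typically come from `jax.process_index()` and `jax.process_count()`, which are passed in as plain integers here so the example runs without JAX.

```python
from typing import List, Sequence


def host_slice(global_batch: Sequence[int], process_index: int, process_count: int) -> List[int]:
    """Return the contiguous part of the global batch that belongs to one host."""
    if len(global_batch) % process_count != 0:
        raise ValueError("global batch size must be divisible by the number of hosts")
    per_host = len(global_batch) // process_count
    start = process_index * per_host
    return list(global_batch[start:start + per_host])


global_batch = list(range(16))
# Each of the 4 hosts takes its own quarter, which it can then shard over local devices.
host_batches = [host_slice(global_batch, i, 4) for i in range(4)]
```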

5e686757

06 Apr, 2022 1 commit
- Dev version · a180efe7
  Lysandre Debut authored Apr 06, 2022
  
  a180efe7
04 Apr, 2022 1 commit
- Add use_auth to load_datasets for private datasets to PT and TF examples (#16521) · 24a85cca
  Karim Foda authored Apr 04, 2022
```
* fix formatting and remove use_auth

* Add use_auth_token to Flax examples
```
  24a85cca
30 Mar, 2022 1 commit
- [examples] max samples can't be bigger than the len of dataset (#16501) · a73281e3
  Stas Bekman authored Mar 30, 2022
```
* [examples] max samples can't be bigger than the len of dataset

* do tf and flax
```
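
A small sketch of the clamp implied by the commit title: the requested sample count is capped at the dataset length before the subset is taken. `limit_samples` is a hypothetical helper and a plain list stands in for a dataset object.

```python
from typing import List, Optional


def limit_samples(dataset: List[str], max_samples: Optional[int]) -> List[str]:
    """Keep at most `max_samples` items, never more than the dataset holds."""
    if max_samples is None:
        return dataset
    # Clamp so that a too-large request cannot index past the end of the dataset.
    limit = min(len(dataset), max_samples)
    return dataset[:limit]


dataset = ["a", "b", "c"]
subset_small = limit_samples(dataset, 2)
subset_large = limit_samples(dataset, 100)
```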
  a73281e3
28 Mar, 2022 1 commit
- Update run_t5_mlm_flax.py (#16421) · 8049dfa4
  Yongrae Jo authored Mar 28, 2022
```
Fix typo in comment: proprocessed -> preprocessed
```
  8049dfa4
25 Mar, 2022 1 commit
- Rename master to main for notebooks links and leftovers (#16397) · 867f3950
  Sylvain Gugger authored Mar 25, 2022
  
  867f3950
23 Mar, 2022 2 commits

Reorganize file utils (#16264) · 4975002d

Sylvain Gugger authored Mar 23, 2022

* Split file_utils in several submodules

* Fixes

* Add back more objects

* More fixes

* Who exactly decided to import that from there?

* Second suggestion to code with code review

* Revert wrong move

* Fix imports

* Adapt all imports

* Adapt all imports everywhere

* Revert this import, will fix in a separate commit

4975002d

Updates the default branch from master to main (#16326) · eca77f47

Lysandre Debut authored Mar 23, 2022



* Updates the default branch from master to main

* Links from `master` to `main`

* Typo

* Update examples/flax/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

eca77f47

08 Mar, 2022 1 commit
- Speedup training by using numpy instead of jnp for batch shuffling (#15963) · 91fb62d0
  Yeb Havinga authored Mar 08, 2022
```
Speedup training by using numpy instead of jnp for batch shuffling
Co-authored-by: Yeb Havinga <y.t.havinga@mgrid.net>
```
  91fb62d0
04 Mar, 2022 1 commit
- [FlaxT5 Example] fix flax t5 example pretraining (#15835) · 10b76987
  Patrick von Platen authored Mar 04, 2022
  
  10b76987
03 Mar, 2022 1 commit
- v4.18.0.dev.0 · 79d28e80
  Sylvain Gugger authored Mar 03, 2022
  
  79d28e80
23 Feb, 2022 1 commit

[Test refactor 1/5] Per-folder tests reorganization (#15725) · 29c10a41

Lysandre Debut authored Feb 23, 2022



* Per-folder tests reorganization
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Stas Bekman <stas@stason.org>

29c10a41

01 Feb, 2022 2 commits
- Harder check for IndexErrors in QA scripts (#15438) · d0b5ed11
  Sylvain Gugger authored Feb 01, 2022
```
* Harder check for IndexErrors in QA scripts

* Make test stronger
```
  d0b5ed11
- Update README.md (#15462) · d2749cf7
  Kamal Raj authored Feb 01, 2022
```
fix typo
```
  d2749cf7
31 Jan, 2022 1 commit

[examples/Flax] add a section about GPUs (#15198) · 87918d32

Suraj Patil authored Jan 31, 2022



* add a section about GPUs

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

87918d32

27 Jan, 2022 2 commits
- Docs for version v4.16.0 · eab33810
  Lysandre authored Jan 27, 2022
  
  eab33810
- Release: v4.16.0 · f87db5e4
  Lysandre authored Jan 27, 2022
  
  f87db5e4
19 Jan, 2022 1 commit

[FLAX] glue training example refactor (#13815) · d1f5ca1a

Kamal Raj authored Jan 19, 2022

* refactor run_flax_glue.py

* updated readme

* rm unused import and args typo fix

* refactor

* make consistent arg name across task

* has_tensorboard check

* argparse -> argument dataclasses

* refactor according to review

* fix

d1f5ca1a

13 Jan, 2022 1 commit
- [examples/flax/language-modeling] set loglevel (#15129) · 762416ff
  Stas Bekman authored Jan 13, 2022
  
  762416ff
06 Jan, 2022 1 commit

Add Flax image captioning example (#14864) · 9f89fa02

Yih-Dar authored Jan 06, 2022



* add image captioning example

* update README

* fix style & quality

* simplify

* apply review suggestions

* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Apply review suggestions

* add comments about using np instead jax array

* remove unused lines

* add model creation script

* only support from_pretrained

* fix style

* fix

* not use cache_dir when creating model

* fix tokenizer creation

* update README

* fix quality

* apply suggestion

* simplify some blocks

* Update examples/flax/image-captioning/README.md


* Update examples/flax/image-captioning/run_image_captioning_flax.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* apply suggestion
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

9f89fa02

22 Dec, 2021 2 commits
- Docs for v4.16.0dev0 · fa39ff9f
  Patrick von Platen authored Dec 22, 2021
  
  fa39ff9f
- Release: v4.15.0 · 05fa1a7a
  Patrick von Platen authored Dec 22, 2021
  
  05fa1a7a
15 Dec, 2021 3 commits
- Docs for v4.14.0 · 7c9c41f4
  Lysandre authored Dec 15, 2021
  
  7c9c41f4
- Release: v4.14.0 · 960d8cb4
  Lysandre authored Dec 15, 2021
  
  960d8cb4
- Fix preprocess_function in run_summarization_flax.py (#14769) · a94105f9
  Yih-Dar authored Dec 15, 2021
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  a94105f9
14 Dec, 2021 1 commit
- Make data shuffling in `run_clm_flax.py` respect global seed (#13410) · 2a606f99
  Benjamin Minixhofer authored Dec 14, 2021
```
* use jax and jnp instead of numpy in data_loader

* return batches as np.ndarray
```
  2a606f99
12 Dec, 2021 1 commit
- [Flax examples] remove dependency on PyTorch training args (#14636) · 6a025487
  Suraj Patil authored Dec 12, 2021
```
* use custom training arguments

* update tests
```
  6a025487
09 Dec, 2021 1 commit
- Docs for v4.14.0dev0 · ab31b3e4
  Lysandre authored Dec 09, 2021
  
  ab31b3e4