Commits · 40ea9ab2a1ad99f12e71c3a26215ad33df082ef9 · chenpangpang / transformers

12 Oct, 2023 1 commit
- Add many missing spaces in adjacent strings (#26751) · 40ea9ab2
  Tom Aarsen authored Oct 12, 2023
```
Add missing spaces in adjacent strings
```
  40ea9ab2
22 Sep, 2023 1 commit

feat: adding num_proc to load_dataset (#26326) · 910faa3e

Phuc Van Phan authored Sep 23, 2023

* feat: adding num_proc to load_dataset

* feat: add add_num_proc for run_mlm_flax

* feat: add num_proc for bart and t5

* chorse: remove

910faa3e

11 Sep, 2023 2 commits
- docs: add space to docs (#26067) · 5af2c626
  Phuc Van Phan authored Sep 12, 2023
```
* docs: add space to docs

* docs: remove reduntant space
```
  5af2c626
- docs: update link huggingface map (#26077) · 9cebae64
  Phuc Van Phan authored Sep 11, 2023
  
  9cebae64
02 Aug, 2023 1 commit

Add `token` arugment in example scripts (#25172) · 149cb0cc

Yih-Dar authored Aug 02, 2023



* fix

* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

149cb0cc

28 Jul, 2023 2 commits

Update `use_auth_token` -> `token` in example scripts (#25167) · d53b8ad7

Yih-Dar authored Jul 28, 2023



* pytorch examples

* tensorflow examples

* flax examples

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

d53b8ad7

Fix `.push_to_hub` and cleanup `get_full_repo_name` usage (#25120) · 6232c380

Lucain authored Jul 28, 2023

* Fix .push_to_hub and cleanup get_full_repo_name usage

* Do not rely on Python bool conversion magic

* request changes

6232c380

02 May, 2023 1 commit
- num_noise_spans should be <= num_items #22246 (#22938) · 805db1fe
  Alex Punnen authored May 02, 2023
  
  805db1fe
22 Feb, 2023 1 commit
- Apply ruff flake8-comprehensions (#21694) · 5e8c8eb5
  Aaron Gokaslan authored Feb 22, 2023
  
  5e8c8eb5
06 Feb, 2023 1 commit

Update quality tooling for formatting (#21480) · 6f79d264

Sylvain Gugger authored Feb 06, 2023

* Result of black 23.1

* Update target to Python 3.7

* Switch flake8 to ruff

* Configure isort

* Configure isort

* Apply isort with line limit

* Put the right black version

* adapt black in check copies

* Fix copies

6f79d264

18 Jan, 2023 1 commit

Adapt repository creation to latest hf_hub (#21158) · 05e72aa0

Sylvain Gugger authored Jan 18, 2023

* Adapt repository creation to latest hf_hub

* Update all examples

* Fix other tests, add Flax examples

* Address review comments

05e72aa0

13 Oct, 2022 1 commit

[Re-submit] Compute true loss Flax examples (#19504) · 4212bb0d

Duong A. Nguyen authored Oct 13, 2022



* Compute true loss

* fixup

* final

* final

* final

* Update examples/flax/language-modeling/run_bart_dlm_flax.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* jax.tree_map => jax.tree_util.tree_map

* Compute true loss

* final

* fixup

* final

* final

* Update examples/flax/language-modeling/run_bart_dlm_flax.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* jax.tree_map => jax.tree_util.tree_map
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

4212bb0d

10 Oct, 2022 1 commit
- Fix the error message in run_t5_mlm_flax.py (#19282) · e150c4e2
  Kaiyu Yang authored Oct 10, 2022
  
  e150c4e2
09 Sep, 2022 1 commit
- [JAX] Replace all jax.tree_* calls with jax.tree_util.tree_* (#18361) · e6f221c8
  Sanchit Gandhi authored Sep 09, 2022
```
* [JAX] Replace all jax.tree_* calls with jax.tree_util.tree_*

* fix double tree_util
```
  e6f221c8
06 Aug, 2022 1 commit

`transformers-cli login` => `huggingface-cli login` (#18490) · 9129fd03

Julien Chaumond authored Aug 06, 2022

* zero chance anyone's using that constant no?

* `transformers-cli login` => `huggingface-cli login`

* `transformers-cli repo create` => `huggingface-cli repo create`

* `make style`

9129fd03

01 Aug, 2022 1 commit

Add Flax BART pretraining script (#18297) · 3909d7f1

Duong A. Nguyen authored Aug 01, 2022



* add bart pretraining flax script

* fixup

* add bart pretraining flax script

* add BART to README

* add BART to README

* add BART to README

* add BART to README

* add BART to README

* add bos eos document

* Update README.md

* Update README.md

* Update examples/flax/language-modeling/run_bart_dlm_flax.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* final

* final

* final

* remove use_auth_token ing from_config
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

3909d7f1

27 Jul, 2022 2 commits

[Flax] Fix incomplete batches in example scripts (#17863) · 7490a97c

Sanchit Gandhi authored Jul 27, 2022

* [Flax] Fix incomplete batches in example scripts

* fix dataloader batching

* convert jnp batch idxs to np array

* add missing `pad_shard_unpad` to final prediction generate step

* only `pad_shard_unpad` at inference time

* merge conflicts

* remove incomplete batch step from eval

* fix run_qa.py

* add `pad_shard_unpad` to run_flax_ner.py

* add `pad_shard_unpad` to run_flax_glue.py

* add `pad_shard_unpad` to run_image_classification.py

* make style

* fix mlm flax eval batches

* remove redundant imports

7490a97c

Generalize decay_mask_fn to apply mask to all LayerNorm params (#18273) · 170fcaa6
Duong A. Nguyen authored Jul 27, 2022
```
* generalize decay_mask_fn to find all layernorm params

* fixup

* generalising decay_mask_fn
```
170fcaa6

19 Jul, 2022 1 commit
- Remove use_auth_token from the from_config method (#18192) · 4bea6584
  Duong A. Nguyen authored Jul 19, 2022
```
* remove use_auth_token from from_config

* restore use_auth_token from_pretrained run_t5_mlm_flax
```
  4bea6584
11 Jul, 2022 1 commit

Fix RESOURCE_EXHAUSTED error when dealing with large datasets in Flax example scripts (#18069) · 1e8140ca

Duong A. Nguyen authored Jul 11, 2022

* Fix RESOURCE_EXHAUSTED error for large datasets on Flax example scripts

* using np.permutation for creating batch_idx

* train_samples_idx -> training_samples_idx

* fix type hints

1e8140ca

07 Jun, 2022 1 commit

Add examples telemetry (#17552) · 3cab9027

Sylvain Gugger authored Jun 07, 2022

* Add examples telemetry

* Alternative approach

* Add to all other examples

* Add to templates as well

* Put framework separately

* Same for TensorFlow

3cab9027

12 May, 2022 1 commit

Black preview (#17217) · afe5d42d

Sylvain Gugger authored May 12, 2022

* Black preview

* Fixup too!

* Fix check copies

* Use the same version as the CI

* Bump black

afe5d42d

11 Apr, 2022 1 commit

Fix t5 shard on TPU Pods (#16527) · 5e686757

Ahmed Elnaggar authored Apr 11, 2022



* Fix t5 shard on TPU Pods

The current script doesn't work properly on a TPU pod because the global batch is not divided correctly per host.
This pull request fixes this issue by dividing the global batch to each host before it is shared on each host.

* fix style
Co-authored-by: ahmed-elnaggar <ahmed.elnaggar@allianz.com>

5e686757

04 Apr, 2022 1 commit
- Add use_auth to load_datasets for private datasets to PT and TF examples (#16521) · 24a85cca
  Karim Foda authored Apr 04, 2022
```
* fix formatting and remove use_auth

* Add use_auth_token to Flax examples
```
  24a85cca
28 Mar, 2022 1 commit
- Update run_t5_mlm_flax.py (#16421) · 8049dfa4
  Yongrae Jo authored Mar 28, 2022
```
Fix typo in comment: proprocessed -> preprocessed
```
  8049dfa4
23 Mar, 2022 1 commit

Reorganize file utils (#16264) · 4975002d

Sylvain Gugger authored Mar 23, 2022

* Split file_utils in several submodules

* Fixes

* Add back more objects

* More fixes

* Who exactly decided to import that from there?

* Second suggestion to code with code review

* Revert wront move

* Fix imports

* Adapt all imports

* Adapt all imports everywhere

* Revert this import, will fix in a separate commit

4975002d

08 Mar, 2022 1 commit
- Speedup training by using numpy instead of jnp for batch shuffling (#15963) · 91fb62d0
  Yeb Havinga authored Mar 08, 2022
```
Speedup training by using numpy instead of jnp for batch shuffling
Co-authored-by: Yeb Havinga <y.t.havinga@mgrid.net>
```
  91fb62d0
04 Mar, 2022 1 commit
- [FlaxT5 Example] fix flax t5 example pretraining (#15835) · 10b76987
  Patrick von Platen authored Mar 04, 2022
  
  10b76987
13 Jan, 2022 1 commit
- [examples/flax/language-modeling] set loglevel (#15129) · 762416ff
  Stas Bekman authored Jan 13, 2022
  
  762416ff
12 Dec, 2021 1 commit
- [Flax examples] remove dependancy on pytorch training args (#14636) · 6a025487
  Suraj Patil authored Dec 12, 2021
```
* use custom training arguments

* update tests
```
  6a025487
06 Dec, 2021 1 commit

Add Flax example tests (#14599) · c5bd732a

Suraj Patil authored Dec 06, 2021

* add test for glue

* add tests for clm

* fix clm test

* add summrization tests

* more tests

* fix few tests

* add test for t5 mlm

* fix t5 mlm test

* fix tests for multi device

* cleanup

* ci job

* fix metric file name

* make t5 more robust

c5bd732a

29 Nov, 2021 1 commit
- Fix sentinel token IDs in data collator for Flax T5 pretraining script (#14477) · 8332327d
  Rahul Nadkarni authored Nov 29, 2021
  
  8332327d
22 Nov, 2021 1 commit

Switch from using sum for flattening lists of lists in group_texts (#14472) · 69e16abf

Nicholas Broad authored Nov 22, 2021



* remove sum for list flattening

* change to chain(*)

* make chain object a list

* delete empty lines

per sgugger's suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Nicholas Broad <nicholas@nmbroad.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

69e16abf

30 Sep, 2021 1 commit

[examples/flax] use Repository API for push_to_hub (#13672) · 7db2a79b

Suraj Patil authored Sep 30, 2021

* use Repository for push_to_hub

* update readme

* update other flax scripts

* update readme

* update qa example

* fix push_to_hub call

* fix typo

* fix more typos

* update readme

* use abosolute path to get repo name

* fix glue script

7db2a79b

06 Aug, 2021 1 commit

[Flax T5] Speed up t5 training (#13012) · 2e408236

Patrick von Platen authored Aug 06, 2021



* fix_torch_device_generate_test

* remove @

* update

* up

* fix

* remove f-stings

* correct readme

* up
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

2e408236

20 Jul, 2021 1 commit

Flax MLM: Allow validation split when loading dataset from local file (#12689) · 66197adc

fgaim authored Jul 20, 2021

* Allow validation split when loading dataset from local file

* Flax clm & t5, enable validation split for datasets loaded from local file

66197adc

13 Jul, 2021 1 commit

Add ByT5 option to example run_t5_mlm_flax.py (#12634) · 5803a2a7

Nick Doiron authored Jul 13, 2021

* Allow ByT5 type in Flax T5 script

* use T5TokenizerFast

* change up tokenizer config

* model_args

* reorder imports

* Update run_t5_mlm_flax.py

5803a2a7

09 Jul, 2021 1 commit
- [Flax] Fix cur step flax examples (#12608) · deecdd49
  Patrick von Platen authored Jul 09, 2021
```
* fix_torch_device_generate_test

* remove @

* fix save problem
```
  deecdd49
08 Jul, 2021 1 commit
- Fix group_lengths for short datasets (#12558) · 6f1adc43
  Sylvain Gugger authored Jul 08, 2021
  
  6f1adc43
07 Jul, 2021 1 commit
- Remove logging of GPU count etc logging. (#12569) · 122d7dc3
  Ibraheem Moosa authored Jul 08, 2021
```
Successfully logging this requires Pytorch. For the purposes of this script we are not using Pytorch.
```
  122d7dc3