Commits · e19c12e094678dd0b355c1bdd529d35abcb7b34c · chenpangpang / transformers

30 Jan, 2024 1 commit

Add tf_keras imports to prepare for Keras 3 (#28588) · 415e9a09

Matt authored Jan 30, 2024

* Port core files + ESM (because ESM code is odd)

* Search-replace in modelling code

* Fix up transfo_xl as well

* Fix other core files + tests (still need to add correct import to tests)

* Fix cookiecutter

* make fixup, fix imports in some more core files

* Auto-add imports to tests

* Cleanup, add imports to sagemaker tests

* Use correct exception for importing tf_keras

* Fixes in modeling_tf_utils

* make fixup

* Correct version parsing code

* Ensure the pipeline tests correctly revert to float32 after each test

* Ensure the pipeline tests correctly revert to float32 after each test

* More tf.keras -> keras

* Add dtype cast

* Better imports of tf_keras

* Add a cast for tf.assign, just in case

* Fix callback imports

415e9a09

29 Jan, 2024 1 commit
- Fix input data file extension in examples (#28741) · 39fa4009
  Klaus Hipp authored Jan 29, 2024
  
  39fa4009
19 Jan, 2024 1 commit
- v4.38.dev.0 · b2748a6e
  Amy Roberts authored Jan 19, 2024
  
  b2748a6e
12 Jan, 2024 1 commit
- TF: purge `TFTrainer` (#28483) · 4fb3d3a0
  Joao Gante authored Jan 12, 2024
  
  4fb3d3a0
11 Jan, 2024 1 commit

Set `cache_dir` for `evaluate.load()` in example scripts (#28422) · 95091e15

Alex Hedges authored Jan 11, 2024

While using `run_clm.py`,[^1] I noticed that some files were being added
to my global cache, not the local cache. I set the `cache_dir` parameter
for the one call to `evaluate.load()`, which partially solved the
problem. I figured that while I was fixing the one script upstream, I
might as well fix the problem in all other example scripts that I could.

There are still some files being added to my global cache, but this
appears to be a bug in `evaluate` itself. This commit at least moves
some of the files into the local cache, which is better than before.

To create this PR, I made the following regex-based transformation:
`evaluate\.load\((.*?)\)` -> `evaluate\.load\($1,
cache_dir=model_args.cache_dir\)`. After using that, I manually fixed
all modified files with `ruff` serving as useful guidance. During the
process, I removed one existing usage of the `cache_dir` parameter in a
script that did not have a corresponding `--cache-dir` argument
declared.

[^1]: I specifically used `pytorch/language-modeling/run_clm.py` from
v4.34.1 of the library. For the original code, see the following URL:
https://github.com/huggingface/transformers/tree/acc394c4f5e1283c19783581790b3dc3105a3697/examples/pytorch/language-modeling/run_clm.py.

95091e15

13 Dec, 2023 1 commit
- Dev version · 3ed3e319
  Lysandre authored Dec 13, 2023
  
  3ed3e319
17 Nov, 2023 1 commit
- Broken links fixed related to datasets docs (#27569) · ffbcfc01
  V.Prasanna kumar authored Nov 18, 2023
```
fixed the broken links belogs to dataset library of transformers
```
  ffbcfc01
16 Nov, 2023 1 commit
- Update the TF pin for 2.15 (#27375) · 4989e73e
  Matt authored Nov 16, 2023
```
* Move the TF pin for 2.15

* make fixup
```
  4989e73e
15 Nov, 2023 1 commit

Incorrect setting for num_beams in translation and summarization examples (#27519) · 2e72bbab

Matt authored Nov 15, 2023



* Remove the torch main_process_first context manager from TF examples

* Correctly set num_beams=1 in our examples, and add a guard in GenerationConfig.validate()

* Update src/transformers/generation/configuration_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

2e72bbab

02 Nov, 2023 1 commit
- Dev version · bc78fd12
  Lysandre authored Nov 02, 2023
  
  bc78fd12
27 Oct, 2023 1 commit
- Provide alternative when warning on use_auth_token (#27105) · 66b088fa
  Lucain authored Oct 27, 2023
  
  66b088fa
19 Oct, 2023 1 commit

Pin Keras for now (#26904) · cbd278f0

Matt authored Oct 19, 2023

* Pin Keras for now out of paranoia

* Add the keras pin to _tests_requirements.txt too

* Make sure the Keras version matches the TF one

* make fixup

cbd278f0

12 Oct, 2023 1 commit
- Add many missing spaces in adjacent strings (#26751) · 40ea9ab2
  Tom Aarsen authored Oct 12, 2023
```
Add missing spaces in adjacent strings
```
  40ea9ab2
04 Oct, 2023 1 commit

refactor: change default block_size (#26229) · 6015f91a

Phuc Van Phan authored Oct 04, 2023

* refactor: change default block_size

* fix: return tf to origin

* fix: change files to origin

* rebase

* rebase

* rebase

* rebase

* rebase

* rebase

* rebase

* rebase

* refactor: add min block_size to files

* reformat: add min block_size for run_clm tf

6015f91a

03 Oct, 2023 1 commit
- v4.35.0.dev0 · bd620591
  Lysandre authored Oct 03, 2023
  
  bd620591
18 Sep, 2023 1 commit
- Update README.md (#26198) · 7d4e0c23
  Nino Risteski authored Sep 19, 2023
```
Fixed a few typos
```
  7d4e0c23
11 Sep, 2023 2 commits
- docs: add space to docs (#26067) · 5af2c626
  Phuc Van Phan authored Sep 12, 2023
```
* docs: add space to docs

* docs: remove reduntant space
```
  5af2c626
- docs: update link huggingface map (#26077) · 9cebae64
  Phuc Van Phan authored Sep 11, 2023
  
  9cebae64
04 Sep, 2023 1 commit
- v4.34.dev.0 · d8e13b3e
  Lysandre authored Sep 04, 2023
  
  d8e13b3e
01 Sep, 2023 1 commit
- Revert frozen training arguments (#25903) · be0e189b
  Zach Mueller authored Sep 01, 2023
```
* Revert frozen training arguments

* TODO
```
  be0e189b
22 Aug, 2023 1 commit

TF 2.14 compatibility (#25630) · 62396cff

Matt authored Aug 22, 2023

* Update the TF pin and see if anything breaks

* make fixup

* make fixup

* make fixup

62396cff

21 Aug, 2023 1 commit
- v4.33.0.dev0 · 5c67682b
  Sylvain Gugger authored Aug 21, 2023
  
  5c67682b
15 Aug, 2023 1 commit

Make training args fully immutable (#25435) · ca514992

Zach Mueller authored Aug 15, 2023

* Make training args fully immutable

* Working tests, PyTorch

* In test_trainer

* during testing

* Use proper dataclass way

* Fix test

* Another one

* Fix tf

* Lingering slow

* Exception

* Clean

ca514992

08 Aug, 2023 1 commit

Fix missing usage of `token` (#25382) · 9c7b7447

Yih-Dar authored Aug 08, 2023



* add missing tokens

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

9c7b7447

07 Aug, 2023 1 commit

Allow `trust_remote_code` in example scripts (#25248) · 14510938

Jackmin801 authored Aug 07, 2023

* pytorch examples

* pytorch mim no trainer

* cookiecutter

* flax examples

* missed line in pytorch run_glue

* tensorflow examples

* tensorflow run_clip

* tensorflow run_mlm

* tensorflow run_ner

* tensorflow run_clm

* pytorch example from_configs

* pytorch no trainer examples

* Revert "tensorflow run_clip"

This reverts commit 261f86ac1f1c9e05dd3fd0291e1a1f8e573781d5.

* fix: duplicated argument

14510938

02 Aug, 2023 1 commit

Add `token` arugment in example scripts (#25172) · 149cb0cc

Yih-Dar authored Aug 02, 2023



* fix

* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

149cb0cc

28 Jul, 2023 1 commit

Update `use_auth_token` -> `token` in example scripts (#25167) · d53b8ad7

Yih-Dar authored Jul 28, 2023



* pytorch examples

* tensorflow examples

* flax examples

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

d53b8ad7

17 Jul, 2023 1 commit
- 4.32.0.dev0 · e9ad5130
  Sylvain Gugger authored Jul 17, 2023
  
  e9ad5130
03 Jul, 2023 1 commit

Fix loading dataset docs link in run_translation.py example (#24594) · 4b26a616

Gema Parreño authored Jul 03, 2023



* fix loading dataset link

* Update examples/tensorflow/translation/run_translation.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Update examples/tensorflow/translation/run_translation.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

4b26a616

23 Jun, 2023 1 commit

Improved keras imports (#24448) · 8e164c54

Matt authored Jun 23, 2023

* An end to accursed version-specific imports

* No more K.is_keras_tensor() either

* Update dependency tables

* Use a cleaner call context function getter

* Add a cap to <2.14

* Add cap to examples requirements too

8e164c54

07 Jun, 2023 1 commit
- v4.31.0.dev0 · ba695c1e
  Sylvain Gugger authored Jun 07, 2023
  
  ba695c1e
02 Jun, 2023 1 commit

Add an option to reduce compile() console spam (#23938) · 167a0d8f

Matt authored Jun 02, 2023

* Add an option to reduce compile() console spam

* Add annotations to the example scripts

* Add notes to the quicktour docs as well

* minor fix

167a0d8f

26 May, 2023 1 commit
- Fix no such file or directory error (#23783) · e7242469
  Ran Ran authored May 26, 2023
```
* Fix no such file or directory error

* Address comment

* Fix formatting issue
```
  e7242469
09 May, 2023 2 commits

v4.30.0.dev0 · a0c0a782
Sylvain Gugger authored May 09, 2023

a0c0a782

Proposed fix for TF example now running on safetensors. (#23208) · c34a525d

Nicolas Patry authored May 09, 2023



* Proposed fix for TF example now running on safetensors.

* Adding more warnings and returning keys.

* Trigger CI

* Trigger CI

---------
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>

c34a525d

08 May, 2023 1 commit
- Skip failing test · fd6970bc
  Sylvain Gugger authored May 08, 2023
  
  fd6970bc
20 Apr, 2023 1 commit
- [Examples/TensorFlow] minor refactoring to allow compatible datasets to work (#22879) · 4116d1ec
  Sayak Paul authored Apr 20, 2023
```
minor refactoring to allow compatible datasets to work.
```
  4116d1ec
17 Apr, 2023 2 commits
- Remove accelerate from tf test reqs (#22777) · cd3e0211
  Zachary Mueller authored Apr 17, 2023
```
Remove accelerate from tf
```
  cd3e0211
- Fix sneaky torch dependency in TF example (#22804) · 2237127a
  Matt authored Apr 17, 2023
  
  2237127a
14 Apr, 2023 1 commit

[Examples] TPU-based training of a language model using TensorFlow (#21657) · 390e121f

Sayak Paul authored Apr 14, 2023



* add: tokenizer training script for TF TPU LM training.

* add: script for preparing the TFRecord shards.

* add: sequence of execution to readme.

* remove limit from the tfrecord shard name.

* Add initial train_model.py

* Add basic training arguments and model init

* Get up to the point of writing the data collator

* Pushing progress so far!

* Complete first draft of model training code

* feat: grouping of texts efficiently.
Co-authored-by: Matt <rocketknight1@gmail.com>

* Add proper masking collator and get training loop working

* fix: things.

* Read sample counts from filenames

* Read sample counts from filenames

* Draft README

* Improve TPU warning

* Use distribute instead of distribute.experimental

* Apply suggestions from code review
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Modularize loading and add MLM probability as arg

* minor refactoring to better use the cli args.

* readme fillup.

* include tpu and inference sections in the readme.

* table of contents.

* parallelize maps.

* polish readme.

* change script name to run_mlm.py

* address PR feedback (round I).

---------
Co-authored-by: Matt <rocketknight1@gmail.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

390e121f