Commits · fd6970bc56afa5a011ab9b839f73fae1c9e72625 · chenpangpang / transformers

08 May, 2023 1 commit
- Skip failing test · fd6970bc
  Sylvain Gugger authored May 08, 2023
  
  fd6970bc
05 May, 2023 1 commit

Add `no_trainer` scripts to pre-train Vision Transformers (#23156) · fc6c8b0e

Ashwin Mathur authored May 05, 2023



* Add run_mim_no_trainer.py draft from #20412

Add parse_args method and copy over other dependencies

Add Method call for sending telemetry

Initialize Accelerator

Make one log on every process

Set seed and Handle repository creation

Initialize dataset and Set validation split

Create Config

Adapt Config

Update Config

Create Feature Extractor

Create model

Set column names

Create transforms

Create mask generator

Create method to preprocess images

Shuffle datasets if needed and set transforms

Create Dataloaders

Add optimizer

Add learning rate scheduler

Prepare everything with our accelerator

Tie weights for TPU training

Recalculate training steps and training epochs

Set accelerator checkpointing steps

Initialize trackers and store configuration

Set total batch size

Fix typo: mlm -> mim

Log info at the start of training

Load in the weights and states from previous save

update the progress_bar if load from checkpoint

Define train loop

Add evaluation loop to training

Add to parse_args method

Push repo to hub

Save accelerator state

End training and save model and feature extractor

Remove unused imports

Fix trailing whitespace

* Update code based on comments, Rename feature_extractor to image_processor

* Fix linting

* Add argument for learning rate

* Add argument for setting number of training epochs

* Remove incorrect logger argument

* Convert max_train_steps to int for tqdm

---------
Co-authored-by: Saad Mahmud <shuvro.mahmud79@gmail.com>

fc6c8b0e

03 May, 2023 1 commit
- Tidy Pytorch GLUE benchmark example (#23134) · b6933d76
  Robert Stone authored May 03, 2023
```
Migration to Evaluate for metric is not quite complete
```
  b6933d76
02 May, 2023 3 commits

num_noise_spans should be <= num_items #22246 (#22938) · 805db1fe
Alex Punnen authored May 02, 2023

805db1fe

Save the tokenizer and image preprocessor after training a model with the... · bcedd0a4

regisss authored May 02, 2023

Save the tokenizer and image preprocessor after training a model with the contrastive image-text example (#23035)

Save tokenizer and image preprocessor

bcedd0a4

Bump flask from 2.0.3 to 2.3.2 in /examples/research_projects/decision_transformer (#23094) · b8648290

dependabot[bot] authored May 01, 2023

Bump flask in /examples/research_projects/decision_transformer

Bumps [flask](https://github.com/pallets/flask) from 2.0.3 to 2.3.2.
- [Release notes](https://github.com/pallets/flask/releases)
- [Changelog](https://github.com/pallets/flask/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/flask/compare/2.0.3...2.3.2

)

---
updated-dependencies:
- dependency-name: flask
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

b8648290

25 Apr, 2023 1 commit
- Avoid invalid escape sequences, use raw strings (#22936) · 54272503
  Lingepumpe authored Apr 25, 2023
```
* Avoid invalid escape sequences, use raw strings

* Integrate PR feedback
```
  54272503
21 Apr, 2023 1 commit
- Remove broken test_data symlink in legacy s2s examples (#22876) · 874c7caf
  Roy Hvaara authored Apr 21, 2023
  
  874c7caf
20 Apr, 2023 1 commit
- [Examples/TensorFlow] minor refactoring to allow compatible datasets to work (#22879) · 4116d1ec
  Sayak Paul authored Apr 20, 2023
```
minor refactoring to allow compatible datasets to work.
```
  4116d1ec
17 Apr, 2023 2 commits
- Remove accelerate from tf test reqs (#22777) · cd3e0211
  Zachary Mueller authored Apr 17, 2023
```
Remove accelerate from tf
```
  cd3e0211
- Fix sneaky torch dependency in TF example (#22804) · 2237127a
  Matt authored Apr 17, 2023
  
  2237127a
14 Apr, 2023 1 commit

[Examples] TPU-based training of a language model using TensorFlow (#21657) · 390e121f

Sayak Paul authored Apr 14, 2023



* add: tokenizer training script for TF TPU LM training.

* add: script for preparing the TFRecord shards.

* add: sequence of execution to readme.

* remove limit from the tfrecord shard name.

* Add initial train_model.py

* Add basic training arguments and model init

* Get up to the point of writing the data collator

* Pushing progress so far!

* Complete first draft of model training code

* feat: grouping of texts efficiently.
Co-authored-by: Matt <rocketknight1@gmail.com>

* Add proper masking collator and get training loop working

* fix: things.

* Read sample counts from filenames

* Read sample counts from filenames

* Draft README

* Improve TPU warning

* Use distribute instead of distribute.experimental

* Apply suggestions from code review
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Modularize loading and add MLM probability as arg

* minor refactoring to better use the cli args.

* readme fillup.

* include tpu and inference sections in the readme.

* table of contents.

* parallelize maps.

* polish readme.

* change script name to run_mlm.py

* address PR feedback (round I).

---------
Co-authored-by: Matt <rocketknight1@gmail.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

390e121f

13 Apr, 2023 1 commit
- v4.29.0.dev0 · 888c4a2a
  Sylvain Gugger authored Apr 12, 2023
  
  888c4a2a
11 Apr, 2023 1 commit
- Replace -100s in predictions by the pad token (#22693) · 1b1867d8
  Sylvain Gugger authored Apr 11, 2023
```
* Replace -100s in predictions by the pad token

* Style

* Try to catch them all
```
  1b1867d8
05 Apr, 2023 1 commit

Sync preprocesses before loading the processor at run_speech_recognition_ctc.py (#21926) · d5239bab

Mikel Penagarikano authored Apr 05, 2023

* Update run_speech_recognition_ctc.py

Make sure all processes wait until data is saved before loading the processor from the output_dit

* Make sure all processes wait until data is saved before loading the processor from the output_dit

* Update run_speech_recognition_ctc.py

* Update run_speech_recognition_seq2seq.py

d5239bab

04 Apr, 2023 1 commit
- Add id2label and label2id to model's config in run_xnil (#22558) · 98268b2e
  Maziyar Panahi authored Apr 04, 2023
```
Add id2label and label2id to config in run_xnil
```
  98268b2e
31 Mar, 2023 1 commit

Bump redis from 4.5.3 to 4.5.4 in /examples/research_projects/decision_transformer (#22494) · 6fc44656

dependabot[bot] authored Mar 31, 2023

Bump redis in /examples/research_projects/decision_transformer

Bumps [redis](https://github.com/redis/redis-py) from 4.5.3 to 4.5.4.
- [Release notes](https://github.com/redis/redis-py/releases)
- [Changelog](https://github.com/redis/redis-py/blob/master/CHANGES)
- [Commits](https://github.com/redis/redis-py/compare/v4.5.3...v4.5.4

)

---
updated-dependencies:
- dependency-name: redis
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

6fc44656

29 Mar, 2023 1 commit
- Update Neptune docs (#22452) · 173193cc
  Sabine authored Mar 29, 2023
  
  173193cc
28 Mar, 2023 1 commit

Bump redis from 4.1.4 to 4.5.3 in /examples/research_projects/decision_transformer (#22410) · 32ff0640

dependabot[bot] authored Mar 27, 2023

Bump redis in /examples/research_projects/decision_transformer

Bumps [redis](https://github.com/redis/redis-py) from 4.1.4 to 4.5.3.
- [Release notes](https://github.com/redis/redis-py/releases)
- [Changelog](https://github.com/redis/redis-py/blob/master/CHANGES)
- [Commits](https://github.com/redis/redis-py/compare/v4.1.4...v4.5.3

)

---
updated-dependencies:
- dependency-name: redis
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

32ff0640

27 Mar, 2023 2 commits

Fix quality · 057e1d74
Sylvain Gugger authored Mar 27, 2023

057e1d74

Hardware Auto-Setup for Examples (#22319) · f02e3a2b

Donny Greenberg authored Mar 27, 2023

* Add initial remote hardware auto-setup docs

* Fix a few typos and clarify some language

* Add missing dependency

* Update self-hosted launch script with Sylvain's comments.

* Formatting.

* Trigger CI

* Style

f02e3a2b

24 Mar, 2023 2 commits
- TensorFlow: pin maximum version to 2.12 (#22364) · 88dae78f
  Joao Gante authored Mar 24, 2023
  
  88dae78f
- Pin tensorflow-text to go with tensorflow (#22362) · 6587125c
  Sylvain Gugger authored Mar 24, 2023
```
* Pin tensorflow-text to go with tensorflow

* Make it more convenient to pin TensorFlow

* setup don't like f-strings
```
  6587125c
23 Mar, 2023 1 commit
- Fix quality due to ruff release · ef28df05
  Sylvain authored Mar 22, 2023
  
  ef28df05
22 Mar, 2023 3 commits

fix: Allow only test_file in pytorch and flax summarization (#22293) · 8e6c34b3
Connor Henderson authored Mar 22, 2023
```
allow only test_file in pytorch and flax summarization
```
8e6c34b3

add low_cpu_mem_usage option in run_clm.py example which will benefit… (#22288) · 4ccaf268

Wang, Yi authored Mar 22, 2023



* add low_cpu_mem_usage option in run_clm.py example which will benefit LLM loading
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* update all the example and README under language-modeling
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

---------
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

4ccaf268

Enable traced model for text-generation task (#22265) · 8472a224
jiqing-feng authored Mar 22, 2023

8472a224

14 Mar, 2023 1 commit
- v4.28.0.dev0 · ebdb185b
  Sylvain Gugger authored Mar 14, 2023
  
  ebdb185b
08 Mar, 2023 1 commit

[examples/speech-recognition] Add SpecAugment to run_speech_recognition_seq2seq.py (#21942) · 6192549c

bofeng huang authored Mar 08, 2023



* Add specaugment to run_speech_recognition_seq2seq.py

* Remove useless argument: text_column

* Fix quality

* Update return_attention_mask condition

* Update specaugment arguments only for whisper models

* Remove SpecAugment arguments from ModelArguments, only leave default values for simplicity

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Update apply_spec_augment only for whisper models

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Rename return_attention_mask to forward_attention_mask to avoid confusion with wav2vec2 models

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

6192549c

07 Mar, 2023 1 commit
- Stop requiring Torch for our TF examples! (#21997) · d128f2ff
  Matt authored Mar 07, 2023
```
* Stop requiring Torch for our TF examples!

* Slight tweak to logging in the example itself
```
  d128f2ff
06 Mar, 2023 1 commit

Add TF contrastive image text finetuning example (#21939) · 5d8efc79

Matt authored Mar 06, 2023

* Initial commit

* stash commit

* Add model checkpointing and pushing

* Fix model name inference

* Update README

* Update README

* Remove a couple of Torch references

* Update copyright date

* make fixup

* Update PushToHubCallback args!

* Remove the torch summary

* Add strategy.scope

5d8efc79

01 Mar, 2023 1 commit
- Add check for different embedding types in examples (#21881) · 1d3a1cc4
  Matt authored Mar 01, 2023
```
* Add check for different embedding types in examples

* Correctly update summarization example
```
  1d3a1cc4
27 Feb, 2023 1 commit

[examples/summarization] deal with `max_length` and `num_beams` (#21740) · 3c0ce608

bofeng huang authored Feb 27, 2023

* Override the decoding parameters of Seq2SeqTrainer

* Fix quality

* Fix max_length parameter

* Fix quality

* Remove redundant parameter max_length

* Separate the preprocess of train and validation to use different max_target_length

3c0ce608

24 Feb, 2023 1 commit
- [Examples] Generalise run audio classification for log-mel models (#21756) · 13489248
  Sanchit Gandhi authored Feb 24, 2023
```
* [Examples] Generalise run audio classification for log-mel models

* batch feature extractor

* make style
```
  13489248
22 Feb, 2023 2 commits
- Respect documentation on passive log level (#21700) · b19d64d8
  Sylvain Gugger authored Feb 22, 2023
```
* Respect documentation on passive log level

* Fix test and set log level in examples

* Add doc
```
  b19d64d8
- Apply ruff flake8-comprehensions (#21694) · 5e8c8eb5
  Aaron Gokaslan authored Feb 22, 2023
  
  5e8c8eb5
20 Feb, 2023 1 commit
- Fix-rag-finetune-project-requirement (#21697) · 4194e5f4
  Arthur authored Feb 20, 2023
```
pin pytorch lightning requirement
```
  4194e5f4
16 Feb, 2023 2 commits

Bump werkzeug from 2.0.3 to 2.2.3 in /examples/research_projects/decision_transformer (#21658) · fcfd4ec7

dependabot[bot] authored Feb 16, 2023

Bump werkzeug in /examples/research_projects/decision_transformer

Bumps [werkzeug](https://github.com/pallets/werkzeug) from 2.0.3 to 2.2.3.
- [Release notes](https://github.com/pallets/werkzeug/releases)
- [Changelog](https://github.com/pallets/werkzeug/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/werkzeug/compare/2.0.3...2.2.3

)

---
updated-dependencies:
- dependency-name: werkzeug
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

fcfd4ec7

Fix typos in contrastive-image-text example README (#21665) · 751f17aa
regisss authored Feb 16, 2023

751f17aa

13 Feb, 2023 1 commit
- Add missing arguemtn to run_clip.py (#21588) · fd5320bb
  Warren Green authored Feb 13, 2023
  
  fd5320bb