Commits · a0c0a7823393f7276f835fef815eabcadd8eaf64 · chenpangpang / transformers

09 May, 2023 2 commits
- v4.30.0.dev0 · a0c0a782
  Sylvain Gugger authored May 09, 2023
  
  a0c0a782
- fix: Update run_qa.py to work with deepset/germanquad (#23225) · 1a8f6111
  Sebastian authored May 09, 2023
```
Call str on id to make sure any ints are converted into the expected format for squad datasets
```
  1a8f6111
05 May, 2023 1 commit

Add `no_trainer` scripts to pre-train Vision Transformers (#23156) · fc6c8b0e

Ashwin Mathur authored May 05, 2023



* Add run_mim_no_trainer.py draft from #20412

Add parse_args method and copy over other dependencies

Add Method call for sending telemetry

Initialize Accelerator

Make one log on every process

Set seed and Handle repository creation

Initialize dataset and Set validation split

Create Config

Adapt Config

Update Config

Create Feature Extractor

Create model

Set column names

Create transforms

Create mask generator

Create method to preprocess images

Shuffle datasets if needed and set transforms

Create Dataloaders

Add optimizer

Add learning rate scheduler

Prepare everything with our accelerator

Tie weights for TPU training

Recalculate training steps and training epochs

Set accelerator checkpointing steps

Initialize trackers and store configuration

Set total batch size

Fix typo: mlm -> mim

Log info at the start of training

Load in the weights and states from previous save

update the progress_bar if load from checkpoint

Define train loop

Add evaluation loop to training

Add to parse_args method

Push repo to hub

Save accelerator state

End training and save model and feature extractor

Remove unused imports

Fix trailing whitespace

* Update code based on comments, Rename feature_extractor to image_processor

* Fix linting

* Add argument for learning rate

* Add argument for setting number of training epochs

* Remove incorrect logger argument

* Convert max_train_steps to int for tqdm

---------
Co-authored-by: Saad Mahmud <shuvro.mahmud79@gmail.com>

fc6c8b0e

03 May, 2023 1 commit
- Tidy Pytorch GLUE benchmark example (#23134) · b6933d76
  Robert Stone authored May 03, 2023
```
Migration to Evaluate for metric is not quite complete
```
  b6933d76
02 May, 2023 1 commit

Save the tokenizer and image preprocessor after training a model with the... · bcedd0a4

regisss authored May 02, 2023

Save the tokenizer and image preprocessor after training a model with the contrastive image-text example (#23035)

Save tokenizer and image preprocessor

bcedd0a4

13 Apr, 2023 1 commit
- v4.29.0.dev0 · 888c4a2a
  Sylvain Gugger authored Apr 12, 2023
  
  888c4a2a
11 Apr, 2023 1 commit
- Replace -100s in predictions by the pad token (#22693) · 1b1867d8
  Sylvain Gugger authored Apr 11, 2023
```
* Replace -100s in predictions by the pad token

* Style

* Try to catch them all
```
  1b1867d8
05 Apr, 2023 1 commit

Sync preprocesses before loading the processor at run_speech_recognition_ctc.py (#21926) · d5239bab

Mikel Penagarikano authored Apr 05, 2023

* Update run_speech_recognition_ctc.py

Make sure all processes wait until data is saved before loading the processor from the output_dit

* Make sure all processes wait until data is saved before loading the processor from the output_dit

* Update run_speech_recognition_ctc.py

* Update run_speech_recognition_seq2seq.py

d5239bab

04 Apr, 2023 1 commit
- Add id2label and label2id to model's config in run_xnil (#22558) · 98268b2e
  Maziyar Panahi authored Apr 04, 2023
```
Add id2label and label2id to config in run_xnil
```
  98268b2e
29 Mar, 2023 1 commit
- Update Neptune docs (#22452) · 173193cc
  Sabine authored Mar 29, 2023
  
  173193cc
23 Mar, 2023 1 commit
- Fix quality due to ruff release · ef28df05
  Sylvain authored Mar 22, 2023
  
  ef28df05
22 Mar, 2023 3 commits

fix: Allow only test_file in pytorch and flax summarization (#22293) · 8e6c34b3
Connor Henderson authored Mar 22, 2023
```
allow only test_file in pytorch and flax summarization
```
8e6c34b3

add low_cpu_mem_usage option in run_clm.py example which will benefit… (#22288) · 4ccaf268

Wang, Yi authored Mar 22, 2023



* add low_cpu_mem_usage option in run_clm.py example which will benefit LLM loading
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* update all the example and README under language-modeling
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

---------
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

4ccaf268

Enable traced model for text-generation task (#22265) · 8472a224
jiqing-feng authored Mar 22, 2023

8472a224

14 Mar, 2023 1 commit
- v4.28.0.dev0 · ebdb185b
  Sylvain Gugger authored Mar 14, 2023
  
  ebdb185b
08 Mar, 2023 1 commit

[examples/speech-recognition] Add SpecAugment to run_speech_recognition_seq2seq.py (#21942) · 6192549c

bofeng huang authored Mar 08, 2023



* Add specaugment to run_speech_recognition_seq2seq.py

* Remove useless argument: text_column

* Fix quality

* Update return_attention_mask condition

* Update specaugment arguments only for whisper models

* Remove SpecAugment arguments from ModelArguments, only leave default values for simplicity

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Update apply_spec_augment only for whisper models

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Rename return_attention_mask to forward_attention_mask to avoid confusion with wav2vec2 models

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

6192549c

27 Feb, 2023 1 commit

[examples/summarization] deal with `max_length` and `num_beams` (#21740) · 3c0ce608

bofeng huang authored Feb 27, 2023

* Override the decoding parameters of Seq2SeqTrainer

* Fix quality

* Fix max_length parameter

* Fix quality

* Remove redundant parameter max_length

* Separate the preprocess of train and validation to use different max_target_length

3c0ce608

24 Feb, 2023 1 commit
- [Examples] Generalise run audio classification for log-mel models (#21756) · 13489248
  Sanchit Gandhi authored Feb 24, 2023
```
* [Examples] Generalise run audio classification for log-mel models

* batch feature extractor

* make style
```
  13489248
22 Feb, 2023 2 commits
- Respect documentation on passive log level (#21700) · b19d64d8
  Sylvain Gugger authored Feb 22, 2023
```
* Respect documentation on passive log level

* Fix test and set log level in examples

* Add doc
```
  b19d64d8
- Apply ruff flake8-comprehensions (#21694) · 5e8c8eb5
  Aaron Gokaslan authored Feb 22, 2023
  
  5e8c8eb5
16 Feb, 2023 1 commit
- Fix typos in contrastive-image-text example README (#21665) · 751f17aa
  regisss authored Feb 16, 2023
  
  751f17aa
13 Feb, 2023 1 commit
- Add missing arguemtn to run_clip.py (#21588) · fd5320bb
  Warren Green authored Feb 13, 2023
  
  fd5320bb
10 Feb, 2023 1 commit
- Add _mp_fn to run_mae.py for XLA testing (#21551) · c88b11c5
  steventk-g authored Feb 10, 2023
```
Update run_mae.py
```
  c88b11c5
09 Feb, 2023 1 commit

fix typo in run_speech_recognition_ctc.py (#21528) · b31cee67

lee1jun authored Feb 09, 2023

Update run_speech_recognition_ctc.py

There should be `# limitations under the License` line at the end of the documentation section.

b31cee67

08 Feb, 2023 1 commit
- [Doc] Minor URL fixes in PyTorch Text Classification Readme (#21511) · d3046dad
  Stefan Schweter authored Feb 08, 2023
```
docs: fix some references in PyTorch text classification readme
```
  d3046dad
07 Feb, 2023 1 commit
- :pen: fix typo in pytorch semantic segmentation readme (#21492) · bbe98ea9
  Jeroen Van Der Donckt authored Feb 07, 2023
  
  bbe98ea9
06 Feb, 2023 2 commits

Update quality tooling for formatting (#21480) · 6f79d264

Sylvain Gugger authored Feb 06, 2023

* Result of black 23.1

* Update target to Python 3.7

* Switch flake8 to ruff

* Configure isort

* Configure isort

* Apply isort with line limit

* Put the right black version

* adapt black in check copies

* Fix copies

6f79d264

[examples] improve block_size warning message (#21463) · 3b9a1dc1
Stas Bekman authored Feb 06, 2023

3b9a1dc1

31 Jan, 2023 1 commit
- Simplify column_names in run_clm/mlm (#21382) · 074d6b75
  Quentin Lhoest authored Jan 31, 2023
```
* simplify column_names in run_clm

* simplify column_names in run_mlm

* minor
```
  074d6b75
30 Jan, 2023 1 commit

[`run_(clm|mlm).py` examples] add streaming dataset support (#21343) · 98d88b23

Stas Bekman authored Jan 30, 2023

* [run_clm example] add streaming dataset support

* unrefactor kwargs

* fix

* fix

* require datasets>=2.0.0

* port to mlm

98d88b23

23 Jan, 2023 2 commits
- v4.27.0.dev0 · 7119bb05
  Sylvain Gugger authored Jan 23, 2023
  
  7119bb05
- Add scikit-learn dependency to train langage-modeling (#21229) · 5603f78f
  Mostafa Elhoushi authored Jan 23, 2023
  
  5603f78f
19 Jan, 2023 1 commit
- Update examples with image processors (#21155) · 4bc18e7a
  amyeroberts authored Jan 19, 2023
```
* Update examples to use image processors

* Small fixes

* Resolve conflicts
```
  4bc18e7a
18 Jan, 2023 1 commit

Adapt repository creation to latest hf_hub (#21158) · 05e72aa0

Sylvain Gugger authored Jan 18, 2023

* Adapt repository creation to latest hf_hub

* Update all examples

* Fix other tests, add Flax examples

* Address review comments

05e72aa0

06 Jan, 2023 1 commit
- Fix arguments passed to predict function in QA Seq2seq training script (#21026) · ff8dcb5e
  Observer46 authored Jan 06, 2023
```
fix args passed to predict function
```
  ff8dcb5e
05 Jan, 2023 2 commits

[NumPy] Remove references to deprecated NumPy type aliases (#21022) · 35a7052b

Roy Hvaara authored Jan 05, 2023



[NumPy] Remove references to deprecated NumPy type aliases.

This change replaces references to a number of deprecated NumPy type aliases (np.bool, np.int, np.float, np.complex, np.object, np.str) with their recommended replacement (bool, int, float, complex, object, str).

NumPy 1.24 drops the deprecated aliases, so we must remove uses before updating NumPy.
Co-authored-by: Peter Hawkins <phawkins@google.com>
Co-authored-by: Peter Hawkins <phawkins@google.com>

35a7052b

Added mask_time_prob and mask_time_length arguments to wav2vec2 pretraining script (#20985) · 1d21471c
Magnus Pierrau authored Jan 05, 2023
```
Added mask_time_prob and mask_time_length arguments to wav2vec2 pretraining script and readme - new branch
```
1d21471c

03 Jan, 2023 1 commit

[run_clm example] add torch_dtype option for model load. (#20971) · 9c9fe89f

Wang, Yi authored Jan 03, 2023



* [run_clm example] add torch_dtype option for model load.
for BLOOM 175B model. peak memory will reduce about 350G for inference. the weight of BLOOM in model hub is bfloat16
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* add other type in option

* fix style
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

9c9fe89f

24 Dec, 2022 1 commit
- Fixes typo in the help text for --max_length (#20883) · 3830b3f7
  Márton Makrai authored Dec 24, 2022
  
  3830b3f7
21 Dec, 2022 1 commit

[Examples] Update big table (#20845) · d87e381f

NielsRogge authored Dec 21, 2022



Update big table
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain>

d87e381f