Commits · ff8870350151091d3d8b2af4c1c0fa3ebcc1052a · chenpangpang / transformers

14 Mar, 2023 4 commits

Update 2 doctest expected values for torch 2.0.0 (#22148) · ff887035
Yih-Dar authored Mar 14, 2023
```
update values
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
ff887035

Alara Dirik authored Mar 14, 2023

* Add ConvNeXt V2 to transformers
* TF model is separated from the PR to fix issues

cdddfbff

Move `is_pipeline_test_to_skip` to specific model test classes (#21999) · 6c2ad00c

Yih-Dar authored Mar 14, 2023



* Move `is_pipeline_test_to_skip` to specific model test classes

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

6c2ad00c

[

🛠

️] Fix-whisper-breaking-changes (#21965) · 2beabd24

Arthur authored Mar 14, 2023



* temp fix

* temporary fix

* update

* fix tests

* fixup

* update based on reveiew
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* update to fix tests

* update docstring

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

2beabd24

13 Mar, 2023 25 commits

docs: New terms and updates to glossary (#21982) · 101a6cd2

MichaelRipa authored Mar 13, 2023



* Updated glossary with new terms, added abbreviations for certain terms and merged autoencoding models, autoregressive models and causal language modeling into encoder and decoder models

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Added link to 'Pipeline for inference' tutorial

* Trigger CI

* Update docs/source/en/glossary.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Added entry for self supervised learning, added deleted entries + fixed broken links

* Update docs/source/en/glossary.mdx
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

101a6cd2

Prepare daily CI for torch 2.0.0 (#22135) · ba9e0191
Yih-Dar authored Mar 13, 2023
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
ba9e0191

[Safetensors] Add explicit flag to from pretrained (#22083) · f780557a

Patrick von Platen authored Mar 13, 2023



* [Safetensors] Add explicit  flag to from pretrained

* add test

* remove @

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

f780557a

Remove backend check for torch.compile (#22140) · 3a35937e

Sylvain Gugger authored Mar 13, 2023



* Remove backend enforcment for torch.compile

* Update error

* Update src/transformers/training_args.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Style

---------
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

3a35937e

[deepspeed docs] Activation Checkpointing (#22099) · 618697ef

Stas Bekman authored Mar 13, 2023



* [deepspeed docs] Activation Checkpointing

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update deepspeed.mdx

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

618697ef

[trainer] fix bug in grad accum with multiple epochs (#22098) · 5b85add7
Stas Bekman authored Mar 13, 2023
```
* [trainer] fix bug in grad accum

* comment out debug

* fix one-off

* rename counter
```
5b85add7
Enforce same behavior as PyTorch 2.0 for older versions (#22136) · 1c801d65
Sylvain Gugger authored Mar 13, 2023

1c801d65
Trainer: let generate pick its inputs (#22108) · e16cbe88
Joao Gante authored Mar 13, 2023
```
* Let generate pick its inputs

* fix squad seq2seq example
```
e16cbe88

[`Whiper`] add `get_input_embeddings` to `WhisperForAudioClassification` (#22133) · d979cf6e

Younes Belkada authored Mar 13, 2023



* add `get_input_embeddings` to `WhisperForAudioClassification`

* add common tests

* fix another common test

* Update tests/models/whisper/test_modeling_whisper.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix style

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

d979cf6e

Update configuration_align.py (projected_dim=640) (#22139) · 98797237
bishmdl76 authored Mar 13, 2023
```
Update configuration_align.py

updated projected_dim=640 from 512 in arguments of AlignConfig
```
98797237
Add a new script to check model testers' config (#22063) · 54ee56b1
Yih-Dar authored Mar 13, 2023
```
* Add script

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
54ee56b1
Adding Type Hints to TF_Pegasus model (#21941) · a096eaca
mollerup23 authored Mar 13, 2023
```
* Adding Type Hints to TF_Pegasus model

* Updated some parameters per maintainer comments
```
a096eaca
Fix doc link for MGP-STR (#22138) · 6cb5132a
Sylvain Gugger authored Mar 13, 2023

6cb5132a

Zero-shot image classification task guide (#22132) · 8def252d

Maria Khalusova authored Mar 13, 2023



* WIP

* WIP

* manual inference example

* make style

* Apply suggestions from code review
Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com>

---------
Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com>

8def252d

Fix gradient checkpointing bug in trocr (#22126) · e61081e7

Karim Foda authored Mar 13, 2023



* Fix gradient checkpointing bug in trocr

* Fix format

* Update src/transformers/models/trocr/modeling_trocr.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

e61081e7

Fix gradient checkpointing bug in LongT5 (#22130) · ef74e7e7
Karim Foda authored Mar 13, 2023

ef74e7e7
Fix gradient checkpointing bug in xmod (#22129) · c1db6a3b
Karim Foda authored Mar 13, 2023

c1db6a3b
[`Blip2`] skip accelerate test (#22124) · 6652e7da
Younes Belkada authored Mar 13, 2023
```
skip accelerate test
```
6652e7da
Added big_models.mdx italian translation #17600 (#22115) · dd3a0580
Nicola Procopio authored Mar 13, 2023
```
* updated toctree

* italian translation big_model.mdx

* italian translation big_models
```
dd3a0580
Fix gradient checkpointing bug in xlm_roberta_xl (#22128) · 0768c5e2
Karim Foda authored Mar 13, 2023

0768c5e2
Fix gradient checkpointing bug in Trajectory Transformer (#22125) · 4c14c1f4
Karim Foda authored Mar 13, 2023

4c14c1f4
Fix gradient checkpointing bug in xglm (#22127) · d0876a09
Karim Foda authored Mar 13, 2023

d0876a09
Add pr_checks.mdx Italian translation (#17459) (#22116) · 0c883766
Alex Calabrese authored Mar 13, 2023
```
* Add pr_checks.mdx Italian translation (#17459)

* Updated pr_checks.mdx Italian translation (#17459)
```
0c883766

add new model of MGP-STR (#21418) · 102b5ff4

wangpeng authored Mar 13, 2023



* add new model of MGP-STR

* fix the check failings

* remove torch and numpy from mgp_tokenization

* remove unused import from modeling_mgp_str

* add test_processing_mgp_str

* rm test_processing_mgp_str.py

* add test_processing_mgp_str

* add test_processing_mgp_str

* add test_processing_mgp_str

* rm test_processing_mgp_str and add softmax outs to model

* rm test_processing_mgp_str and add softmax outs to model

* rewrite the code of mgp-str according to PR suggestions

* rewrite the code of mgp-str according to PR suggestions

* add new model of MGP-STR

* fix the check failings

* remove torch and numpy from mgp_tokenization

* remove unused import from modeling_mgp_str

* add test_processing_mgp_str

* rm test_processing_mgp_str.py

* add test_processing_mgp_str

* add test_processing_mgp_str

* add test_processing_mgp_str

* rm test_processing_mgp_str and add softmax outs to model

* rewrite the code of mgp-str according to PR suggestions

* rewrite the code of mgp-str according to PR suggestions

* remove representation_size from MGPSTRConfig

* reformat configuration_mgp_str.py

* format test_processor_mgp_str.py

* add test for tokenizer and complete model/processer test and model file

* rm Unnecessary tupple in modeling_mgp_str

* reduce hidden_size/layers/label_size in test_model

* add integration tests and change MGPSTR to Mgpstr

* add test for logit values

* reformat test model file

---------
Co-authored-by: yue kun <yuekun.wp@alibaba-inc.com>

102b5ff4

Add AutoModelForZeroShotImageClassification (#22087) · 32e3466d
Alara Dirik authored Mar 13, 2023
```
Adds AutoModelForZeroShotImageClassification to transformers
```
32e3466d

11 Mar, 2023 1 commit
- [Whisper] Remove embed_tokens from encoder docstring (#21996) · b90fbc7e
  Sanchit Gandhi authored Mar 11, 2023
```
* [Whisper] Remove embed_tokens from encoder docstring

* new line to retrigger CI

* remove new line
```
  b90fbc7e
10 Mar, 2023 10 commits
- Revert "[GPT2] Propose fix for #21080" (#22093) · 2f320661
  Yih-Dar authored Mar 10, 2023
```
Revert "[GPT2] Propose fix for #21080 (#21853)" to avoid CI failure

This reverts commit a3fef89b.
```
  2f320661
- Fix imports of TF MobileViT (#22065) · 499770c0
  Sylvain Gugger authored Mar 10, 2023
```
* Fix imports of TF MobileViT

* Fix copies
```
  499770c0
- GPT-J specific half precision on CPU note (#22086) · bdec2768
  Maria Khalusova authored Mar 10, 2023
```
* re: #21989

* update re: #21989

* removed cpu option

* make style
```
  bdec2768
- handle numpy inputs in whole word mask data collator (#22032) · 2f4cdd97
  Dean Wyatte authored Mar 10, 2023
  
  2f4cdd97
- Fix hint in src/transformers/modeling_utils.py (#22074) · a70da86b
  J-shang authored Mar 10, 2023
```
fix hint
```
  a70da86b
- Fix gradient checkpointing bug in Speecht5 (#22080) · 419d979f
  Karim Foda authored Mar 10, 2023
```
* Fix gradient checkpointing bug in Speecht5

* Update modeling_speech_to_text.py

* Update src/transformers/models/speech_to_text/modeling_speech_to_text.py

* Fix change errors

---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
```
  419d979f
- Generate - Fix broken documentation links (#22078) · 7014fc36
  Joao Gante authored Mar 10, 2023
```
fix broken links
```
  7014fc36
- Fix small typo in flan-ul2.mdx (#22068) · ade26bf9
  Kevin Jiang authored Mar 10, 2023
```
* Update flan-ul2.mdx

* Update flan-ul2.mdx
```
  ade26bf9
- [GPT2] Propose fix for #21080 (#21853) · a3fef89b
  Arthur authored Mar 10, 2023
```
* Make sure position ids are masked

* test that padded input produce the same results

* fix failing tests

* fixup

* fix batch test
```
  a3fef89b
- Fix gradient checkpointing bug in switch transformer (#22081) · eee195b3
  Karim Foda authored Mar 10, 2023
  
  eee195b3