Commits · d4bf9ee1ff0e85cb24feec4dd160af39b623d4b9 · chenpangpang / transformers

12 Dec, 2022 18 commits

Update CI to torch 1.13.0 (#20687) · d4bf9ee1
Yih-Dar authored Dec 12, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
d4bf9ee1
rename `layoutlm_job` to `exotic_models_job` (#20736) · f41a11a1
Yih-Dar authored Dec 12, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
f41a11a1
Add decorator for flaky Donut tests (#20739) · 1416b5d9
amyeroberts authored Dec 12, 2022
```
* Add decorator for flaky tests

* Fix up
```
1416b5d9
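
The log does not show the decorator added by this commit. As a rough illustration of the general idea only, a flaky-test decorator can re-run a test a few times before reporting failure. The name `retry_flaky` and the `max_attempts` parameter are assumptions made for this sketch and are not taken from the commit.

```python
import functools


def retry_flaky(max_attempts=3):
    """Re-run a test until it passes or attempts run out (hypothetical helper)."""

    def decorator(test_func):
        @functools.wraps(test_func)
        def wrapper(*args, **kwargs):
            for attempt in range(1, max_attempts + 1):
                try:
                    return test_func(*args, **kwargs)
                except AssertionError:
                    # Swallow the failure unless this was the final attempt.
                    if attempt == max_attempts:
                        raise

        return wrapper

    return decorator
```

Applied as `@retry_flaky(max_attempts=3)` above a test function, this retries only on `AssertionError`, so unrelated errors still surface on the first run.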
Disambiguate test for required_input in tokenization base file. (#20731) · a450789d
Sylvain Gugger authored Dec 12, 2022
```
* Disambiguate test for required_input in tokenization base file.

* Add test for size
```
a450789d
Add a progress bar for large model loading (#20713) · 29ff8716
Sylvain Gugger authored Dec 12, 2022

29ff8716

Add gpt-sw3 model to transformers (#20209) · 5f94855d

Ariel Ekgren authored Dec 12, 2022



* Add templates for gpt-sw3

* Add templates for gpt-sw3

* Added sentencepiece tokenizer

* intermediate commit with many changes

* fixed conflicts

* Init commit for tokenization port

* Tokenization progress

* Remove fast tokenizer

* Clean up and rename spm.model -> spiece.model

* Remove TF -> PT conversion script template, Clean up Megatron -> PT script

* Optimize encode & decode performance

* added new attention

* added new attention

* attention for gpt-sw3 working

* attention good

* Cache is now working

* fixed attention mask so that it works with causal attention

* fixed baddbmm bug for cpu and caching

* updated config with correct parameters

* Refactor and leave optimizations as separate functions to avoid breaking expected functionality

* Fix special tokens mapping for both tokenizers

* cleaning up of code and comments

* HF compatible attention outputs

* Tokenizer now passing tests, add documentation

* Update documentation

* reverted back to base implementation after checking that it is identical to pretrained model

* updated gpt-sw3 config

* updated conversion script

* aligned parameters with gpt-sw3 config

* changed default scale_attn_by_inverse_layer_idx to true

* removed flag from conversion script

* added temporary model path

* reverted back to functioning convert script

* small changes to default config

* updated tests for gpt-sw3

* make style, make quality, minor cleanup

* Change local paths to testing online repository

* Change name: GptSw3 -> GPTSw3

* Remove GPTSw3TokenizerFast references

* Use official model repository and add more model sizes

* Added reference to 6.7b model

* Add GPTSw3DoubleHeadsModel to IGNORE_NON_AUTO_CONFIGURED, like GPT2DoubleHeadsModel

* Remove pointers to non-existing TFGPTSw3

* Add GPTSw3 to docs/_toctree.yml

* Remove TF artifacts from GPTSw3 in __init__ files

* Update READMEs with 'make fix-copies'

* Add 20b model to archive list

* Add documentation for GPT-Sw3

* Fix typo in documentation for GPT-Sw3

* Do 'make fix-copies' again after having updated docs

* Fix some typos in docs

* Update src/transformers/models/gpt_sw3/configuration_gpt_sw3.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/gpt_sw3/configuration_gpt_sw3.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/gpt_sw3/__init__.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/gpt_sw3/__init__.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/gpt_sw3/convert_megatron_to_pytorch.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/gpt_sw3/modeling_gpt_sw3.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update tests/models/gpt_sw3/test_tokenization_gpt_sw3.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/gpt_sw3/modeling_gpt_sw3.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/gpt_sw3/modeling_gpt_sw3.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Resolve comments from PR feedback

* Resolve more comments from PR feedback, also set use_cache=True in convert script

* Add '# Copied from' comments for GPTSw3 modeling

* Set 'is_parallelizable = False'

* Remove '# Copied from' where code was modified and add 'with x->y' when appropriate

* Remove parallelize in mdx

* make style, make quality

* Update GPTSw3Config default values and corresponding documentation

* Update src/transformers/models/gpt_sw3/tokenization_gpt_sw3.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/gpt_sw3/__init__.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Clean up and protect GPTSw3Tokenizer imports with is_sentencepiece_available

* Make style, make quality

* Add dummy object for GPTSw3Tokenizer via 'make fix-copies'

* make fix-copies

* Remove GPTSw3 modeling classes

* make style, make quality

* Add GPTSw3 auto-mappings for other GPT2 heads

* Update docs/source/en/model_doc/gpt-sw3.mdx
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/gpt_sw3/convert_megatron_to_pytorch.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/gpt_sw3/tokenization_gpt_sw3.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Remove old TODO-comment

* Add example usage to GPTSw3Tokenizer docstring

* make style, make quality

* Add implementation details and example usage to gpt-sw3.mdx
Co-authored-by: JoeyOhman <joeyoh@kth.se>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

5f94855d

Add vision requirement to image transforms (#20712) · b58beebe

amyeroberts authored Dec 12, 2022

* Add require_vision decorator

* Fixup

* Use requires_backends

* Add requires_backend to utils functions

b58beebe
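
The commit messages above mention guarding functions behind a backend check. A minimal sketch of that pattern, assuming a hypothetical helper `require_packages` (it does not reproduce the library's own `requires_backends`), could look like this:

```python
import importlib.util


def require_packages(caller, packages):
    """Raise ImportError naming every missing top-level package (hypothetical helper)."""
    missing = [name for name in packages if importlib.util.find_spec(name) is None]
    if missing:
        raise ImportError(
            f"{caller} requires the following packages: {', '.join(missing)}"
        )
```

Calling the check at the top of a function makes a missing optional dependency fail early with a readable message rather than with an error deep inside the call.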

Clarify return_tensor and return_text parameters (#20662) · fd2bed7f
Steven Liu authored Dec 12, 2022
```
* clarify docstring

* make style
```
fd2bed7f
Convert tokenizer outputs for Keras in doc example (#20732) · c1b9a11d
Matt authored Dec 12, 2022
```
* Convert tokenizer outputs for Keras in doc example

* Also fix the German example
```
c1b9a11d

Spanish translation of the file debugging.mdx (#20566) · 0ba94ace

Juanjo do Olmo authored Dec 12, 2022



* Create and translate to Spanish debugging.mdx

* solved typo error in a header

* Update debugging.mdx

* Update debugging.mdx

* Update docs/source/es/debugging.mdx
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update docs/source/es/debugging.mdx
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update docs/source/es/debugging.mdx
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update docs/source/es/debugging.mdx
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update docs/source/es/debugging.mdx
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update _toctree.yml
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

0ba94ace

fsdp fix (#20719) · a413c725
Sourab Mangrulkar authored Dec 12, 2022

a413c725
Very small edit to change name to OpenAI GPT (#20722) · 17c742bb
stanleycai95 authored Dec 12, 2022

17c742bb

Add type hints for Whisper models (#20396) · 8f1f59ce

Ian C authored Dec 12, 2022

* Initial commit

* Add type hints for two major classes

* Run make fixup

* Fix output type for Whisper

* Run isort to fix imports

8f1f59ce

Adding ValueError when incompatible parameters are used. (#20729) · 53357e81
Nicolas Patry authored Dec 12, 2022

53357e81
Fix `AutoModelTest.test_model_from_pretrained` (#20730) · 5ba2dbd9
Yih-Dar authored Dec 12, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
5ba2dbd9

Add `accelerate` support for LongT5 models (#20341) · a3345c1f

Peter authored Dec 12, 2022

* ✨ add accelerate support for LongT5 models
Signed-off-by: peter szemraj <peterszemraj@gmail.com>

* fix `accelerate` tests

* Trigger CI test
Signed-off-by: peter szemraj <peterszemraj@gmail.com>
Co-authored-by: younesbelkada <younesbelkada@gmail.com>

a3345c1f

Spanish translation of asr.mdx and add_new_pipeline.mdx (#20569) · 8286af6f

Alberto Mario Ceballos-Arroyo authored Dec 12, 2022



* Fix minor typo in question_answering.mdx

* Fixes minor typo in the english version of tasks/asr.mdx

* Update _toctree.yml

* Translate add_new_pipeline.mdx into Spanish

* Fixes some typos in the English version of add_new_pipeline.mdx

* Translate asr.mdx into Spanish

* Fixes small typos in add_new_pipeline.mdx

* Update docs/source/es/add_new_pipeline.mdx

Suggestion by @osanseviero
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update docs/source/es/add_new_pipeline.mdx

Suggestion by @osanseviero: use "biblioteca" instead of "librería."
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update docs/source/es/tasks/asr.mdx

Suggestion by @osanseviero.
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update docs/source/es/add_new_pipeline.mdx
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update docs/source/es/add_new_pipeline.mdx

Suggestion by @osanseviero.
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update docs/source/es/add_new_pipeline.mdx

Suggestion by @osanseviero.
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update docs/source/es/add_new_pipeline.mdx
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update docs/source/es/tasks/asr.mdx
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update docs/source/es/tasks/asr.mdx
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update docs/source/es/tasks/asr.mdx
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

* Update asr.mdx
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>

8286af6f

Made LUKE Tokenizer independent from RoBERTa (#20720) · 8d2fca07
Salvo Cavallaro authored Dec 12, 2022

8d2fca07

09 Dec, 2022 7 commits

Fix rendering issue in quicktour (#20708) · 799cea64
Sylvain Gugger authored Dec 09, 2022
```
* Fix rendering issue in quicktour

* Separate in two blocks
```
799cea64

[`ViTHybrid`] fix last `accelerate` slow test (#20705) · 74330083

Younes Belkada authored Dec 09, 2022

* fix last slow test

* revert deletion

* Update src/transformers/models/vit_hybrid/modeling_vit_hybrid.py

74330083

Replace FE references (#20702) · 73198509
amyeroberts authored Dec 09, 2022

73198509

Vision processors - replace FE with IPs (#20590) · a95fd354

amyeroberts authored Dec 09, 2022



* Replace FE references with IPs

* Update processor tests

* Update src/transformers/models/clip/processing_clip.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/clip/processing_clip.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update warning messages v4.27 -> v5

* Fixup

* Update Chinese CLIP processor

* Add feature_extractor property

* Add attributes

* Add tests
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

a95fd354

skip `test_multi_gpu_data_parallel_forward` for `MaskFormerSwinModelTest` (#20688) · 704027f0
Yih-Dar authored Dec 09, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
704027f0

Change transformers.onnx to use optimum.exporters.onnx (#20529) · 6a062a3e

Michael Benayoun authored Dec 09, 2022



* Change transformers.onnx to use optimum.exporters.onnx

* Update doc

* Remove print

* Fix transformers.onnx cli

* Update documentation

* Update documentation

* Small fixes

* Fix log message

* Apply suggestions

* Update src/transformers/onnx/__main__.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply suggestions

* Add missing line break

* Ran make fix-copies

* Update src/transformers/onnx/__main__.py
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* Update src/transformers/onnx/__main__.py
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
Co-authored-by: Michael Benayoun <michael@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

6a062a3e

[Backbones] Improve out features (#20675) · 9a6c6ef9

NielsRogge authored Dec 09, 2022



* Improve ResNet backbone

* Improve Bit backbone

* Improve docstrings

* Fix default stage

* Apply suggestions from code review
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

9a6c6ef9

08 Dec, 2022 15 commits

Add video classification pipeline (#20151) · 9e56aff5

Nathan Raw authored Dec 08, 2022

* 🚧 wip video classification pipeline

* 🚧 wip - add is_decord_available check

* 🐛 add missing import

* ✅ add tests

* 🔧 add decord to setup extras

* 🚧 add is_decord_available

* ✨ add video-classification pipeline

* 📝 add video classification pipe to docs

* 🐛 add missing VideoClassificationPipeline import

* 📌 add decord install in test runner

* ✅ fix url inputs to video-classification pipeline

* ✨ updates from review

* 📝 add video cls pipeline to docs

* 📝 add docstring

* 🔥 remove unused import

* 🔥 remove some code

* 📝 docfix

9e56aff5

Add deprecation warning when image FE instantiated (#20427) · c56ebbbe

amyeroberts authored Dec 08, 2022



* Add deprecation warning when image FE instantiated

* Update src/transformers/models/beit/feature_extraction_beit.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update v2.7 -> v5 and add for new IPs

* Add message to Chinese CLIP
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

c56ebbbe
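
As an illustration of the pattern this commit describes (warning when a deprecated class is instantiated), here is a minimal sketch. The class names `ExampleImageProcessor` and `ExampleFeatureExtractor` are hypothetical stand-ins, and the message text is an assumption rather than the wording used in the commit.

```python
import warnings


class ExampleImageProcessor:
    """Stand-in for the replacement class (hypothetical name)."""

    def __init__(self, size=224):
        self.size = size


class ExampleFeatureExtractor(ExampleImageProcessor):
    """Deprecated alias: warns on instantiation, then behaves like its parent."""

    def __init__(self, *args, **kwargs):
        warnings.warn(
            "ExampleFeatureExtractor is deprecated; use ExampleImageProcessor instead.",
            FutureWarning,
        )
        super().__init__(*args, **kwargs)
```

Subclassing the replacement keeps old call sites working while the warning points users to the new class.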

Added missing `test_tokenization_led` (#20568) · 183af58b

IMvision12 authored Dec 09, 2022

* Create test_tokenization_led.py

* Update test_tokenization_led.py

* Update test_tokenization_led.py

* Update test_tokenization_led.py

* Update test_tokenization_led.py

* Update test_tokenization_led.py

* Update test_tokenization_led.py

* Update test_tokenization_led.py

* Update test_tokenization_led.py

183af58b

Fix donut image processor (#20625) · cf1b8c34

amyeroberts authored Dec 08, 2022

* fix donut image processor

* Update test values

* Apply lower bound on resizing size

* Add in missing size param

* Resolve resize channel_dimension bug

* Update src/transformers/image_transforms.py

cf1b8c34

Fix CIs for PyTorch 1.13 (#20686) · e3cc4487

Yih-Dar authored Dec 08, 2022



* fix 1

* fix 2

* fix 3

* fix 4
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

e3cc4487

Enable bf16 option for XLA devices (#20684) · bcc069dd
jeffhataws authored Dec 08, 2022

bcc069dd
[`ViTHybrid`] Fix `accelerate` slow tests (#20679) · 9858ecd7
Younes Belkada authored Dec 08, 2022
```
* fix failing `accelerate` tests

* make fixup

* smaller values

* even lower
```
9858ecd7
Whitelist Transformers private method in DummyObject (#20681) · 69038ce0
Sylvain Gugger authored Dec 08, 2022

69038ce0

Migrate torchdynamo to torch.compile (#20634) · 9cc65f87

Sylvain Gugger authored Dec 08, 2022

* Migrate torchdynamo to torch.compile

* Add docstring and generic option

* Properly use the function...

* Reorg args

9cc65f87

Bump certifi in /examples/research_projects/visual_bert (#20673) · da95f6ca

dependabot[bot] authored Dec 08, 2022

Bumps [certifi](https://github.com/certifi/python-certifi) from 2020.6.20 to 2022.12.7.
- [Release notes](https://github.com/certifi/python-certifi/releases)
- [Commits](https://github.com/certifi/python-certifi/compare/2020.06.20...2022.12.07)

---
updated-dependencies:
- dependency-name: certifi
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

da95f6ca

Bump certifi in /examples/research_projects/decision_transformer (#20677) · efd7c021

dependabot[bot] authored Dec 08, 2022

Bumps [certifi](https://github.com/certifi/python-certifi) from 2021.10.8 to 2022.12.7.
- [Release notes](https://github.com/certifi/python-certifi/releases)
- [Commits](https://github.com/certifi/python-certifi/compare/2021.10.08...2022.12.07)

---
updated-dependencies:
- dependency-name: certifi
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

efd7c021

Bump certifi in /examples/research_projects/lxmert (#20672) · 9e33e19b

dependabot[bot] authored Dec 08, 2022

Bumps [certifi](https://github.com/certifi/python-certifi) from 2020.6.20 to 2022.12.7.
- [Release notes](https://github.com/certifi/python-certifi/releases)
- [Commits](https://github.com/certifi/python-certifi/compare/2020.06.20...2022.12.07)

---
updated-dependencies:
- dependency-name: certifi
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

9e33e19b

Add `BackboneMixin` (#20660) · 6eae3f78

Yih-Dar authored Dec 08, 2022



* add BackboneBaseModel

* add BackboneBaseModel

* Rename to BackboneMixin

* remove nn.Module
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

6eae3f78

Fix expected values for TF-ESM tests (#20680) · be3d6c84
Matt authored Dec 08, 2022

be3d6c84
Update the list of contributors to reflect current organization (#20603) · c83703cb
Sylvain Gugger authored Dec 08, 2022
```
* Update the list of contributors to reflect current organization

* Proper indent
```
c83703cb