- 24 May, 2023 14 commits
-
Daniel King authored
Fix the regex in `get_imports` to support multiline try blocks and excepts with specific exception types (#23725)
* fix and test get_imports for multiline try blocks, and excepts with specific errors
* fixup
* add some more tests
* add license
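For context, a minimal sketch of the kind of pattern involved — the real regex lives in `transformers.dynamic_module_utils` and may differ; `get_toplevel_imports` is a hypothetical stand-in:

```python
import re

def get_toplevel_imports(source: str) -> list[str]:
    # Sketch: drop try/except blocks first, so imports guarded by
    # try/except (optional dependencies) are not collected. re.DOTALL
    # lets `.*?` span a multiline try body, and the except clause may
    # name a specific exception type, e.g. `except ImportError:`.
    source = re.sub(r"\s*try\s*:.*?except.*?:", "", source, flags=re.DOTALL)
    # Then collect `import xxx` and `from xxx import ...` module names.
    imports = re.findall(r"^\s*import\s+(\S+)", source, flags=re.MULTILINE)
    imports += re.findall(r"^\s*from\s+(\S+)\s+import", source, flags=re.MULTILINE)
    return [imp.split(".")[0] for imp in imports]
```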
-
Sanchit Gandhi authored
-
Matt authored
* Let's try autodetecting serving sigs
* Don't clobber existing sigs
* Change shapes for multiplechoice models
* Make default dummy inputs smarter too
* Fix missing f-string
* Let's YOLO a serving output too
* Read __class__.__name__ properly
* Don't just pass naked lists in there and expect it to be okay
* Code cleanup
* Update default serving sig
* Clearer error messages
* Further updates to the default serving output
* make fixup
* Update the serving output a bit more
* Cleanups and renames, raise errors appropriately when we can't infer inputs
* More renames
* we're building in a functional context again, yolo
* import DUMMY_INPUTS from the right place
* Support cross-attention in the dummies
* Complete removal of dummy/serving overrides in BERT
* Complete removal of dummy/serving overrides in RoBERTa
* Obliterate lots and lots of serving sig and dummy overrides
* merge type hint changes
* Fix for token_type_ids with vocab_size 1
* Add missing property decorator
* Fix T5 and hopefully some models that take conv inputs
* More signature pruning
* Fix T5's signature
* Fix Wav2Vec2 signature
* Fix LongformerForMultipleChoice input signature
* Fix BLIP and LED
* Better default serving output error handling
* Fix BART dummies
* Fix dummies for cross-attention, esp encoder-decoder models
* Fix visionencoderdecoder signature
* Fix BLIP serving output
* Small tweak to BART dummies
* Cleanup the ugly parameter inspection line that I used in a few places
* committed a breakpoint again
* Move the text_dims check
* Remove blip_text serving_output
* Add decoder_input_ids to the default input sig
* Remove all the manual overrides for encoder-decoder model signatures
* Tweak longformer/led input sigs
* Tweak default serving output
* output.keys() -> output
* make fixup
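A rough illustration of the idea — deriving a serving signature from the model's input names instead of hand-writing one per model class. The helper name, input names, shapes, and dtypes below are illustrative assumptions, not the actual implementation:

```python
import tensorflow as tf

# Sketch: build a default serving signature from known input names
# rather than overriding `serving` in every model class.
def make_serving_fn(model, input_names=("input_ids", "attention_mask")):
    signature = [
        {name: tf.TensorSpec([None, None], tf.int32, name=name) for name in input_names}
    ]
    return tf.function(model.call, input_signature=signature)
```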
-
Connor Henderson authored
move text_prompt_ids trimming to top
-
Jungnerd authored
fix: delete duplicate sentence
-
Matt authored
* Extremely small change to TF SAM dummies to reduce memory usage on build
* remove debug breakpoint
* Debug print statement to track array sizes
* More debug shape printing
* Now remove the debug shape printing
* make fixup
-
pagarsky authored
Minor docs fixes
-
Matt authored
* Rework TF type hints to use | None instead of Optional[] for tf.Tensor
* Don't forget the imports
* Add the imports to tests too
* make fixup
* Refactor tests that depended on get_type_hints
* Better test refactor
* Fix an old hidden bug in the test_keras_fit input creation code
* Fix for the Deit tests
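The style change looks roughly like this; note that on Python versions below 3.10 the `X | None` syntax is only legal in annotations with postponed evaluation enabled, which is presumably the import the bullets refer to. `ExampleLayer` is a made-up class for illustration:

```python
from __future__ import annotations  # makes `X | None` legal in annotations on Python < 3.10

import tensorflow as tf

class ExampleLayer(tf.keras.layers.Layer):
    # Before: `input_ids: Optional[tf.Tensor] = None` with `from typing import Optional`
    def call(self, input_ids: tf.Tensor | None = None, training: bool = False):
        return input_ids
```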
-
Wang, Yi authored
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
-
uchuhimo authored
fix: use bool instead of uint8/byte in Deberta/DebertaV2/SEW-D to make it compatible with TensorRT (#23683)
* Use bool instead of uint8/byte in DebertaV2 to make it compatible with TensorRT. TensorRT cannot accept an ONNX graph with uint8/byte intermediate tensors, so this PR uses bool tensors instead, allowing the exported ONNX file to work with TensorRT.
* fix: use bool instead of uint8/byte in Deberta and SEW-D
Co-authored-by: Yuxian Qiu <yuxianq@nvidia.com>
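The general shape of the fix, sketched (not the exact model code): keep masks as `torch.bool` end to end, since arithmetic like `1 - mask` forces a uint8/byte intermediate into the exported ONNX graph:

```python
import torch

attention_scores = torch.randn(2, 8, 8)
ids = torch.tensor([[1, 2, 0, 0, 3, 4, 5, 0]])
mask = (ids > 0).unsqueeze(1)  # torch.bool, shape (1, 1, 8), broadcasts over scores

# Before (creates uint8 intermediates that TensorRT rejects):
#   byte_mask = 1 - mask.byte()
#   scores = attention_scores.masked_fill(byte_mask.bool(), float("-inf"))

# After (bool all the way through; logical not instead of `1 - x`):
scores = attention_scores.masked_fill(~mask, float("-inf"))
```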
-
Maria Khalusova authored
* doc refocused on using optimum, tflite
* minor updates to fix checks
* Apply suggestions from code review
  Co-authored-by: regisss <15324346+regisss@users.noreply.github.com>
* TFLite to separate page, added links
* Removed the onnx list builder
* make style
* Update docs/source/en/serialization.mdx
  Co-authored-by: regisss <15324346+regisss@users.noreply.github.com>
Co-authored-by: regisss <15324346+regisss@users.noreply.github.com>
-
Tim Dettmers authored
* Added lion and paged optimizers and made original tests pass.
* Added tests for paged and lion optimizers.
* Added and fixed optimizer tests.
* Style and quality checks.
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
-
Tim Dettmers authored
* Added lion and paged optimizers and made original tests pass.
* Added tests for paged and lion optimizers.
* Added and fixed optimizer tests.
* Style and quality checks.
* Initial draft. Some tests fail.
* Fixed dtype bug.
* Fixed bug caused by torch_dtype='auto'.
* All test green for 8-bit and 4-bit layers.
* Added fix for fp32 layer norms and bf16 compute in LLaMA.
* Fixing issues for PR #23479.
* Reverted variable name change.
* Added missing tests.
* Fixup changes.
* Added fixup changes.
* Missed some variables to rename.
* revert trainer tests
* revert test trainer
* another revert
* fix tests and safety checkers
* protect import
* simplify a bit
* Update src/transformers/trainer.py
* few fixes
* add warning
* replace with `load_in_kbit = load_in_4bit or load_in_8bit`
* fix test
* fix tests
* this time fix tests
* safety checker
* add docs
* revert torch_dtype
* Apply suggestions from code review
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* multiple fixes
* update docs
* version checks and multiple fixes
* replace `is_loaded_in_kbit`
* replace `load_in_kbit`
* change methods names
* better checks
* oops
* address final comments
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
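The user-facing result of this PR is a `load_in_4bit` flag alongside the existing `load_in_8bit` (internally combined as `load_in_kbit = load_in_4bit or load_in_8bit`, per the bullets above). A minimal usage sketch — the checkpoint name is a placeholder, and `bitsandbytes` plus a CUDA device are assumed:

```python
from transformers import AutoModelForCausalLM

# Sketch: 4-bit loading, mirroring the existing 8-bit path.
model = AutoModelForCausalLM.from_pretrained(
    "some-org/some-causal-lm",  # placeholder checkpoint name
    load_in_4bit=True,
    device_map="auto",
)
```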
-
Wang, Yi authored
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
-
- 23 May, 2023 17 commits
-
zspo authored
* Fix some docs describing what layerdrop does
* Update src/transformers/models/data2vec/configuration_data2vec_audio.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Fix more docs
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
小桐桐 authored
Call `module.cuda()` before `module.load_state_dict()`: loading a quantized checkpoint into a non-quantized `Linear8bitLt` module is not supported. Ref: https://github.com/huggingface/peft/issues/394
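In code, the ordering fix described above looks roughly like this — a sketch assuming `bitsandbytes` is installed, a CUDA device is available, and a quantized state dict has already been saved (the file path is a placeholder):

```python
import torch
import bitsandbytes as bnb

module = bnb.nn.Linear8bitLt(1024, 1024, has_fp16_weights=False)

# Moving the module to CUDA is what triggers int8 quantization, so it
# must happen *before* loading quantized weights; the reverse order
# would load a quantized checkpoint into a still non-quantized layer.
module.cuda()
state_dict = torch.load("quantized_linear.pt")  # placeholder path
module.load_state_dict(state_dict)
```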
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
LWprogramming authored
* Fix is_batched code to allow 2-D numpy arrays for audio
* Tests
* Fix typo
* Incorporate comments from PR #23223
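A sketch of the kind of check involved (the actual feature-extractor code may differ): a 2-D numpy array should count as a batch of mono clips, in addition to a list of 1-D arrays:

```python
import numpy as np

def is_batched(raw_speech) -> bool:
    # A 2-D numpy array is a batch of mono clips; anything with more
    # dimensions (multi-channel audio) should be rejected elsewhere.
    is_batched_numpy = isinstance(raw_speech, np.ndarray) and len(raw_speech.shape) > 1
    return is_batched_numpy or (
        isinstance(raw_speech, (list, tuple))
        and isinstance(raw_speech[0], (np.ndarray, tuple, list))
    )
```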
-
Younes Belkada authored
fix blip doctest
-
Matt authored
* New TF version compatibility fixes
* Remove dummy print statement, move expand_1d
* Make a proper framework inference function
* ValueError -> TypeError
-
Younes Belkada authored
* add a dummy pipeline test
* change test name
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Nayeon Han authored
docs: ko: `tasks/monocular_depth_estimation`
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
-
Nicolas Patry authored
* Making `safetensors` a core dependency. To be merged later; I'm creating the PR so we can try it out.
* Update setup.py
* Remove duplicates.
* Even more redundant.
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Alex authored
* Update modeling_open_llama.py: fix typo in `use_memorry_efficient_attention` parameter name
* Update configuration_open_llama.py: fix typo in `use_memorry_efficient_attention` parameter name
* Update configuration_open_llama.py: take care of backwards compatibility, ensuring that the previous parameter name is taken into account if used
* Update configuration_open_llama.py: adjust the line length
* Update configuration_open_llama.py: proper code formatting using `make fixup`
* Update configuration_open_llama.py: pop the argument so it is not set again later down the line
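The backwards-compatibility piece plausibly looks something like this sketch (the class is simplified and the exact attribute handling is assumed from the commit message):

```python
class OpenLlamaConfig:
    def __init__(self, use_memory_efficient_attention: bool = True, **kwargs):
        # Accept the old misspelled kwarg for backwards compatibility and
        # pop it, so it cannot be re-applied later down the line.
        use_memory_efficient_attention = kwargs.pop(
            "use_memorry_efficient_attention", use_memory_efficient_attention
        )
        self.use_memory_efficient_attention = use_memory_efficient_attention
```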
-
NielsRogge authored
* Add PerSAM args
* Make attn_sim optional
* Rename to attention_similarity
* Add docstrings
* Improve docstrings
-
dependabot[bot] authored
Bump requests in /examples/research_projects/lxmert
Bumps [requests](https://github.com/psf/requests) from 2.22.0 to 2.31.0.
- [Release notes](https://github.com/psf/requests/releases)
- [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md)
- [Commits](https://github.com/psf/requests/compare/v2.22.0...v2.31.0)
---
updated-dependencies:
- dependency-name: requests
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bump requests in /examples/research_projects/visual_bert
Bumps [requests](https://github.com/psf/requests) from 2.22.0 to 2.31.0.
- [Release notes](https://github.com/psf/requests/releases)
- [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md)
- [Commits](https://github.com/psf/requests/compare/v2.22.0...v2.31.0)
---
updated-dependencies:
- dependency-name: requests
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bump requests in /examples/research_projects/decision_transformer
Bumps [requests](https://github.com/psf/requests) from 2.27.1 to 2.31.0.
- [Release notes](https://github.com/psf/requests/releases)
- [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md)
- [Commits](https://github.com/psf/requests/compare/v2.27.1...v2.31.0)
---
updated-dependencies:
- dependency-name: requests
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
Nicolas Patry authored
-
- 22 May, 2023 9 commits
-
NielsRogge authored
* First draft
* Remove print statements
* Add conditional generation
* Add more tests
* Remove scripts
* Remove BLIP-specific links
* Add support for pix2struct
* Add fast test
* Address comment
* Fix style
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Zachary Mueller authored
-
Matt authored
* Fix SAM tests and use smaller checkpoints
* Override test_model_from_pretrained to use sam-vit-base as well
* make fixup
-
sshahrokhi authored
-
LWprogramming authored
* Fix wav2vec2 is_batched check to include 2-D numpy arrays
* address comment
* Add tests
* oops
* Switch to np array
  Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
* Switch to np array
* condition merge
* Specify mono channel only in comment
* oops, add other comment too
* make style
* Switch list check from falsiness to empty
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
-
Tim Dettmers authored
* Fixed bug where LLaMA layer norm would change input type.
* make fix-copies
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
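The pattern behind the fix, sketched (not the verbatim model code): compute the norm in fp32 for stability, then cast back so the layer's output dtype matches its input dtype:

```python
import torch
from torch import nn

class RMSNorm(nn.Module):
    def __init__(self, hidden_size: int, eps: float = 1e-6):
        super().__init__()
        self.weight = nn.Parameter(torch.ones(hidden_size))
        self.eps = eps

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        input_dtype = hidden_states.dtype
        hidden_states = hidden_states.to(torch.float32)
        variance = hidden_states.pow(2).mean(-1, keepdim=True)
        hidden_states = hidden_states * torch.rsqrt(variance + self.eps)
        # Cast back so the norm no longer silently changes the input
        # dtype (e.g. fp16 in, fp32 out).
        return (self.weight * hidden_states).to(input_dtype)
```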
-
Zachary Mueller authored
* Fix deepspeed recursion
* Better fix
-
Younes Belkada authored
* fix logger bug
* Update tests/mixed_int8/test_mixed_int8.py
  Co-authored-by: Zachary Mueller <muellerzr@gmail.com>
* import `PartialState`
Co-authored-by: Zachary Mueller <muellerzr@gmail.com>
-