Commits · 2eaaf17a0b0ab4c13cb1b1e87accd2d5dee47be4 · chenpangpang / transformers

24 May, 2023 4 commits

Export to ONNX doc refocused on using optimum, added tflite (#23434) · 2eaaf17a

Maria Khalusova authored May 24, 2023



* doc refocused on using optimum, tflite

* minor updates to fix checks

* Apply suggestions from code review
Co-authored-by: regisss <15324346+regisss@users.noreply.github.com>

* TFLite to separate page, added links

* Removed the onnx list builder

* make style

* Update docs/source/en/serialization.mdx
Co-authored-by: regisss <15324346+regisss@users.noreply.github.com>

---------
Co-authored-by: regisss <15324346+regisss@users.noreply.github.com>

2eaaf17a

Paged Optimizer + Lion Optimizer for Trainer (#23217) · 796162c5

Tim Dettmers authored May 24, 2023



* Added lion and paged optimizers and made original tests pass.

* Added tests for paged and lion optimizers.

* Added and fixed optimizer tests.

* Style and quality checks.

---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>

796162c5

4-bit QLoRA via bitsandbytes (4-bit base model + LoRA) (#23479) · 9d73b922

Tim Dettmers authored May 24, 2023



* Added lion and paged optimizers and made original tests pass.

* Added tests for paged and lion optimizers.

* Added and fixed optimizer tests.

* Style and quality checks.

* Initial draft. Some tests fail.

* Fixed dtype bug.

* Fixed bug caused by torch_dtype='auto'.

* All test green for 8-bit and 4-bit layers.

* Added fix for fp32 layer norms and bf16 compute in LLaMA.

* Initial draft. Some tests fail.

* Fixed dtype bug.

* Fixed bug caused by torch_dtype='auto'.

* All test green for 8-bit and 4-bit layers.

* Added lion and paged optimizers and made original tests pass.

* Added tests for paged and lion optimizers.

* Added and fixed optimizer tests.

* Style and quality checks.

* Fixing issues for PR #23479.

* Added fix for fp32 layer norms and bf16 compute in LLaMA.

* Reverted variable name change.

* Initial draft. Some tests fail.

* Fixed dtype bug.

* Fixed bug caused by torch_dtype='auto'.

* All test green for 8-bit and 4-bit layers.

* Added lion and paged optimizers and made original tests pass.

* Added tests for paged and lion optimizers.

* Added and fixed optimizer tests.

* Style and quality checks.

* Added missing tests.

* Fixup changes.

* Added fixup changes.

* Missed some variables to rename.

* revert trainer tests

* revert test trainer

* another revert

* fix tests and safety checkers

* protect import

* simplify a bit

* Update src/transformers/trainer.py

* few fixes

* add warning

* replace with `load_in_kbit = load_in_4bit or load_in_8bit`

* fix test

* fix tests

* this time fix tests

* safety checker

* add docs

* revert torch_dtype

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* multiple fixes

* update docs

* version checks and multiple fixes

* replace `is_loaded_in_kbit`

* replace `load_in_kbit`

* change methods names

* better checks

* oops

* oops

* address final comments

---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

9d73b922

add GPTJ/bloom/llama/opt into model list and enhance the jit support (#23291) · 33687a3f
Wang, Yi authored May 24, 2023
```
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
```
33687a3f

23 May, 2023 17 commits

Fix some docs what layerdrop does (#23691) · 003a0cf8

zspo authored May 24, 2023



* Fix some docs what layerdrop does

* Update src/transformers/models/data2vec/configuration_data2vec_audio.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Fix more docs

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

003a0cf8

fix: load_best_model_at_end error when load_in_8bit is True (#23443) · 357f281b

小桐桐 authored May 24, 2023

Ref: https://github.com/huggingface/peft/issues/394
Loading a quantized checkpoint into non-quantized Linear8bitLt is not supported.
call module.cuda() before module.load_state_dict()

357f281b

Skip `TFCvtModelTest::test_keras_fit_mixed_precision` for now (#23699) · de5f86e5
Yih-Dar authored May 23, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
de5f86e5

is_batched fix for remaining 2-D numpy arrays (#23309) · 3d574044

LWprogramming authored May 23, 2023

* Fix is_batched code to allow 2-D numpy arrays for audio

* Tests

* Fix typo

* Incorporate comments from PR #23223

3d574044

[`Blip`] Fix blip doctest (#23698) · 6b7d6f84
Younes Belkada authored May 23, 2023
```
fix blip doctest
```
6b7d6f84

TF version compatibility fixes (#23663) · 876d9a32

Matt authored May 23, 2023

* New TF version compatibility fixes

* Remove dummy print statement, move expand_1d

* Make a proper framework inference function

* Make a proper framework inference function

* ValueError -> TypeError

876d9a32

[`SAM`] Fixes pipeline and adds a dummy pipeline test (#23684) · 42baa58f
Younes Belkada authored May 23, 2023
```
* add a dummy pipeline test

* change test name
```
42baa58f
Fix a `BridgeTower` test (#23694) · 71a5ed34
Yih-Dar authored May 23, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
71a5ed34

🌐

[i18n-KO] Translated `tasks/monocular_depth_estimation.mdx` to Korean (#23621) · 1fe1e3ca

Nayeon Han authored May 23, 2023



docs: ko: `tasks/monocular_depth_estimation`
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

1fe1e3ca

Making `safetensors` a core dependency. (#23254) · 9e8d7066

Nicolas Patry authored May 23, 2023

* Making `safetensors` a core dependency.

To be merged later, I'm creating the PR so we can try it out.

* Update setup.py

* Remove duplicates.

* Even more redundant.

9e8d7066

Fix PyTorch SAM tests (#23682) · abf691aa
Yih-Dar authored May 23, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
abf691aa

Fix typo in a parameter name for open llama model (#23637) · b687af0b

Alex authored May 23, 2023

* Update modeling_open_llama.py

Fix typo in `use_memorry_efficient_attention` parameter name

* Update configuration_open_llama.py

Fix typo in `use_memorry_efficient_attention` parameter name

* Update configuration_open_llama.py

Take care of backwards compatibility ensuring that the previous parameter name is taken into account if used

* Update configuration_open_llama.py

format to adjust the line length

* Update configuration_open_llama.py

proper code formatting using `make fixup`

* Update configuration_open_llama.py

pop the argument not to let it be set later down the line

b687af0b

Add PerSAM [bis] (#23659) · 527ab894

NielsRogge authored May 23, 2023

* Add PerSAM args

* Make attn_sim optional

* Rename to attention_similarity

* Add docstrigns

* Improve docstrings

527ab894

Bump requests from 2.22.0 to 2.31.0 in /examples/research_projects/lxmert (#23668) · aa30cd4f

dependabot[bot] authored May 23, 2023

Bump requests in /examples/research_projects/lxmert

Bumps [requests](https://github.com/psf/requests) from 2.22.0 to 2.31.0.
- [Release notes](https://github.com/psf/requests/releases)
- [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md)
- [Commits](https://github.com/psf/requests/compare/v2.22.0...v2.31.0

)

---
updated-dependencies:
- dependency-name: requests
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

aa30cd4f

Bump requests from 2.22.0 to 2.31.0 in /examples/research_projects/visual_bert (#23670) · 9bf72ae5

dependabot[bot] authored May 23, 2023

Bump requests in /examples/research_projects/visual_bert

Bumps [requests](https://github.com/psf/requests) from 2.22.0 to 2.31.0.
- [Release notes](https://github.com/psf/requests/releases)
- [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md)
- [Commits](https://github.com/psf/requests/compare/v2.22.0...v2.31.0

)

---
updated-dependencies:
- dependency-name: requests
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

9bf72ae5

Bump requests from 2.27.1 to 2.31.0 in /examples/research_projects/decision_transformer (#23673) · ecc05f8c

dependabot[bot] authored May 23, 2023

Bump requests in /examples/research_projects/decision_transformer

Bumps [requests](https://github.com/psf/requests) from 2.27.1 to 2.31.0.
- [Release notes](https://github.com/psf/requests/releases)
- [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md)
- [Commits](https://github.com/psf/requests/compare/v2.27.1...v2.31.0

)

---
updated-dependencies:
- dependency-name: requests
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

ecc05f8c

small fix to remove unused eos in processor when it's not used. (#23408) · e30ceae0
Nicolas Patry authored May 23, 2023

e30ceae0

22 May, 2023 12 commits

[image-to-text pipeline] Add conditional text support + GIT (#23362) · 2f424d79

NielsRogge authored May 22, 2023

* First draft

* Remove print statements

* Add conditional generation

* Add more tests

* Remove scripts

* Remove BLIP specific linkes

* Add support for pix2struct

* Add fast test

* Address comment

* Fix style

2f424d79

Update workflow files (#23658) · e69feab8

Yih-Dar authored May 22, 2023



* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

e69feab8

Update all no_trainer with skip_first_batches (#23664) · b191d7db
Zachary Mueller authored May 22, 2023

b191d7db

Fix SAM tests and use smaller checkpoints (#23656) · 26a06814

Matt authored May 22, 2023

* Fix SAM tests and use smaller checkpoints

* Override test_model_from_pretrained to use sam-vit-base as well

* make fixup

26a06814

changing the requirements to a cpu torch version that works (#23483) · 6f72e71f
sshahrokhi authored May 22, 2023

6f72e71f

Fix wav2vec2 is_batched check to include 2-D numpy arrays (#23223) · 5de2a6d5

LWprogramming authored May 22, 2023



* Fix wav2vec2 is_batched check to include 2-D numpy arrays

* address comment

* Add tests

* oops

* oops

* Switch to np array
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Switch to np array

* condition merge

* Specify mono channel only in comment

* oops, add other comment too

* make style

* Switch list check from falsiness to empty

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

5de2a6d5

Bugfix: LLaMA layer norm incorrectly changes input type and consumers lots of memory (#23535) · 4ddd9de9

Tim Dettmers authored May 22, 2023



* Fixed bug where LLaMA layer norm would change input type.

* make fix-copies

---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>

4ddd9de9

Muellerzr fix deepspeed (#23657) · fe34486f
Zachary Mueller authored May 22, 2023
```
* Fix deepspeed recursion

* Better fix
```
fe34486f

Fix accelerate logger bug (#23650) · 7bbdfd7b

Younes Belkada authored May 22, 2023



* fix logger bug

* Update tests/mixed_int8/test_mixed_int8.py
Co-authored-by: Zachary Mueller <muellerzr@gmail.com>

* import `PartialState`

---------
Co-authored-by: Zachary Mueller <muellerzr@gmail.com>

7bbdfd7b

Fix tensor device while attention_mask is not None (#23538) · 29294b0e

zspo authored May 22, 2023

* Fix tensor device while attention_mask is not None

* Fix tensor device while attention_mask is not None

29294b0e

Remove erroneous `img` closing tag (#23646) · 12ec7f0c
Joshua Lochner authored May 22, 2023
```
See https://github.com/huggingface/transformers/pull/23625
```
12ec7f0c

Debug example code for MegaForCausalLM (#23382) · 6397b7f0

Tyler authored May 22, 2023

* Debug example code for MegaForCausalLM

set ignore_mismatched_sizes=True in model loading code

* Fix up

6397b7f0

20 May, 2023 1 commit
- Fix `tests/repo_utils/test_get_test_info.py` (#23485) · 3658488f
  Yih-Dar authored May 20, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  3658488f
19 May, 2023 6 commits

Fix confusing `transformers` installation in CI (#23465) · 9728f113
Yih-Dar authored May 19, 2023
```
* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
9728f113
Fix DeepSpeed stuff in the nightly CI (#23478) · 1f2c00d6
Yih-Dar authored May 19, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
1f2c00d6
[`Blip`] Remove redundant shift right (#23153) · 3cb93090
Younes Belkada authored May 19, 2023
```
* remove redundant shit right

* fix failing tests

* this time fix tests
```
3cb93090

Fix: Change tensors to integers for torch.dynamo and torch.compile compatibility (#23475) · 847e5691

Dennis Loevlie authored May 19, 2023

* Fix: Change tensors to integers in torch.split() for torch.dynamo and torch.compile compatibility

* Applied the suggested fix to the utils/check_copies.py test

* Applied the suggested fix by changing the original function that gets copied

847e5691

Fix PretrainedConfig `min_length` docstring (#23471) · 389bdba6
joaoareis authored May 19, 2023

389bdba6

Fix parallel mode check (#23409) · b455ad0a

Zachary Mueller authored May 19, 2023

* Fix sagemaker/distributed state

* Fix correctly

* Bring back -1

* Bring back local rank for distributed check

* better version

* Cleanest option

b455ad0a