Commits · fdd81aea12f06e24ab5cf5ba3c7316df3ab1a779 · chenpangpang / transformers

"docs/source/vscode:/vscode.git/clone" did not exist on "ae710425d2d8edf4d197bf893b90ed0546054701"

04 Aug, 2023 7 commits

[Whisper] Better error message for outdated generation config (#25298) · fdd81aea
Sanchit Gandhi authored Aug 04, 2023

fdd81aea

Make `bark` could have tiny model (#25290) · ce6d153a

Yih-Dar authored Aug 04, 2023



* temp

* update

* update

* update

* small dim

* small dim

* small dim

* fix

* update

* fix

* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ce6d153a

Document check copies (#25291) · f0fd73a2

Sylvain Gugger authored Aug 04, 2023

* Document check copies better and add tests

* Include header in check for copies

* Manual fixes

* Try autofix

* Fixes

* Clean tests

* Finalize doc

* Remove debug print

* More fixes

f0fd73a2

Deal with nested configs better in base class (#25237) · 29f04002

Sylvain Gugger authored Aug 04, 2023



* Deal better with nested configs

* Fixes

* More fixes

* Fix last test

* Clean up existing configs

* Remove hack in MPT Config

* Update src/transformers/configuration_utils.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Fix setting a nested config via dict in the kwargs

* Adapt common test

* Add test for nested config load with dict

---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

29f04002

Add offline mode for agents (#25226) · aeb5a08a
Sylvain Gugger authored Aug 04, 2023
```
* Add offline mode for agents

* Disable second check too
```
aeb5a08a
Generate: get generation mode as an enum (#25292) · bff4313b
Joao Gante authored Aug 04, 2023

bff4313b

Move usage of deprecated logging.warn to logging.warning (#25310) · 67683095

Peter Law authored Aug 04, 2023

The former spelling is deprecated and has been discouraged for a
while. The latter spelling seems to be more common in this project
anyway, so this change ought to be safe.

Fixes https://github.com/huggingface/transformers/issues/25283

67683095

03 Aug, 2023 5 commits

[JAX] Bump min version (#25286) · 66c240f3
Sanchit Gandhi authored Aug 03, 2023
```
* [JAX] Bump min version

* make fixup
```
66c240f3

Add timeout parameter to load_image function (#25184) · d114a6b7

Roland Szabo authored Aug 03, 2023



* Add timeout parameter to load_image function.

* Remove line.

* Reformat code
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Add parameter to docs.

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

d114a6b7

add generate method to SpeechT5ForTextToSpeech (#25233) · 6d3f9c1e

Yoach Lacombe authored Aug 03, 2023



* add generate method to SpeechT5ForTextToSpeech

* update speecht5forTTS docstrings

* Remove defaults to None in generate docstrings
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

6d3f9c1e

Update InstructBLIP & Align values after rescale update (#25209) · 30409af6

amyeroberts authored Aug 03, 2023

* Update InstructBLIP values
Note: the tests are not independent. Running the test independentely produces different logits compared to running all the integration tests

* Update test values after rescale update

* Remove left over commented out code

* Revert to previous rescaling logic

* Update rescale tests

30409af6

Docs: Update list of `report_to` logging integrations in docstring (#25281) · 15082a9d

Tom Aarsen authored Aug 03, 2023

* Update list of logging integrations in docstring

Also update type hint

* Also add 'flyte' to report_to callback list

* Revert 'report_to' type hint update

Due to CLI breaking

15082a9d

02 Aug, 2023 8 commits

[MMS] Fix mms (#25267) · b28ebb26

Patrick von Platen authored Aug 02, 2023

* [MMS] Fix mms

* [MMS] Fix mms

* fix mms loading

* Apply suggestions from code review

* make style

* Update tests/models/wav2vec2/test_modeling_wav2vec2.py

b28ebb26

Fix return_dict_in_generate bug in InstructBlip generate function (#25246) · 1baeed5b

Euan Ong authored Aug 02, 2023

Fix bug in InstructBlip generate function

Previously, the postprocessing conducted on generated sequences in InstructBlip's generate function assumed these sequences were tensors (i.e. that `return_dict_in_generate == False`).

This commit checks whether the result of the call to the wrapped language model `generate()` is a tensor, and if not attempts to postprocess the sequence attribute of the returned results object.

1baeed5b

[DOCS] Add example and modified docs of EtaLogitsWarper (#25125) · eec0d84e

Ashish Thomas Chempolil authored Aug 02, 2023



* added example and modified docs for EtaLogitsWarper

* make style

* fixed styling issue on 544

* removed error info and added set_seed

* Update src/transformers/generation/logits_process.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/generation/logits_process.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* updated the results

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

eec0d84e

Fix some bugs for two stage training of deformable detr (#25045) · 8021c684

Yupeng Jia authored Aug 02, 2023



* Update modeling_deformable_detr.py

Fix bugs for two stage training

* Update modeling_deformable_detr.py

* Add test_two_stage_training to DeformableDetrModelTest

---------
Co-authored-by: yupeng.jia <yupeng.jia@momenta.ai>

8021c684

resolving zero3 init when using accelerate config with Trainer (#25227) · 904e7e0f
Sourab Mangrulkar authored Aug 02, 2023
```
* resolving zero3 init when using accelerate config with Trainer

* refactor

* fix

* fix import
```
904e7e0f

add pathname and line number to logging formatter in debug mode (#25203) · c6a8768d

YQ authored Aug 02, 2023

* add pathname and lineno to logging formatter in debug mode

* use TRANSFORMERS_VERBOSITY="detail" to print pathname and lineno

c6a8768d

fix get_keys_to_not_convert() to return correct modules for full precision inference (#25105) · 2230d149
YQ authored Aug 02, 2023
```
* add test for `get_keys_to_not_convert`

* add minimum patch to keep mpt lm_head from 8bit quantization

* add reivsion to
```
2230d149
Fix set of model parallel in the Trainer when no GPUs are available (#25239) · f6f567d0
Sylvain Gugger authored Aug 02, 2023

f6f567d0

01 Aug, 2023 5 commits

Move rescale dtype recasting to match torchvision ToTensor (#25229) · d27e4c18
amyeroberts authored Aug 01, 2023
```
Move dtype recasting to match torchvision ToTensor
```
d27e4c18

[`Detr`] Fix detr BatchNorm replacement issue (#25230) · 3170af71

Younes Belkada authored Aug 01, 2023



* fix detr weird issue

* Update src/transformers/models/conditional_detr/modeling_conditional_detr.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix copies

* fix copies

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

3170af71

[`Docs`/`quantization`] Clearer explanation on how things works under the... · 972fdcc7

Younes Belkada authored Aug 01, 2023


[`Docs`/`quantization`] Clearer explanation on how things works under the hood. + remove outdated info (#25216)

* clearer explanation on how things works under the hood.

* Update docs/source/en/main_classes/quantization.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/main_classes/quantization.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add `load_in_4bit` in `from_pretrained`

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

972fdcc7

[`Pix2Struct`] Fix pix2struct cross attention (#25200) · 77c3973e
Younes Belkada authored Aug 01, 2023
```
* fix pix2struct cross attention

* fix torchscript slow test
```
77c3973e

make build_mpt_alibi_tensor a method of MptModel so that deepspeed co… (#25193) · 4033ea71

Wang, Yi authored Aug 01, 2023



make build_mpt_alibi_tensor a method of MptModel so that deepspeed could override it to make autoTP work
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

4033ea71

31 Jul, 2023 4 commits
- [`pipeline`] revisit device check for pipeline (#25207) · e0c50b27
  Younes Belkada authored Jul 31, 2023
```
* revisit device check for pipeline

* let's raise an error.
```
  e0c50b27
- [`PreTrainedModel`] Wrap `cuda` and `to` method correctly (#25206) · 59dcea3f
  Younes Belkada authored Jul 31, 2023
```
wrap `cuda` and `to` method correctly
```
  59dcea3f
- Better error message in `_prepare_output_docstrings` (#25202) · 67b85f24
  Yih-Dar authored Jul 31, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  67b85f24
- Musicgen: CFG is manually added (#25173) · 4a564490
  Joao Gante authored Jul 31, 2023
  
  4a564490
28 Jul, 2023 7 commits
- 🚨🚨🚨 Fix rescale ViVit Efficientnet (#25174) · 05cda5df
  amyeroberts authored Jul 28, 2023
```
* Fix rescaling bug

* Add tests

* Update integration tests

* Fix up

* Update src/transformers/image_transforms.py

* Update test - new possible order in list
```
  05cda5df
- [MusicGen] Fix integration tests (#25169) · 03f98f96
  Sanchit Gandhi authored Jul 28, 2023
```
* move to device

* update with cuda values

* fix fp16

* more rigorous
```
  03f98f96
- Fix beam search to sample at least 1 non eos token (#25103) (#25115) · c90e14fb
  Yoni Gottesman authored Jul 28, 2023
  
  c90e14fb
- added compiled model support for inference (#25124) · 3cbc560d
  Alexander Markov authored Jul 28, 2023
```
* added compiled model support for inference

* linter

* Fix tests

* linter

* linter

* remove inference mode from pipelines

* Linter

---------
Co-authored-by: amarkov <alexander@inworld.ai>
```
  3cbc560d
- Represent query_length in a different way to solve jit issue (#25164) · d23d2c27
  jiqing-feng authored Jul 28, 2023
```
Fix jit trace
```
  d23d2c27
- override .cuda() to check if model is already quantized (#25166) · 2a787201
  YQ authored Jul 28, 2023
  
  2a787201
- Fix `.push_to_hub` and cleanup `get_full_repo_name` usage (#25120) · 6232c380
  Lucain authored Jul 28, 2023
```
* Fix .push_to_hub and cleanup get_full_repo_name usage

* Do not rely on Python bool conversion magic

* request changes
```
  6232c380
27 Jul, 2023 4 commits

Add new model in doc table of content (#25148) · 400e76ef
Sylvain Gugger authored Jul 27, 2023

400e76ef

Add bloom flax (#25094) · e9310363

Sanchit Gandhi authored Jul 27, 2023



* First commit

* step 1 working

* add alibi

* placeholder for `scan`

* add matrix mult alibi

* beta scaling factor for bmm

* working v1 - simple forward pass

* move layer_number from attribute to arg in call

* partial functioning scan

* hacky working scan

* add more modifs

* add test

* update scan for new kwarg order

* fix position_ids problem

* fix bug in attention layer

* small fix

- do the alibi broadcasting only once

* prelim refactor

* finish refactor

* alibi shifting

* incorporate dropout_add to attention module

* make style

* make padding work again

* update

* remove bogus file

* up

* get generation to work

* clean code a bit

* added small tests

* adding albii test

* make CI tests pass:

- change init weight
- add correct tuple for output attention
- add scan test
- make CI tests work

* fix few nits

* fix nit onnx

* fix onnx nit

* add missing dtype args to nn.Modules

* remove debugging statements

* fix scan generate

* Update modeling_flax_bloom.py

* Update test_modeling_flax_bloom.py

* Update test_modeling_flax_bloom.py

* Update test_modeling_flax_bloom.py

* fix small test issue + make style

* clean up

* Update tests/models/bloom/test_modeling_flax_bloom.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* fix function name

* small fix test

* forward contrib credits from PR17761

* Fix failing test

* fix small typo documentation

* fix non passing test

- remove device from build alibi

* refactor call

- refactor `FlaxBloomBlockCollection` module

* make style

* upcast to fp32

* cleaner way to upcast

* remove unused args

* remove layer number

* fix scan test

* make style

* fix i4 casting

* fix slow test

* Update src/transformers/models/bloom/modeling_flax_bloom.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* remove `layer_past`

* refactor a bit

* fix `scan` slow test

* remove useless import

* major changes

- remove unused code
- refactor a bit
- revert import `torch`

* major refactoring

- change build alibi

* remove scan

* fix tests

* make style

* clean-up alibi

* add integration tests

* up

* fix batch norm conversion

* style

* style

* update pt-fx cross tests

* update copyright

* Update src/transformers/modeling_flax_pytorch_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* per-weight check

* style

* line formats

---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: haileyschoelkopf <haileyschoelkopf@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e9310363

More `token` things (#25146) · 0c790ddb

Yih-Dar authored Jul 27, 2023



* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

0c790ddb

Add offload support to Bark (#25037) · 0b92ae34

Yoach Lacombe authored Jul 27, 2023



* initial Bark offload proposal

* use hooks instead of manually offloading

* add test of bark offload to cpu feature

* Apply nit suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docstrings of offload
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* remove unecessary set_seed in Bark tests

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

0b92ae34