Commits · eec0d84e6a98f597508fc177ec3a961f923d4e5d · chenpangpang / transformers

"examples/vscode:/vscode.git/clone" did not exist on "b9bb417324c0d9013c505dc39c016ab9ca0e23c8"

02 Aug, 2023 8 commits

[DOCS] Add example and modified docs of EtaLogitsWarper (#25125) · eec0d84e

Ashish Thomas Chempolil authored Aug 02, 2023



* added example and modified docs for EtaLogitsWarper

* make style

* fixed styling issue on 544

* removed error info and added set_seed

* Update src/transformers/generation/logits_process.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/generation/logits_process.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* updated the results

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

eec0d84e

Fix some bugs for two stage training of deformable detr (#25045) · 8021c684

Yupeng Jia authored Aug 02, 2023



* Update modeling_deformable_detr.py

Fix bugs for two stage training

* Update modeling_deformable_detr.py

* Add test_two_stage_training to DeformableDetrModelTest

---------
Co-authored-by: yupeng.jia <yupeng.jia@momenta.ai>

8021c684

Update rescale tests - cast to float after rescaling to reflect #25229 (#25259) · 1b354097
amyeroberts authored Aug 02, 2023
```
Rescale tests - cast to float after rescaling to reflect #25229
```
1b354097
resolving zero3 init when using accelerate config with Trainer (#25227) · 904e7e0f
Sourab Mangrulkar authored Aug 02, 2023
```
* resolving zero3 init when using accelerate config with Trainer

* refactor

* fix

* fix import
```
904e7e0f

Add `token` arugment in example scripts (#25172) · 149cb0cc

Yih-Dar authored Aug 02, 2023



* fix

* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

149cb0cc

add pathname and line number to logging formatter in debug mode (#25203) · c6a8768d

YQ authored Aug 02, 2023

* add pathname and lineno to logging formatter in debug mode

* use TRANSFORMERS_VERBOSITY="detail" to print pathname and lineno

c6a8768d

fix get_keys_to_not_convert() to return correct modules for full precision inference (#25105) · 2230d149
YQ authored Aug 02, 2023
```
* add test for `get_keys_to_not_convert`

* add minimum patch to keep mpt lm_head from 8bit quantization

* add reivsion to
```
2230d149
Fix set of model parallel in the Trainer when no GPUs are available (#25239) · f6f567d0
Sylvain Gugger authored Aug 02, 2023

f6f567d0

01 Aug, 2023 6 commits

Move rescale dtype recasting to match torchvision ToTensor (#25229) · d27e4c18
amyeroberts authored Aug 01, 2023
```
Move dtype recasting to match torchvision ToTensor
```
d27e4c18

[`Detr`] Fix detr BatchNorm replacement issue (#25230) · 3170af71

Younes Belkada authored Aug 01, 2023



* fix detr weird issue

* Update src/transformers/models/conditional_detr/modeling_conditional_detr.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix copies

* fix copies

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

3170af71

[`MPT`] Add `require_bitsandbytes` on MPT integration tests (#25201) · 05ebb026
Younes Belkada authored Aug 01, 2023
```
* add  `require_bitsandbytes` on MPT integration tests

* add it on mpt as well
```
05ebb026

[`Docs`/`quantization`] Clearer explanation on how things works under the... · 972fdcc7

Younes Belkada authored Aug 01, 2023


[`Docs`/`quantization`] Clearer explanation on how things works under the hood. + remove outdated info (#25216)

* clearer explanation on how things works under the hood.

* Update docs/source/en/main_classes/quantization.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/main_classes/quantization.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add `load_in_4bit` in `from_pretrained`

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

972fdcc7

[`Pix2Struct`] Fix pix2struct cross attention (#25200) · 77c3973e
Younes Belkada authored Aug 01, 2023
```
* fix pix2struct cross attention

* fix torchscript slow test
```
77c3973e

make build_mpt_alibi_tensor a method of MptModel so that deepspeed co… (#25193) · 4033ea71

Wang, Yi authored Aug 01, 2023



make build_mpt_alibi_tensor a method of MptModel so that deepspeed could override it to make autoTP work
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

4033ea71

31 Jul, 2023 8 commits
- Fix docker image build failure (#25214) · 0fd8d2aa
  Yih-Dar authored Jul 31, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  0fd8d2aa
- Update tiny model info. and pipeline testing (#25213) · 1b4f6199
  Yih-Dar authored Jul 31, 2023
```
* update tiny_model_summary.json

* update

* update

* update

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  1b4f6199
- [`pipeline`] revisit device check for pipeline (#25207) · e0c50b27
  Younes Belkada authored Jul 31, 2023
```
* revisit device check for pipeline

* let's raise an error.
```
  e0c50b27
- [quantization.md] fix (#25190) · 52206066
  Stas Bekman authored Jul 31, 2023
```
Update quantization.md
```
  52206066
- Fix `all_model_classes` in `FlaxBloomGenerationTest` (#25211) · 9ca3aa01
  Yih-Dar authored Jul 31, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  9ca3aa01
- [`PreTrainedModel`] Wrap `cuda` and `to` method correctly (#25206) · 59dcea3f
  Younes Belkada authored Jul 31, 2023
```
wrap `cuda` and `to` method correctly
```
  59dcea3f
- Better error message in `_prepare_output_docstrings` (#25202) · 67b85f24
  Yih-Dar authored Jul 31, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  67b85f24
- Musicgen: CFG is manually added (#25173) · 4a564490
  Joao Gante authored Jul 31, 2023
  
  4a564490
28 Jul, 2023 13 commits

🚨

Fix rescale ViVit Efficientnet (#25174) · 05cda5df

amyeroberts authored Jul 28, 2023

* Fix rescaling bug

* Add tests

* Update integration tests

* Fix up

* Update src/transformers/image_transforms.py

* Update test - new possible order in list

05cda5df

[MusicGen] Fix integration tests (#25169) · 03f98f96
Sanchit Gandhi authored Jul 28, 2023
```
* move to device

* update with cuda values

* fix fp16

* more rigorous
```
03f98f96
Fix beam search to sample at least 1 non eos token (#25103) (#25115) · c90e14fb
Yoni Gottesman authored Jul 28, 2023

c90e14fb

🌐

[i18n-KO] Translated `transformers_agents.md` to Korean (#24881) · 31f137c0

Sohyun Sim authored Jul 29, 2023



* docs: ko: transformers_agents.md

* docs: ko: transformers_agents.md

* feat: deepl draft

* fix: manual edits

* fix: resolve suggestions
Co-authored-by: Juntae <79131091+sronger@users.noreply.github.com>
Co-authored-by: Injin Paek <71638597+eenzeenee@users.noreply.github.com>

---------
Co-authored-by: Juntae <79131091+sronger@users.noreply.github.com>
Co-authored-by: Injin Paek <71638597+eenzeenee@users.noreply.github.com>

31f137c0

[`InstructBlip`] Fix instructblip slow test (#25171) · dd9d45b6
Younes Belkada authored Jul 28, 2023
```
* fix instruct blip slow test

* Update tests/models/instructblip/test_modeling_instructblip.py
```
dd9d45b6
[`Mpt`] Fix mpt slow test (#25170) · add0895d
Younes Belkada authored Jul 28, 2023
```
fix mpt slow test
```
add0895d

Update `use_auth_token` -> `token` in example scripts (#25167) · d53b8ad7

Yih-Dar authored Jul 28, 2023



* pytorch examples

* tensorflow examples

* flax examples

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

d53b8ad7

added compiled model support for inference (#25124) · 3cbc560d

Alexander Markov authored Jul 28, 2023



* added compiled model support for inference

* linter

* Fix tests

* linter

* linter

* remove inference mode from pipelines

* Linter

---------
Co-authored-by: amarkov <alexander@inworld.ai>

3cbc560d

make run_generation more generic for other devices (#25133) · afa96fff

Alan Ji authored Jul 28, 2023



* make run_generation more generic for other devices

* use Accelerate to support any device type it supports.

* make style

* fix error usage of accelerator.prepare_model

* use `PartialState` to make sure everything is running on the right device

---------
Co-authored-by: statelesshz <jihuazhong1@huawei.com>

afa96fff

Represent query_length in a different way to solve jit issue (#25164) · d23d2c27
jiqing-feng authored Jul 28, 2023
```
Fix jit trace
```
d23d2c27
override .cuda() to check if model is already quantized (#25166) · 2a787201
YQ authored Jul 28, 2023

2a787201
Add test when downloading from gated repo (#25039) · c1dba111
Lucain authored Jul 28, 2023

c1dba111

Fix `.push_to_hub` and cleanup `get_full_repo_name` usage (#25120) · 6232c380

Lucain authored Jul 28, 2023

* Fix .push_to_hub and cleanup get_full_repo_name usage

* Do not rely on Python bool conversion magic

* request changes

6232c380

27 Jul, 2023 5 commits

Add new model in doc table of content (#25148) · 400e76ef
Sylvain Gugger authored Jul 27, 2023

400e76ef

Add bloom flax (#25094) · e9310363

Sanchit Gandhi authored Jul 27, 2023



* First commit

* step 1 working

* add alibi

* placeholder for `scan`

* add matrix mult alibi

* beta scaling factor for bmm

* working v1 - simple forward pass

* move layer_number from attribute to arg in call

* partial functioning scan

* hacky working scan

* add more modifs

* add test

* update scan for new kwarg order

* fix position_ids problem

* fix bug in attention layer

* small fix

- do the alibi broadcasting only once

* prelim refactor

* finish refactor

* alibi shifting

* incorporate dropout_add to attention module

* make style

* make padding work again

* update

* remove bogus file

* up

* get generation to work

* clean code a bit

* added small tests

* adding albii test

* make CI tests pass:

- change init weight
- add correct tuple for output attention
- add scan test
- make CI tests work

* fix few nits

* fix nit onnx

* fix onnx nit

* add missing dtype args to nn.Modules

* remove debugging statements

* fix scan generate

* Update modeling_flax_bloom.py

* Update test_modeling_flax_bloom.py

* Update test_modeling_flax_bloom.py

* Update test_modeling_flax_bloom.py

* fix small test issue + make style

* clean up

* Update tests/models/bloom/test_modeling_flax_bloom.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* fix function name

* small fix test

* forward contrib credits from PR17761

* Fix failing test

* fix small typo documentation

* fix non passing test

- remove device from build alibi

* refactor call

- refactor `FlaxBloomBlockCollection` module

* make style

* upcast to fp32

* cleaner way to upcast

* remove unused args

* remove layer number

* fix scan test

* make style

* fix i4 casting

* fix slow test

* Update src/transformers/models/bloom/modeling_flax_bloom.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* remove `layer_past`

* refactor a bit

* fix `scan` slow test

* remove useless import

* major changes

- remove unused code
- refactor a bit
- revert import `torch`

* major refactoring

- change build alibi

* remove scan

* fix tests

* make style

* clean-up alibi

* add integration tests

* up

* fix batch norm conversion

* style

* style

* update pt-fx cross tests

* update copyright

* Update src/transformers/modeling_flax_pytorch_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* per-weight check

* style

* line formats

---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: haileyschoelkopf <haileyschoelkopf@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e9310363

More `token` things (#25146) · 0c790ddb

Yih-Dar authored Jul 27, 2023



* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

0c790ddb

Add offload support to Bark (#25037) · 0b92ae34

Yoach Lacombe authored Jul 27, 2023



* initial Bark offload proposal

* use hooks instead of manually offloading

* add test of bark offload to cpu feature

* Apply nit suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docstrings of offload
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* remove unecessary set_seed in Bark tests

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

0b92ae34

[`MptConfig`] support from pretrained args (#25116) · 9cea3e7b

Arthur authored Jul 27, 2023



* support from pretrained args

* draft addition of tests

* update test

* use parrent assert true

* Update src/transformers/models/mpt/configuration_mpt.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

9cea3e7b