Commits · 38db04ece0870b2f7132c7f2e81b32d489fc7c59 · chenpangpang / transformers

27 Jun, 2023 12 commits

Find module name in an OS-agnostic fashion (#24526) · 38db04ec
Sylvain Gugger authored Jun 27, 2023
```
* Find module name in an OS-agnostic fashion

* address review comment
```
38db04ec
Update `huggingface_hub` commit sha (#24527) · 7d150d68
Yih-Dar authored Jun 27, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
7d150d68
set model to training mode before accelerate.prepare (#24520) · 4e8929dc
Wang, Yi authored Jun 27, 2023

4e8929dc

[`T5`] Add T5ForQuestionAnswering and MT5ForQuestionAnswering (#24481) · 06910f5a

Sebastian authored Jun 27, 2023



* Adding T5ForQuestionAnswering

* Changed weight initialization that results in better initial loss when fine-tuning

* Update to class variables

* Running make fixup

* Running make fix-copies

* Remove model_parallel

* Adding MT5ForQuestionAnswering

* Adding docs

* Fix wrong doc

* Update src/transformers/models/mt5/modeling_mt5.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/models/t5/modeling_t5.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* File formatting

* Undoing change

---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

06910f5a

Update hyperparameter_search.py (#24515) · bcf02ec7
Sourab Mangrulkar authored Jun 27, 2023
```
* Update hyperparameter_search.py

* resolve comments
```
bcf02ec7

use accelerate autocast in jit eval path, since mix precision logic is… (#24460) · 6fe8d198

Wang, Yi authored Jun 27, 2023



use accelerate autocast in jit eval path, since mix precision logic is in accelerator currently
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

6fe8d198

🌐

[i18n-KO] Translated `tflite.mdx` to Korean (#24435) · 0863436b

Hyeonseo Yun authored Jun 27, 2023



* docs: ko: tflite.mdx

* feat: nmt and manual edit `tflite.mdx`

* revised: resolve suggestions tflite.mdx
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

* revised: resolve suggestions and new line tflite.mdx
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
Co-Authored-By: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com>

---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

0863436b

Fix poor past ci (#24485) · 4abd3ee4

Yih-Dar authored Jun 27, 2023



* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

4abd3ee4

Fix TypeError: Object of type int64 is not JSON serializable (#24340) · 239ace15

Xiaoli Wang authored Jun 27, 2023

* Fix TypeError: Object of type int64 is not JSON serializable

* Convert numpy.float64 and numpy.int64 to float and int for json serialization

* Black reformatted examples/pytorch/token-classification/run_ner_no_trainer.py

* * make style

239ace15

Generate: `min_tokens_to_keep` has to be `>= 1` (#24453) · ac19871c
Joao Gante authored Jun 27, 2023

ac19871c
Generate: `group_beam_search` requires `diversity_penalty>0.0` (#24456) · 5f3efdf7
Joao Gante authored Jun 27, 2023
```
* add exception

* update docs
```
5f3efdf7

🚨

Fix group beam search (#24407) · 43479ef9

hukuda222 authored Jun 27, 2023



* group_beam_search now works correctly

* add argument descriptions

* add a comment

* format

* make style

* change comment

* Update src/transformers/generation/beam_search.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

---------
Co-authored-by: shogo.fujita <shogo.fujita@legalontech.jp>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

43479ef9

26 Jun, 2023 15 commits
- Fix link in utils (#24501) · 68c92981
  Gema Parreño authored Jun 26, 2023
```
* fix link

* new link

---------
Co-authored-by: Gema <gema@mbp-de-gema-2.lan>
```
  68c92981
- Compute `dropout_probability` only in training mode (SpeechT5) (#24498) · 7b4e3b5b
  Yih-Dar authored Jun 26, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  7b4e3b5b
- Fix 'local_rank' AttiributeError in Trainer class (#24297) · c9fd4985
  Tomoko Uchida authored Jun 27, 2023
```
fix attribute error
```
  c9fd4985
- Compute `dropout_probability` only in training mode (#24486) · 850cf4af
  Yih-Dar authored Jun 26, 2023
```
* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  850cf4af
- [`InstructBlip`] Add accelerate support for instructblip (#24488) · 9895670e
  Younes Belkada authored Jun 26, 2023
```
* add accelerate support for instructblip

* add `_keep_in_fp32_modules`

* dynamically adapt `_no_split_modules`

* better fix

* same logic for `_keep_in_fp32_modules`
```
  9895670e
- Add support for for loops in python interpreter (#24429) · 57579238
  Sylvain Gugger authored Jun 26, 2023
```
Add support for for loops
```
  57579238
- Update token_classification.md (#24484) · c2aa5e17
  condor-cp authored Jun 26, 2023
```
Add link to pytorch CrossEntropyLoss so that one understand why '-100' is ignore by the loss function.
```
  c2aa5e17
- Update `InstructBlipModelIntegrationTest` (#24490) · 3ca02223
  Yih-Dar authored Jun 26, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  3ca02223
- deepspeed z1/z2 state dict fix (#24489) · 195a9e5b
  Sourab Mangrulkar authored Jun 26, 2023
```
* deepspeed z2/z1 state_dict bloating fix

* update

* version check
```
  195a9e5b
- when resume from peft checkpoint, the model should be trainable (#24463) · c8aff1d3
  Wang, Yi authored Jun 26, 2023
  
  c8aff1d3
- [`pipeline`] Fix str device issue (#24396) · 914289ac
  Younes Belkada authored Jun 26, 2023
```
* fix str device issue

* fixup

* adapt from suggestions

* forward contrib credits from suggestions

* better fix

* added backward compatibility for older PT versions

* final fixes

* oops

* Attempting something with less branching.

---------
Co-authored-by: amyeroberts <amyeroberts@users.noreply.github.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
```
  914289ac
- Update AlbertModel type annotation (#24450) · 892399c5
  amyeroberts authored Jun 26, 2023
```
Update type annotation
```
  892399c5
- Fix tpu_metrics_debug (#24452) · be2d9f2e
  Meghan Cowan authored Jun 26, 2023
```
fix for tpu metrics debugs string
```
  be2d9f2e
- add missing alignment_heads to Whisper integration test (#24487) · 3b84d86b
  Matthijs Hollemans authored Jun 26, 2023
```
add missing alignment heads
```
  3b84d86b
- Add InstructBLIP (#23460) · 868363ab
  NielsRogge authored Jun 26, 2023
```
* Squash 88 commits

* Use markdown

* Remove mdx files due to bad rebase

* Fix modeling files due to bad rebase

* Fix style

* Update comment

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  868363ab
23 Jun, 2023 11 commits

Improved keras imports (#24448) · 8e164c54

Matt authored Jun 23, 2023

* An end to accursed version-specific imports

* No more K.is_keras_tensor() either

* Update dependency tables

* Use a cleaner call context function getter

* Add a cap to <2.14

* Add cap to examples requirements too

8e164c54

Update `JukeboxConfig.from_pretrained` (#24443) · 1e9da2b0
Yih-Dar authored Jun 23, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
1e9da2b0

Allow dict input for audio classification pipeline (#23445) · 8767958f

Sanchit Gandhi authored Jun 23, 2023



* Allow dict input for audio classification pipeline

* make style

* Empty commit to trigger CI

* Empty commit to trigger CI

* check for torchaudio

* add pip instructions
Co-authored-by: Sylvain <sylvain.gugger@gmail.com>

* Update src/transformers/pipelines/audio_classification.py
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* asr -> audio class

* asr -> audio class

---------
Co-authored-by: Sylvain <sylvain.gugger@gmail.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

8767958f

fixes issue when saving fsdp via accelerate's FSDP plugin (#24446) · a6f37f88
Sourab Mangrulkar authored Jun 23, 2023

a6f37f88

Fix some `TFWhisperModelIntegrationTests` (#24428) · 2898fd39

Yih-Dar authored Jun 23, 2023



* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

2898fd39

Fix typo (#24440) · 5e9f6752
Moon Gi Cho authored Jun 23, 2023

5e9f6752

Replace python random with torch.rand to enable dynamo.export (#24434) · a28325e2

Bowen Bao authored Jun 23, 2023

* Replace python random with torch.rand to enable dynamo.export

* revert changes to flax model code

* Remove unused random import

* Fix torch template

* Move torch.manual_seed(0) to right location

a28325e2

fix the grad_acc issue at epoch boundaries (#24415) · c036c814

Sourab Mangrulkar authored Jun 23, 2023



* fix the grad_acc issue at epoch boundaries
Co-Authored-By: Zach Mueller <7831895+muellerzr@users.noreply.github.com>

* add contributors.

Co-authored-by: sumpster

* address comments

---------
Co-authored-by: Zach Mueller <7831895+muellerzr@users.noreply.github.com>

c036c814

[`Trainer`] Fix `.to` call on 4bit models (#24444) · 468aed39
Younes Belkada authored Jun 23, 2023
```
* fix `.to` call on 4bit models

* better check
```
468aed39

[AutoModel] Add AutoModelForTextEncoding (#24305) · ea91c2ad

Sanchit Gandhi authored Jun 23, 2023

* [AutoModel] Add AutoModelForTextEncoding

* add mt5

* add other models

* add to docs

* fix tf imports

* add tf to docs / init

* up

* fix inits

* add to dummy objects

ea91c2ad

[llama] Fix comments in weights converter (#24436) · feb83521
Weiming Zhao authored Jun 22, 2023
```
Explain the reason to clone tensor
```
feb83521

22 Jun, 2023 2 commits
- Save `site-packages` as cache in CircleCI job (#24424) · 2c977e4a
  Yih-Dar authored Jun 22, 2023
```
* fix

* fix

* Upgrade complete!

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  2c977e4a
- Clarify batch size displayed when using DataParallel (#24430) · 2834c17a
  Sylvain Gugger authored Jun 22, 2023
  
  2834c17a