Commits · 68c92981ff2b804979d2e6107eeefe298d1e5183 · chenpangpang / transformers

26 Jun, 2023 15 commits
- Fix link in utils (#24501) · 68c92981
  Gema Parreño authored Jun 26, 2023
```
* fix link

* new link

---------
Co-authored-by: Gema <gema@mbp-de-gema-2.lan>
```
  68c92981
- Compute `dropout_probability` only in training mode (SpeechT5) (#24498) · 7b4e3b5b
  Yih-Dar authored Jun 26, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  7b4e3b5b
- Fix 'local_rank' AttributeError in Trainer class (#24297) · c9fd4985
  Tomoko Uchida authored Jun 27, 2023
```
fix attribute error
```
  c9fd4985
- Compute `dropout_probability` only in training mode (#24486) · 850cf4af
  Yih-Dar authored Jun 26, 2023
```
* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  850cf4af
- [`InstructBlip`] Add accelerate support for instructblip (#24488) · 9895670e
  Younes Belkada authored Jun 26, 2023
```
* add accelerate support for instructblip

* add `_keep_in_fp32_modules`

* dynamically adapt `_no_split_modules`

* better fix

* same logic for `_keep_in_fp32_modules`
```
  9895670e
- Add support for `for` loops in Python interpreter (#24429) · 57579238
  Sylvain Gugger authored Jun 26, 2023
```
Add support for for loops
```
  57579238
- Update token_classification.md (#24484) · c2aa5e17
  condor-cp authored Jun 26, 2023
```
Add link to PyTorch CrossEntropyLoss so that one understands why '-100' is ignored by the loss function.
```
  c2aa5e17
- Update `InstructBlipModelIntegrationTest` (#24490) · 3ca02223
  Yih-Dar authored Jun 26, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  3ca02223
- deepspeed z1/z2 state dict fix (#24489) · 195a9e5b
  Sourab Mangrulkar authored Jun 26, 2023
```
* deepspeed z2/z1 state_dict bloating fix

* update

* version check
```
  195a9e5b
- When resuming from a PEFT checkpoint, the model should be trainable (#24463) · c8aff1d3
  Wang, Yi authored Jun 26, 2023
  
  c8aff1d3
- [`pipeline`] Fix str device issue (#24396) · 914289ac
  Younes Belkada authored Jun 26, 2023
```
* fix str device issue

* fixup

* adapt from suggestions

* forward contrib credits from suggestions

* better fix

* added backward compatibility for older PT versions

* final fixes

* oops

* Attempting something with less branching.

---------
Co-authored-by: amyeroberts <amyeroberts@users.noreply.github.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
```
  914289ac
- Update AlbertModel type annotation (#24450) · 892399c5
  amyeroberts authored Jun 26, 2023
```
Update type annotation
```
  892399c5
- Fix tpu_metrics_debug (#24452) · be2d9f2e
  Meghan Cowan authored Jun 26, 2023
```
fix for tpu metrics debugs string
```
  be2d9f2e
- add missing alignment_heads to Whisper integration test (#24487) · 3b84d86b
  Matthijs Hollemans authored Jun 26, 2023
```
add missing alignment heads
```
  3b84d86b
- Add InstructBLIP (#23460) · 868363ab
  NielsRogge authored Jun 26, 2023
```
* Squash 88 commits

* Use markdown

* Remove mdx files due to bad rebase

* Fix modeling files due to bad rebase

* Fix style

* Update comment

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  868363ab
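As a side note on the token_classification.md update (#24484) above: `-100` is the documented default `ignore_index` of PyTorch's `CrossEntropyLoss`, so positions labeled `-100` are skipped and the mean is taken over the remaining ones. The following is a minimal pure-Python sketch of that behavior, not PyTorch's implementation; the function name `cross_entropy` and the choice to raise on an all-ignored batch are assumptions made for illustration.

```python
import math

IGNORE_INDEX = -100  # default `ignore_index` of torch.nn.CrossEntropyLoss


def cross_entropy(logits, labels, ignore_index=IGNORE_INDEX):
    """Mean cross-entropy over positions whose label is not `ignore_index`."""
    losses = []
    for row, label in zip(logits, labels):
        if label == ignore_index:
            continue  # masked position: contributes nothing to the loss
        # log-sum-exp with the max subtracted for numerical stability
        peak = max(row)
        log_norm = peak + math.log(sum(math.exp(x - peak) for x in row))
        losses.append(log_norm - row[label])
    if not losses:
        # Illustrative choice only; this sketch refuses an all-masked batch.
        raise ValueError("every label is ignored, so the mean is undefined")
    return sum(losses) / len(losses)


logits = [[2.0, 0.0], [0.0, 2.0], [5.0, -5.0]]
labels = [0, 1, -100]  # the third position is masked
print(cross_entropy(logits, labels))
```

In this sketch, a position labeled `-100` leaves the result unchanged, as if it were not in the batch at all.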
23 Jun, 2023 11 commits

Improved keras imports (#24448) · 8e164c54

Matt authored Jun 23, 2023

* An end to accursed version-specific imports

* No more K.is_keras_tensor() either

* Update dependency tables

* Use a cleaner call context function getter

* Add a cap to <2.14

* Add cap to examples requirements too

8e164c54

Update `JukeboxConfig.from_pretrained` (#24443) · 1e9da2b0
Yih-Dar authored Jun 23, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
1e9da2b0

Allow dict input for audio classification pipeline (#23445) · 8767958f

Sanchit Gandhi authored Jun 23, 2023

* Allow dict input for audio classification pipeline

* make style

* Empty commit to trigger CI

* Empty commit to trigger CI

* check for torchaudio

* add pip instructions
Co-authored-by: Sylvain <sylvain.gugger@gmail.com>

* Update src/transformers/pipelines/audio_classification.py
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* asr -> audio class

* asr -> audio class

---------
Co-authored-by: Sylvain <sylvain.gugger@gmail.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

8767958f

fixes issue when saving fsdp via accelerate's FSDP plugin (#24446) · a6f37f88
Sourab Mangrulkar authored Jun 23, 2023

a6f37f88

Fix some `TFWhisperModelIntegrationTests` (#24428) · 2898fd39

Yih-Dar authored Jun 23, 2023

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

2898fd39

Fix typo (#24440) · 5e9f6752
Moon Gi Cho authored Jun 23, 2023

5e9f6752

Replace python random with torch.rand to enable dynamo.export (#24434) · a28325e2

Bowen Bao authored Jun 23, 2023

* Replace python random with torch.rand to enable dynamo.export

* revert changes to flax model code

* Remove unused random import

* Fix torch template

* Move torch.manual_seed(0) to right location

a28325e2

fix the grad_acc issue at epoch boundaries (#24415) · c036c814

Sourab Mangrulkar authored Jun 23, 2023

* fix the grad_acc issue at epoch boundaries
Co-Authored-By: Zach Mueller <7831895+muellerzr@users.noreply.github.com>

* add contributors.

Co-authored-by: sumpster

* address comments

---------
Co-authored-by: Zach Mueller <7831895+muellerzr@users.noreply.github.com>

c036c814
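The commit message for #24415 does not say what the issue was. A common pitfall with gradient accumulation is that the number of batches in an epoch is not a multiple of the accumulation steps, so the last few batches never trigger an optimizer step. The sketch below shows one generic way to handle that, and is not necessarily what the PR does; the function name `optimizer_step_indices` and its parameters are hypothetical.

```python
def optimizer_step_indices(num_batches, accumulation_steps):
    """Return the 0-based batch indices after which the optimizer would step."""
    steps = []
    for i in range(num_batches):
        # Step when a full accumulation window has been collected ...
        window_full = (i + 1) % accumulation_steps == 0
        # ... or when the epoch ends with a partially filled window.
        last_batch = i + 1 == num_batches
        if window_full or last_batch:
            steps.append(i)
    return steps


print(optimizer_step_indices(10, 4))
```

With 10 batches and 4 accumulation steps, the extra step at the final batch flushes the two leftover batches instead of carrying their gradients into the next epoch.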

[`Trainer`] Fix `.to` call on 4bit models (#24444) · 468aed39
Younes Belkada authored Jun 23, 2023
```
* fix `.to` call on 4bit models

* better check
```
468aed39

[AutoModel] Add AutoModelForTextEncoding (#24305) · ea91c2ad

Sanchit Gandhi authored Jun 23, 2023

* [AutoModel] Add AutoModelForTextEncoding

* add mt5

* add other models

* add to docs

* fix tf imports

* add tf to docs / init

* up

* fix inits

* add to dummy objects

ea91c2ad

[llama] Fix comments in weights converter (#24436) · feb83521
Weiming Zhao authored Jun 22, 2023
```
Explain the reason to clone tensor
```
feb83521

22 Jun, 2023 11 commits

Save `site-packages` as cache in CircleCI job (#24424) · 2c977e4a

Yih-Dar authored Jun 22, 2023

* fix

* fix

* Upgrade complete!

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

2c977e4a

Clarify batch size displayed when using DataParallel (#24430) · 2834c17a
Sylvain Gugger authored Jun 22, 2023

2834c17a

Refactor hyperparameter search backends (#24384) · b6295b26

Alex Hall authored Jun 22, 2023

* Refactor hyperparameter search backends

* Simpler refactoring without abstract base class

* black

* review comments: specify name in class, use methods instead of callable class attributes, name constant better

* review comments: safer bool checking, log multiple available backends

* test ALL_HYPERPARAMETER_SEARCH_BACKENDS vs HPSearchBackend in unit test, not module. format with black.

* copyright

b6295b26

TF CI fix for Segformer (#24426) · a1c4b630
Matt authored Jun 22, 2023
```
Fix segformer so compilation can figure out the channel dim
```
a1c4b630

Update RayTune doc link for Hyperparameter tuning (#24422) · 754f61ca

Josh authored Jun 22, 2023

Update outdated hyperlink hpo_train.md

Link to RayTune search space API docs was outdated - have provided correct new link for docs.
Co-authored-by: Joshua Samuel <66880119+Joshsamuel101@users.noreply.github.com>

754f61ca

Fix `save_cache` version in `config.yml` (#24419) · 8f2ef52f
Yih-Dar authored Jun 22, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
8f2ef52f

Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) · 3ce3385c
Younes Belkada authored Jun 22, 2023
```
Revert "Fix gradient checkpointing + fp16 autocast for most models (#24247)"

This reverts commit 285a4801.
```
3ce3385c

[`bnb`] Fix bnb serialization issue with new release (#24416) · ebb62e88
Younes Belkada authored Jun 22, 2023
```
* fix bnb issue

* fixup

* revert and do simple patching instead

* add more details
```
ebb62e88

Skip `test_conditional_generation_pt_pix2struct` in Past CI (torch < 1.11) (#24417) · 652ece07
Yih-Dar authored Jun 22, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
652ece07

TF safetensors reduced mem usage (#24404) · 22fe73c3

Matt authored Jun 22, 2023

* Slight comment cleanup

* Reduce peak mem usage when loading TF-format safetensor weights

* Tweak the PyTorch loading code to support lazy loading from safetensors

* Pass safe_open objects to the PyTorch loading function

* Do GPU transposes for speed

* One more tweak to reduce peak usage further

* One-line hasattr

* Fix bug when there's a shape mismatch

* Rename state_dict in the loading code to be clearer

* Use TF format everywhere for consistency

22fe73c3

[ASR pipeline] Check for torchaudio (#23953) · 7e03e469

Sanchit Gandhi authored Jun 22, 2023

* [ASR pipeline] Check for torchaudio

* add pip instructions
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

---------
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

7e03e469

21 Jun, 2023 3 commits

Explicit arguments in `from_pretrained` (#24306) · 6ce6d62b

Yih-Dar authored Jun 21, 2023

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

6ce6d62b

Remove redundant code from TrainingArgs (#24401) · 127e81c2
Zach Mueller authored Jun 21, 2023
```
Remove redundant code
```
127e81c2

add word-level timestamps to Whisper (#23205) · cd927a47

Matthijs Hollemans authored Jun 21, 2023

* let's go!

* initial implementation of token-level timestamps

* only return a single timestamp per token

* remove token probabilities

* fix return type

* fix doc comment

* strip special tokens

* rename

* revert to not stripping special tokens

* only support models that have alignment_heads

* add integration test

* consistently name it token-level timestamps

* small DTW tweak

* initial support for ASR pipeline

* fix pipeline doc comments

* resolve token timestamps in pipeline with chunking

* change warning when no final timestamp is found

* return word-level timestamps

* fixup

* fix bug that skipped final word in each chunk

* fix failing unit tests

* merge punctuations into the words

* also return word tokens

* also return token indices

* add (failing) unit test for combine_tokens_into_words

* make combine_tokens_into_words private

* restore OpenAI's punctuation rules

* add pipeline tests

* make requested changes

* PR review changes

* fix failing pipeline test

* small stuff from PR

* only return words and their timestamps, not segments

* move alignment_heads into generation config

* forgot to set alignment_heads in pipeline tests

* tiny comment fix

* grr

cd927a47