Commits · fd6735102abcc560cb2b68523b3f5012da54a956 · chenpangpang / transformers

28 Jun, 2023 7 commits

Make PT/Flax tests could be run on GPU (#24557) · fd673510
Yih-Dar authored Jun 28, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
fd673510
[`InstructBlip`] Add instruct blip int8 test (#24555) · 33b5ef5c
Younes Belkada authored Jun 28, 2023
```
* add 8bit instructblip test

* update tests
```
33b5ef5c
[`gpt2-int8`] Add gpt2-xl int8 test (#24543) · 903b97d8
Younes Belkada authored Jun 28, 2023
```
add gpt2-xl test
```
903b97d8

Update `EncodecIntegrationTest` (#24553) · b0651655

Yih-Dar authored Jun 28, 2023



* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

b0651655

⚠️ Time to say goodbye to py37 (#24091) · e84bf1f7
Yih-Dar authored Jun 28, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
e84bf1f7

Add bitsandbytes support for gpt2 models (#24504) · 12240925

Dario Sučić authored Jun 28, 2023



* Add bitsandbytes support for gpt2 models

* Guard Conv1D import to pass tensorflow test

* Appease ruff linter

* Fix 4bit test and remove int8 test boilerplate

* Update tests/bnb/test_mixed_int8.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

12240925

Finishing tidying keys to ignore on load (#24535) · 89b6ee49
Sylvain Gugger authored Jun 27, 2023

89b6ee49

27 Jun, 2023 4 commits

Clean load keys (#24505) · 8e5d1619

Sylvain Gugger authored Jun 27, 2023

* Preliminary work on some models

* Fix test load missing and make sure nonpersistent buffers are tested

* Always ignore nonpersistent buffers if in state_dict

* Treat models

* More models

* Treat remaining models

* Fix quality

* Fix tests

* Remove draft

* This test is not needed anymore

* Fix copies

* Fix last test

* Newly added models

* Fix last tests

* Address review comments

8e5d1619

[`T5`] Add T5ForQuestionAnswering and MT5ForQuestionAnswering (#24481) · 06910f5a

Sebastian authored Jun 27, 2023



* Adding T5ForQuestionAnswering

* Changed weight initialization that results in better initial loss when fine-tuning

* Update to class variables

* Running make fixup

* Running make fix-copies

* Remove model_parallel

* Adding MT5ForQuestionAnswering

* Adding docs

* Fix wrong doc

* Update src/transformers/models/mt5/modeling_mt5.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/models/t5/modeling_t5.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* File formatting

* Undoing change

---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

06910f5a

Fix TypeError: Object of type int64 is not JSON serializable (#24340) · 239ace15

Xiaoli Wang authored Jun 27, 2023

* Fix TypeError: Object of type int64 is not JSON serializable

* Convert numpy.float64 and numpy.int64 to float and int for json serialization

* Black reformatted examples/pytorch/token-classification/run_ner_no_trainer.py

* * make style

239ace15

Generate: `group_beam_search` requires `diversity_penalty>0.0` (#24456) · 5f3efdf7
Joao Gante authored Jun 27, 2023
```
* add exception

* update docs
```
5f3efdf7

26 Jun, 2023 6 commits

Compute `dropout_probability` only in training mode (#24486) · 850cf4af

Yih-Dar authored Jun 26, 2023



* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

850cf4af

Add support for for loops in python interpreter (#24429) · 57579238
Sylvain Gugger authored Jun 26, 2023
```
Add support for for loops
```
57579238
Update `InstructBlipModelIntegrationTest` (#24490) · 3ca02223
Yih-Dar authored Jun 26, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
3ca02223

[`pipeline`] Fix str device issue (#24396) · 914289ac

Younes Belkada authored Jun 26, 2023



* fix str device issue

* fixup

* adapt from suggestions

* forward contrib credits from suggestions

* better fix

* added backward compatibility for older PT versions

* final fixes

* oops

* Attempting something with less branching.

---------
Co-authored-by: amyeroberts <amyeroberts@users.noreply.github.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

914289ac

add missing alignment_heads to Whisper integration test (#24487) · 3b84d86b
Matthijs Hollemans authored Jun 26, 2023
```
add missing alignment heads
```
3b84d86b

Add InstructBLIP (#23460) · 868363ab

NielsRogge authored Jun 26, 2023



* Squash 88 commits

* Use markdown

* Remove mdx files due to bad rebase

* Fix modeling files due to bad rebase

* Fix style

* Update comment

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

868363ab

23 Jun, 2023 3 commits

Allow dict input for audio classification pipeline (#23445) · 8767958f

Sanchit Gandhi authored Jun 23, 2023



* Allow dict input for audio classification pipeline

* make style

* Empty commit to trigger CI

* Empty commit to trigger CI

* check for torchaudio

* add pip instructions
Co-authored-by: Sylvain <sylvain.gugger@gmail.com>

* Update src/transformers/pipelines/audio_classification.py
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* asr -> audio class

* asr -> audio class

---------
Co-authored-by: Sylvain <sylvain.gugger@gmail.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

8767958f

Fix some `TFWhisperModelIntegrationTests` (#24428) · 2898fd39

Yih-Dar authored Jun 23, 2023



* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* fix

* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/whisper/modeling_tf_whisper.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

2898fd39

Replace python random with torch.rand to enable dynamo.export (#24434) · a28325e2

Bowen Bao authored Jun 23, 2023

* Replace python random with torch.rand to enable dynamo.export

* revert changes to flax model code

* Remove unused random import

* Fix torch template

* Move torch.manual_seed(0) to right location

a28325e2

22 Jun, 2023 3 commits

Refactor hyperparameter search backends (#24384) · b6295b26

Alex Hall authored Jun 22, 2023

* Refactor hyperparameter search backends

* Simpler refactoring without abstract base class

* black

* review comments:
specify name in class
use methods instead of callable class attributes
name constant better

* review comments: safer bool checking, log multiple available backends

* test ALL_HYPERPARAMETER_SEARCH_BACKENDS vs HPSearchBackend in unit test, not module. format with black.

* copyright

b6295b26

Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) · 3ce3385c
Younes Belkada authored Jun 22, 2023
```
Revert "Fix gradient checkpointing + fp16 autocast for most models (#24247)"

This reverts commit 285a4801.
```
3ce3385c
Skip `test_conditional_generation_pt_pix2struct` in Past CI (torch < 1.11) (#24417) · 652ece07
Yih-Dar authored Jun 22, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
652ece07

21 Jun, 2023 3 commits

add word-level timestamps to Whisper (#23205) · cd927a47

Matthijs Hollemans authored Jun 21, 2023

* let's go!

* initial implementation of token-level timestamps

* only return a single timestamp per token

* remove token probabilities

* fix return type

* fix doc comment

* strip special tokens

* rename

* revert to not stripping special tokens

* only support models that have alignment_heads

* add integration test

* consistently name it token-level timestamps

* small DTW tweak

* initial support for ASR pipeline

* fix pipeline doc comments

* resolve token timestamps in pipeline with chunking

* change warning when no final timestamp is found

* return word-level timestamps

* fixup

* fix bug that skipped final word in each chunk

* fix failing unit tests

* merge punctuations into the words

* also return word tokens

* also return token indices

* add (failing) unit test for combine_tokens_into_words

* make combine_tokens_into_words private

* restore OpenAI's punctuation rules

* add pipeline tests

* make requested changes

* PR review changes

* fix failing pipeline test

* small stuff from PR

* only return words and their timestamps, not segments

* move alignment_heads into generation config

* forgot to set alignment_heads in pipeline tests

* tiny comment fix

* grr

cd927a47

Fix gradient checkpointing + fp16 autocast for most models (#24247) · 285a4801

Younes Belkada authored Jun 21, 2023



* fix gc bug

* continue PoC on OPT

* fixes

* :exploding_head:

* fix tests

* remove pytest.mark

* fixup

* forward contrib credits from discussions

* forward contrib credits from discussions

* reverting changes on untouched files.

---------
Co-authored-by: zhaoqf123 <zhaoqf123@users.noreply.github.com>
Co-authored-by: 7eu7d7 <7eu7d7@users.noreply.github.com>

285a4801

Generate: add SequenceBiasLogitsProcessor (#24334) · 5f0801d1
Joao Gante authored Jun 21, 2023

5f0801d1

20 Jun, 2023 10 commits

Migrate doc files to Markdown. (#24376) · eb849f66

Sylvain Gugger authored Jun 20, 2023



* Rename index.mdx to index.md

* With saved modifs

* Address review comment

* Treat all files

* .mdx -> .md

* Remove special char

* Update utils/tests_fetcher.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

---------
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

eb849f66

[Wav2Vec2 - MMS] Correct directly loading adapters weights (#24335) · b0513b01

Patrick von Platen authored Jun 20, 2023

* Correct direct lang loading

* correct more

* revert black

* Use tie weights instead=

* add tests

* add tests

* make style

b0513b01

[GPTNeoX] Nit in config (#24349) · e5c760d6

Arthur authored Jun 20, 2023

* add raise value error for attention size

* nits to fix test_config

* style

e5c760d6

Skip a tapas (tokenization) test in past CI (#24378) · 83dc5762
Yih-Dar authored Jun 20, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
83dc5762

Better test name and enable pipeline test for `pix2struct` (#24377) · 297d769d

Yih-Dar authored Jun 20, 2023



* best test name forever

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

297d769d

Add a check in `ImageToTextPipeline._forward` (#24373) · 0527c1c0

Yih-Dar authored Jun 20, 2023



* fix

* fix

* fix

* Update src/transformers/pipelines/image_to_text.py
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

0527c1c0

Rename test to be more accurate (#24374) · dc444991
Sylvain Gugger authored Jun 20, 2023

dc444991
[Whisper] Make tests faster (#24105) · 6c134444
Sanchit Gandhi authored Jun 20, 2023

6c134444
Update tiny models for pipeline testing. (#24364) · c23d131e
Yih-Dar authored Jun 20, 2023
```
* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
c23d131e

TensorFlow CI fixes (#24360) · 56efbf43

Matt authored Jun 20, 2023

* Fix saved_model_creation_extended

* Skip the BLIP model creation test for now

* Fix TF SAM test

* Fix longformer tests

* Fix Wav2Vec2

* Add a skip for XLNet

* make fixup

* make fix-copies

* Add comments

56efbf43

16 Jun, 2023 4 commits

Add test for proper TF input signatures (#24320) · 91389950

Matt authored Jun 16, 2023

* Add test for proper input signatures

* No more signature pruning

* Test the dummy inputs are valid too

* fine-tine -> fine-tune

* Fix indent in test_dataset_conversion

91389950

Tied weights load (#24310) · 096f2cf1

Sylvain Gugger authored Jun 16, 2023

* Use tied weight keys

* More

* Fix tied weight missing warning

* Only give info on unexpected keys with different classes

* Deal with empty archs

* Fix tests

* Refine test

096f2cf1

Big TF test cleanup (#24282) · 34037129

Matt authored Jun 16, 2023

* Fix one BLIP arg not being optional, remove misspelled arg

* Remove the lxmert test overrides and just use the base test_saved_model_creation

* saved_model_creation fixes and re-enabling tests across the board

* Remove unnecessary skip

* Stop caching sinusoidal embeddings in speech_to_text

* Fix transfo_xl compilation

* Fix transfo_xl compilation

* Fix the conditionals in xglm

* Set the save spec only when building

* Clarify comment

* Move comment correctly

* Correct embeddings generation for speech2text

* Mark RAG generation tests as @slow

* Remove redundant else:

* Add comment to clarify the save_spec line in build()

* Fix size tests for XGLM at last!

* make fixup

* Remove one band_part operation

* Mark test_keras_fit as @slow

34037129

Byebye pytorch 1.9 (#24080) · 896a58de

Yih-Dar authored Jun 16, 2023



byebye

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

896a58de