- 04 Aug, 2023 1 commit
Sanchit Gandhi authored

- 13 Jul, 2023 1 commit
Liyang90 authored
* Update modeling_llama.py: remove the unnecessary `device=device` argument
* Apply the fix to all occurrences of `_make_causal_mask`

- 11 Jul, 2023 1 commit
Joao Gante authored

- 07 Jul, 2023 1 commit
Joao Gante authored

- 29 Jun, 2023 1 commit
MS Kim(tony9402) authored
* fix annotations
* fix copies

- 27 Jun, 2023 1 commit
Sylvain Gugger authored
* Preliminary work on some models
* Fix test load missing and make sure non-persistent buffers are tested
* Always ignore non-persistent buffers if in state_dict
* Treat models
* More models
* Treat remaining models
* Fix quality
* Fix tests
* Remove draft
* This test is not needed anymore
* Fix copies
* Fix last test
* Newly added models
* Fix last tests
* Address review comments
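The non-persistent-buffer behavior this commit tests can be sketched with a toy module (the module and buffer names are illustrative, not the repository's):

```python
import torch
import torch.nn as nn

class WithPositionIds(nn.Module):
    """Toy module (hypothetical) illustrating a non-persistent buffer."""

    def __init__(self):
        super().__init__()
        # persistent=False keeps the buffer out of state_dict, so a checkpoint
        # saved without it never reports the key as missing on load.
        self.register_buffer("position_ids", torch.arange(16), persistent=False)

module = WithPositionIds()
print("position_ids" in module.state_dict())  # prints: False
```

Because the buffer never enters `state_dict`, loading an older checkpoint cannot flag it as a missing key, which is the behavior the tests above lock in.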

- 26 Jun, 2023 1 commit
Yih-Dar authored
* fix (applied over several iterations)
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

- 23 Jun, 2023 1 commit
Bowen Bao authored
* Replace Python random with torch.rand to enable dynamo.export
* Revert changes to Flax model code
* Remove unused random import
* Fix torch template
* Move torch.manual_seed(0) to the right location
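A minimal sketch of why the swap matters, using a hypothetical dropout-style helper: a Python-level `random.random()` call executes once at trace time and is baked into the exported graph as a constant, whereas `torch.rand` stays a graph-level op that `torch._dynamo.export` can capture:

```python
import torch

def random_keep_mask(shape, keep_prob: float) -> torch.Tensor:
    # torch.rand is a tensor op, so tracing tools (e.g. torch._dynamo.export)
    # record the randomness in the graph instead of freezing a single
    # Python-level random draw as a constant.
    return (torch.rand(shape) < keep_prob).to(torch.float32)

mask = random_keep_mask((4,), keep_prob=1.0)  # all ones: rand() is in [0, 1)
```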

- 22 Jun, 2023 1 commit
Younes Belkada authored
Revert "Fix gradient checkpointing + fp16 autocast for most models (#24247)"
This reverts commit 285a4801.

- 21 Jun, 2023 2 commits
Matthijs Hollemans authored
* let's go!
* initial implementation of token-level timestamps
* only return a single timestamp per token
* remove token probabilities
* fix return type
* fix doc comment
* strip special tokens
* rename
* revert to not stripping special tokens
* only support models that have alignment_heads
* add integration test
* consistently name it token-level timestamps
* small DTW tweak
* initial support for ASR pipeline
* fix pipeline doc comments
* resolve token timestamps in pipeline with chunking
* change warning when no final timestamp is found
* return word-level timestamps
* fixup
* fix bug that skipped final word in each chunk
* fix failing unit tests
* merge punctuations into the words
* also return word tokens
* also return token indices
* add (failing) unit test for combine_tokens_into_words
* make combine_tokens_into_words private
* restore OpenAI's punctuation rules
* add pipeline tests
* make requested changes
* PR review changes
* fix failing pipeline test
* small stuff from PR
* only return words and their timestamps, not segments
* move alignment_heads into generation config
* forgot to set alignment_heads in pipeline tests
* tiny comment fix
* grr
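The alignment idea behind token-level timestamps can be illustrated with a bare-bones dynamic time warping (DTW) cost, here over plain number sequences rather than the cross-attention weights the model actually aligns (a simplified sketch, not the repository's implementation):

```python
import math

def dtw_cost(a, b):
    """Minimal DTW cost between two 1-D sequences.

    Token-level timestamps align decoder cross-attention patterns to
    audio frames with DTW; this sketch shows only the core recurrence.
    """
    n, m = len(a), len(b)
    # acc[i][j] = best accumulated cost aligning a[:i] with b[:j]
    acc = [[math.inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            # match, insertion, or deletion -- take the cheapest path
            acc[i][j] = step + min(acc[i - 1][j], acc[i][j - 1], acc[i - 1][j - 1])
    return acc[n][m]
```

Identical sequences align at zero cost, and a repeated element can be absorbed by the warping path, which is what lets a token span several audio frames.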

Younes Belkada authored
* fix gc bug
* continue PoC on OPT
* fixes
* :exploding_head:
* fix tests
* remove pytest.mark
* fixup
* forward contrib credits from discussions
* reverting changes on untouched files
Co-authored-by: zhaoqf123 <zhaoqf123@users.noreply.github.com>
Co-authored-by: 7eu7d7 <7eu7d7@users.noreply.github.com>

- 13 Jun, 2023 1 commit
Sylvain Gugger authored
* First test
* Add info for all models
* style
* Repo consistency
* Fix last model and cleanup prints
* Use consistent function for detecting tied weights

- 12 Jun, 2023 1 commit
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

- 08 Jun, 2023 1 commit
Sadra Barikbin authored

- 24 May, 2023 1 commit
Connor Henderson authored
move text_prompt_ids trimming to top

- 19 May, 2023 1 commit
Connor Henderson authored
* initial working additions
* clean and rename, add cond stripping initial prompt to decode
* cleanup, edit create_initial_prompt_ids, add tests
* repo consistency, flip order of conditional
* fix error, move the processor fn to the tokenizer
* repo consistency, update test ids to corresponding tokenizer
* use convert_tokens_to_ids not get_vocab...
* use actual conditional in generate
* make style
* initial address comments
* initial working add new params to pipeline
* first draft of sequential generation for condition_on_previous_text
* add/update tests, make compatible with timestamps
* make compatible with diff. input kwargs and max length
* add None check
* add temperature check
* flip temp check operand
* refocusing to prev pr scope
* remove the params too
* make style
* edits, move max length incorporating prompt to whisper
* address comments
* remove asr pipeline prompt decoding, fix indexing
* address comments (more tests, validate prompt)
* un-comment out tests (from debug)
* remove old comment
* address comments
* fix typo
* remove timestamp token from test
* make style
* cleanup
* copy method to fast tokenizer, set max_new_tokens for test
* prompt_ids type just pt
* address Amy's comments
* make style
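One piece of the prompt flow can be sketched as a hypothetical helper (the name `strip_prompt` is illustrative, not from the repository): after generating with `prompt_ids` prepended for conditioning, the prompt tokens are trimmed from the front of the output so only newly generated tokens are returned:

```python
def strip_prompt(generated_ids, prompt_ids):
    # Hypothetical helper: drop the conditioning prompt tokens from the
    # front of the generated sequence, if they are present.
    prompt_ids = list(prompt_ids)
    if generated_ids[: len(prompt_ids)] == prompt_ids:
        return generated_ids[len(prompt_ids):]
    return generated_ids
```

This is also why the maximum length has to account for the prompt: the prompt occupies decoder positions even though it is stripped from the final output.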

- 05 May, 2023 2 commits
Connor Henderson authored
* add fix
* address comments
* remove error formatting

Andrei Filatov authored

- 14 Apr, 2023 1 commit
oscar-garzon authored

- 04 Apr, 2023 1 commit
Sourab Mangrulkar authored

- 28 Mar, 2023 1 commit
Jeff Rasley authored
* ensure causal_mask is created directly on device
* add copy tag to opt, update bart implementation
* add device to all _make_causal_mask copies
* formatting fixes
* more manual fixes due to unlinked versions of _prepare_decoder_attention_mask
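A sketch of the idea, assuming a BART/Whisper-style additive mask where allowed positions are 0 and blocked positions are the dtype minimum; every intermediate tensor is allocated on the target device, so no host-to-device copy is needed after construction:

```python
import torch

def make_causal_mask(seq_len: int, dtype: torch.dtype, device: torch.device):
    # Allocate directly on the target device; tensors derived from these
    # inherit the device, so no extra device= kwargs are needed downstream.
    mask = torch.full((seq_len, seq_len), torch.finfo(dtype).min, device=device)
    cond = torch.arange(seq_len, device=device)
    # Position i may attend to positions j <= i: zero the lower triangle.
    mask.masked_fill_(cond < (cond + 1).view(seq_len, 1), 0)
    return mask.to(dtype)

mask = make_causal_mask(4, torch.float32, torch.device("cpu"))
```

Building the mask on-device avoids a synchronizing CPU-to-GPU transfer on every forward pass, which was the point of the change.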

- 14 Mar, 2023 1 commit
Arthur authored
* temp fix
* temporary fix
* update
* fix tests
* fixup
* update based on review
* update to fix tests
* update docstring
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

- 13 Mar, 2023 1 commit
Younes Belkada authored
* add `get_input_embeddings` to `WhisperForAudioClassification`
* add common tests
* fix another common test
* Update tests/models/whisper/test_modeling_whisper.py
* fix style
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
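The accessor pattern the common tests exercise can be shown with a toy module (hypothetical, not the actual `WhisperForAudioClassification`):

```python
import torch.nn as nn

class TinyAudioClassifier(nn.Module):
    """Toy model exposing the get/set input-embeddings accessor pair
    that the shared model tests expect (illustrative names only)."""

    def __init__(self, vocab_size: int = 32, hidden: int = 8):
        super().__init__()
        self.embed_tokens = nn.Embedding(vocab_size, hidden)

    def get_input_embeddings(self):
        return self.embed_tokens

    def set_input_embeddings(self, new_embeddings):
        self.embed_tokens = new_embeddings

model = TinyAudioClassifier()
```

Exposing both accessors lets shared machinery (embedding resizing, common tests) treat every model uniformly.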

- 11 Mar, 2023 1 commit
Sanchit Gandhi authored
* [Whisper] Remove embed_tokens from encoder docstring
* new line to retrigger CI
* remove new line

- 08 Mar, 2023 1 commit
Somasree Majumder authored
* fixing
* Update modeling_whisper.py
* Update src/transformers/models/whisper/modeling_whisper.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

- 07 Mar, 2023 1 commit
Sanchit Gandhi authored
* [Whisper] Add model for audio classification
* make fix-copies
* add to docs
* add docstring
* empty returns
* add code example
* switch to fleurs
* stick everything on one line

- 02 Mar, 2023 1 commit
Kashif Rasul authored

- 01 Mar, 2023 1 commit
raghavanone authored
* Change the .view call to .reshape
* Change the .view call to .reshape in all the copies of BART attention
* Fix copies and style
* Revert unnecessary changes
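The reason for the swap: `view` requires contiguous storage and raises a `RuntimeError` otherwise, while `reshape` returns a view when it can and silently falls back to a copy when it cannot. A minimal demonstration:

```python
import torch

x = torch.arange(6).view(2, 3).t()  # transpose makes the tensor non-contiguous
assert not x.is_contiguous()

# x.view(6) would raise RuntimeError here because the storage layout no
# longer matches a flat row-major walk; reshape() copies instead.
flat = x.reshape(6)
print(flat.tolist())  # [0, 3, 1, 4, 2, 5]
```

Attention implementations frequently transpose head and sequence dimensions, so `reshape` is the safe choice after such permutations.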

- 24 Feb, 2023 1 commit
bofeng huang authored
* Return and rescale attention_mask
* Add SpecAugment to Whisper modeling
* Fix test
* Update docstring
* Add SpecAug related parameters to model config
* Add the _mask_input_features function to doc
* Fix quality
* Apply suggestions from code review
* Remove dev comments
* Add test
* Resolve conflict
* feat: mask {feature, time} prob fast tests
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
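A simplified SpecAugment-style time mask, with illustrative parameter names (not the actual config fields): random spans along the time axis of the input features are zeroed out during training as a regularizer.

```python
import torch

def mask_time_spans(features: torch.Tensor, num_masks: int = 2, span: int = 5):
    # features: (batch, feature_dim, time). Zero out `num_masks` random
    # spans along the time axis -- a toy SpecAugment-style time mask.
    batch, _, time = features.shape
    out = features.clone()
    for _ in range(num_masks):
        start = int(torch.randint(0, time - span + 1, (1,)))
        out[:, :, start:start + span] = 0.0
    return out

masked = mask_time_spans(torch.ones(1, 4, 50))
```

Feature masking works the same way along the `feature_dim` axis; together the two give the `{feature, time}` masking probabilities the commit mentions.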

- 16 Feb, 2023 2 commits
Jonatas Grosman authored
fix bug in reshaping labels

Xiaoyang Chen authored
* Update document of WhisperDecoderLayer
* Update modeling_mbart.py
* Update doc with utils/check_copies.py --fix_and_overwrite
* Update modeling_xlm_prophetnet.py

- 07 Feb, 2023 1 commit
Arthur authored
* fix past renamed to past_key_value
* update more `past` that were skipped
* fixup
* remove changes made to rag
* refactor `_reorder_cache` to use `past_key_values`
* fix git `prepare_inputs_for_generation` to pass tests when False is needed in use_cache

- 06 Feb, 2023 1 commit
Sylvain Gugger authored
* Result of black 23.1
* Update target to Python 3.7
* Switch flake8 to ruff
* Configure isort
* Apply isort with line limit
* Put the right black version
* adapt black in check copies
* Fix copies

- 25 Jan, 2023 1 commit
Arthur authored
* update whisper logit processor
* add generate for whisper
* remove part of the whisper specific code from pipeline
* update logit processes
* major update
* enforce first timestamp
* update generate
* add more tests
* update new decoding strategy
* Apply suggestions from code review
* update docstring
* fixup
* default config will not have multilingual ar
* update expected tokenizer size, see pull on the hub for whisper-tiny
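The "enforce first timestamp" step can be sketched as a logits mask, assuming the Whisper-style convention that token ids at or above `timestamp_begin` are timestamp tokens (a simplified sketch, not the actual logits processor):

```python
import torch

def force_timestamp_at_start(scores: torch.Tensor, timestamp_begin: int):
    # At the first decoding step, suppress every non-timestamp token so the
    # transcription is forced to open with a timestamp token.
    scores = scores.clone()
    scores[..., :timestamp_begin] = float("-inf")
    return scores

scores = force_timestamp_at_start(torch.zeros(1, 12), timestamp_begin=8)
```

With all non-timestamp logits at negative infinity, any sampling or greedy strategy can only pick a timestamp token at step one.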

- 23 Jan, 2023 1 commit
Sylvain Gugger authored
* Clean all models
* Style
* Last to remove
* Address review comments

- 08 Jan, 2023 1 commit
Arthur authored
* start cleanup
* more updates
* more models are affected
* update generation utils
* style
* revert change that removed reorder cache
* remove reorder cache

- 20 Dec, 2022 1 commit
Sanchit Gandhi authored
* [S2T, Whisper] Add copied from statements
* rebase and fix-copies

- 06 Dec, 2022 1 commit
Sourab Mangrulkar authored
* updating T5 and BART models to support Prefix Tuning
* `make fix-copies`
* address comments

- 05 Dec, 2022 1 commit
Arthur authored
* Fix Whisper and Speech-to-Text docs. Previously the documentation was badly indented for both models and stated that:
  > If `decoder_input_ids` and `decoder_inputs_embeds` are both unset, `decoder_inputs_embeds` takes the value of `inputs_embeds`.
  This is only valid for the forward pass of `ForConditionalGeneration`, not for the model alone.
* other fixes

- 30 Nov, 2022 1 commit
Yih-Dar authored
* remove truncation
* For TFWhisper
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>