1. 14 Sep, 2023 3 commits
      [Whisper] Fix word-level timestamps for audio < 30 seconds (#25607) · 95fe0f5d
      Joshua Lochner authored
      
      
      * Fix word-level timestamps for audio < 30 seconds
      
      * Fix code quality
      
      * fix unit tests
      
      * Fix unit tests
      
      * Fix unit test
      
      * temp: print out result
      
      * temp: set max diff to None
      
      * fix unit tests
      
      * fix typo
      
      * Fix typo
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Use generation config for `num_frames`
      
      * fix docs
      
      * Move `num_frames` to kwargs
      
      * compute stride/attn_mask once
      
      * mark test as slow
      
      ---------
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
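The core idea of the fix above can be sketched as follows. This is a hedged, self-contained illustration, not the transformers implementation: the function and variable names are made up for clarity. Word-level timestamps come from cross-attention over encoder frames, and Whisper always pads audio to a 30-second window, so for shorter clips the attention columns over padding must be cropped to the real `num_frames` before alignment, or every timestamp gets stretched toward 30 seconds.

```python
# Illustrative sketch only: `crop_attention` and `frame_to_seconds` are
# hypothetical helpers, not part of the transformers API.

def crop_attention(cross_attention, num_frames):
    """Keep only the attention columns that correspond to real audio."""
    return [row[:num_frames] for row in cross_attention]

def frame_to_seconds(frame_index, frames_per_second=50):
    # Whisper's encoder emits 20 ms frames, i.e. 50 frames per second.
    return frame_index / frames_per_second

# 3 tokens over a padded 30 s window (3000 frames); the real audio is 10 s.
attn = [[0.0] * 3000 for _ in range(3)]
attn[0][10] = 1.0  # token 0 peaks at frame 10
cropped = crop_attention(attn, num_frames=500)
peak = max(range(500), key=cropped[0].__getitem__)
print(frame_to_seconds(peak))  # 0.2
```

Without the crop, alignment over the padded columns can pull token peaks into the silent region, which is exactly the <30 s bug this commit addresses.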
      [MusicGen] Add sampling rate to config (#26136) · 44a0490d
      Sanchit Gandhi authored
      
      
      * [MusicGen] Add sampling rate to config
      
      * remove tiny
      
      * make property
      
      * Update tests/pipelines/test_pipelines_text_to_audio.py
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * style
      
      ---------
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
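The "make property" step above can be sketched as follows. This is a minimal illustration under assumed names (the classes below are stand-ins, not the real MusicGen config classes): a composite config exposes the audio encoder's sampling rate as a read-only property so callers can query it in one place instead of duplicating the value.

```python
# Hypothetical stand-ins for the real config classes, for illustration only.

class AudioEncoderConfig:
    def __init__(self, sampling_rate=32000):
        self.sampling_rate = sampling_rate

class MusicGenConfig:
    def __init__(self, audio_encoder):
        self.audio_encoder = audio_encoder

    @property
    def sampling_rate(self):
        # Delegate to the sub-config rather than storing a second copy.
        return self.audio_encoder.sampling_rate

cfg = MusicGenConfig(AudioEncoderConfig())
print(cfg.sampling_rate)  # 32000
```

Delegating via a property keeps the sampling rate single-sourced: changing the audio encoder's value is immediately visible on the top-level config.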
      Overhaul Conversation class and prompt templating (#25323) · 866df66f
      Matt authored
      
      
      * First commit while I figure this out
      
      * make fixup
      
      * Remove unused method
      
      * Store prompt attrib
      
      * Fix prompt argument for tests
      
      * Make same changes in fast tokenizer
      
      * Remove global prompts from fast tokenizer too
      
      * stash commit
      
      * stash commit
      
      * Migrate PromptConfig to its True Final Location
      
      * Replace Conversation entirely with the new class
      
      * Import/dependency fixes
      
      * Import/dependency fixes
      
      * Change format for lots of default prompts
      
      * More default prompt fixups
      
      * Revert llama old methods so we can compare
      
      * Fix some default configs
      
      * Fix some default configs
      
      * Fix misspelled kwarg
      
      * Fixes for Blenderbot
      
      * make fixup
      
      * little rebase cleanup
      
      * Add basic documentation
      
      * Quick doc fix
      
      * Truncate docstring for now
      
      * Add handling for the case when messages is a single string
      
      * Quick llama merges
      
      * Update conversational pipeline and tests
      
      * Add a couple of legacy properties for backward compatibility
      
      * More legacy handling
      
      * Add docstring for build_conversation_input_ids
      
      * Restructure PromptConfig
      
      * Let's start T E M P L A T I N G
      
      * Refactor all default configs to use templates instead
      
      * Revert changes to the special token properties since we don't need them anymore
      
      * More class templates
      
      * Make the sandbox even sandier
      
      * Everything replaced with pure templating
      
      * Remove docs for PromptConfig
      
      * Add testing and optional requirement boilerplate
      
      * Fix imports and make fixup
      
      * Fix LLaMA tests and add Conversation docstring
      
      * Finally get LLaMA working with the template system
      
      * Finally get LLaMA working with the template system
      
      * make fixup
      
      * make fixup
      
      * fmt-off for the long lists of test tokens
      
      * Rename method to apply_chat_template for now
      
      * Start on documentation
      
      * Make chat_template a property that reads through to the default if it's not set
      
      * Expand docs
      
      * Expand chat templating doc some more
      
      * trim/lstrip blocks by default and update doc
      
      * Few doc tweaks
      
      * rebase cleanup
      
      * Clarify docstring
      
      * rebase cleanup
      
      * rebase cleanup
      
      * make fixup
      
      * Quick doc edit
      
      * Reformat the standard template to match ChatML
      
      * Re-add PEFT check
      
      * Update docs/source/en/chat_templating.md
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Add apply_chat_template to the tokenizer doc
      
      * make fixup
      
      * Add doc links
      
      * Fix chat links
      
      * Fix chat links
      
      * Explain system messages in the doc
      
      * Add chat template test
      
      * Proper save-loading for chat template attribute
      
      * Add test skips for layout models
      
      * Remove _build_conversation_input_ids, add default_chat_template to code_llama
      
      * Make sure all LLaMA models are using the latest template
      
      * Remove default_system_prompt block in code_llama because it has no default prompt
      
      * Update ConversationPipeline preprocess
      
      * Add correct #Copied from links to the default_chat_templates
      
      * Remove unneeded type checking line
      
      * Add a dummy mark_processed method
      
      * Reorganize Conversation to have **deprecated_kwargs
      
      * Update chat_templating.md
      
      * Quick fix to LLAMA tests
      
      * Small doc tweaks
      
      * Add proper docstrings and "copied from" statements to all default chat templates
      
      * Merge use_default_system_prompt support for code_llama too
      
      * Improve clarity around self.chat_template
      
      * Docstring fix
      
      * Fix blenderbot default template
      
      * More doctest fix
      
      * Break out some tokenizer kwargs
      
      * Update doc to explain default templates
      
      * Quick tweaks to tokenizer args
      
      * Cleanups for tokenizer args
      
      * Add note about caching
      
      * Quick tweak to the chat-templating doc
      
      * Update the LLaMA template with error checking and correct system message embedding
      
      * make fixup
      
      * make fixup
      
      * add requires_jinja
      
      * Cleanup to expected output formatting
      
      * Add caching
      
      * Fix typo in llama default template
      
      * Update LLaMA tests
      
      * Update documentation
      
      * Improved legacy handling in the Conversation class
      
      * Update Jinja template with proper error handling
      
      * Quick bugfix
      
      * Proper exception raising
      
      * Change caching behaviour so it doesn't try to pickle an entire Jinja env
      
      * make fixup
      
      * rebase cleanup
      
      ---------
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
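The templating idea this PR introduces ("Reformat the standard template to match ChatML") can be sketched in plain Python. This is a hedged sketch: the real implementation renders a Jinja template stored on the tokenizer via `apply_chat_template`; the function below only mimics the ChatML-shaped output for a list of `{"role", "content"}` messages.

```python
# Simplified stand-in for tokenizer.apply_chat_template with a ChatML-style
# template; the real method renders a Jinja template, not hardcoded f-strings.

def apply_chat_template(messages, add_generation_prompt=False):
    text = ""
    for m in messages:
        text += f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n"
    if add_generation_prompt:
        # Open an assistant turn so the model continues from here.
        text += "<|im_start|>assistant\n"
    return text

chat = [
    {"role": "system", "content": "You are helpful."},
    {"role": "user", "content": "Hi!"},
]
print(apply_chat_template(chat, add_generation_prompt=True))
```

Storing the template on the tokenizer (with a per-class default as fallback, as the commit messages describe) means each model formats chats its own way while pipelines call one uniform method.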
  2. 05 Sep, 2023 2 commits
  3. 01 Sep, 2023 1 commit
  4. 31 Aug, 2023 1 commit
  5. 30 Aug, 2023 1 commit
      Add Blip2 model in VQA pipeline (#25532) · 09dc9951
      Juan Pizarro authored
      * Add Blip2 model in VQA pipeline
      
      * use require_torch_gpu for test_large_model_pt_blip2
      
      * use can_generate in vqa pipeline
      
      * test Blip2ForConditionalGeneration using float16
      
      * remove custom can_generate from Blip2ForConditionalGeneration
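The "use can_generate in vqa pipeline" step can be sketched as a simple dispatch. The classes below are hypothetical stand-ins (not BLIP-2 or the real pipeline): the point is that a VQA pipeline can branch on `can_generate()` to call `generate` for generative models and a plain forward pass otherwise.

```python
# Illustrative toy models; real ones come from transformers, and the real
# pipeline works on tensors rather than strings.

class GenerativeModel:
    def can_generate(self):
        return True
    def generate(self, question, image):
        return f"generated answer to {question!r}"

class ClassifierModel:
    def can_generate(self):
        return False
    def forward(self, question, image):
        return {"answer": "label_0"}

def vqa_forward(model, question, image):
    # Branch once on capability instead of hardcoding model classes.
    if model.can_generate():
        return model.generate(question, image)
    return model.forward(question, image)
```

Dispatching on capability rather than type is what let BLIP-2 slot into the existing VQA pipeline without a custom code path.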
  6. 24 Aug, 2023 1 commit
  7. 18 Aug, 2023 1 commit
  8. 17 Aug, 2023 1 commit
      Add Text-To-Speech pipeline (#24952) · b8f69d0d
      Yoach Lacombe authored
      
      
      * add AutoModelForTextToSpeech class
      
      * add TTS pipeline and testing
      
      * add docstrings to text_to_speech pipeline
      
      * fix torch dependency
      
      * correct 'processor is None' case in Pipeline
      
      * correct repo id
      
      * modify text-to-speech -> text-to-audio
      
      * remove processor
      
      * rename text_to_speech pipelines files to text_audio
      
      * add textToWaveform and textToSpectrogram instead of textToAudio classes
      
      * update TTS pipeline to the bare minimum
      
      * update tests TTS pipeline
      
      * make style and erase useless import torch in TTS pipeline tests
      
      * modify how to check if generate or forward in TTS pipeline
      
      * remove unnecessary extra new lines
      
      * Apply suggestions from code review
      Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * refactor input_texts -> text_inputs
      
      * correct docstrings of TTS.__call__
      
      * correct the shape of generated waveform
      
      * take care of Bark tokenizer special case
      
      * correct run_pipeline_test TTS
      
      * make style
      
      * update TTS docstrings
      
      * address Sylvain nit refactors
      
      * make style
      
      * refactor into one liners
      
      * correct squeeze
      
      * correct way to test if forward or generate
      
      * Update output audio waveform shape
      
      * make style
      
      * correct import
      
      * modify how the TTS pipeline test if a model can generate
      
      * align shape output of TTS pipeline with consistent shape
      
      ---------
      Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
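The "modify how to check if generate or forward" and "align shape output" steps can be sketched like this. The classes are illustrative stand-ins, not real models: some text-to-audio models autoregressively `generate` a waveform (Bark, MusicGen), others produce audio with a single `forward` call, and the pipeline normalizes both paths to one output shape.

```python
# Toy stand-ins for the two model families the TTS pipeline handles.

class GenerativeTTS:
    sampling_rate = 24000
    def can_generate(self):
        return True
    def generate(self, text_inputs):
        return [0.0, 0.1, 0.2]  # stand-in 1-D waveform

class SpectrogramTTS:
    sampling_rate = 16000
    def can_generate(self):
        return False
    def forward(self, text_inputs):
        return [0.0, 0.5]

def synthesize(model, text_inputs):
    if model.can_generate():
        waveform = model.generate(text_inputs)
    else:
        waveform = model.forward(text_inputs)
    # Return one consistent shape: a flat waveform plus its sampling rate.
    return {"audio": waveform, "sampling_rate": model.sampling_rate}

print(synthesize(GenerativeTTS(), "hello")["sampling_rate"])  # 24000
```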
  9. 16 Aug, 2023 1 commit
  10. 08 Aug, 2023 1 commit
  11. 28 Jul, 2023 1 commit
  12. 18 Jul, 2023 2 commits
  13. 17 Jul, 2023 1 commit
  14. 13 Jul, 2023 1 commit
  15. 26 Jun, 2023 1 commit
  16. 23 Jun, 2023 1 commit
  17. 22 Jun, 2023 1 commit
  18. 21 Jun, 2023 1 commit
      add word-level timestamps to Whisper (#23205) · cd927a47
      Matthijs Hollemans authored
      * let's go!
      
      * initial implementation of token-level timestamps
      
      * only return a single timestamp per token
      
      * remove token probabilities
      
      * fix return type
      
      * fix doc comment
      
      * strip special tokens
      
      * rename
      
      * revert to not stripping special tokens
      
      * only support models that have alignment_heads
      
      * add integration test
      
      * consistently name it token-level timestamps
      
      * small DTW tweak
      
      * initial support for ASR pipeline
      
      * fix pipeline doc comments
      
      * resolve token timestamps in pipeline with chunking
      
      * change warning when no final timestamp is found
      
      * return word-level timestamps
      
      * fixup
      
      * fix bug that skipped final word in each chunk
      
      * fix failing unit tests
      
      * merge punctuations into the words
      
      * also return word tokens
      
      * also return token indices
      
      * add (failing) unit test for combine_tokens_into_words
      
      * make combine_tokens_into_words private
      
      * restore OpenAI's punctuation rules
      
      * add pipeline tests
      
      * make requested changes
      
      * PR review changes
      
      * fix failing pipeline test
      
      * small stuff from PR
      
      * only return words and their timestamps, not segments
      
      * move alignment_heads into generation config
      
      * forgot to set alignment_heads in pipeline tests
      
      * tiny comment fix
      
      * grr
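The "small DTW tweak" above refers to the dynamic-time-warping alignment at the heart of this PR: token-level timestamps are read off a monotonic path through the cross-attention of the model's `alignment_heads`. A minimal, self-contained DTW sketch (illustrative only, not the transformers implementation, which runs on attention matrices):

```python
def dtw_path(cost):
    """Monotonic (token, frame) alignment path with minimal accumulated cost."""
    n, m = len(cost), len(cost[0])
    INF = float("inf")
    acc = [[INF] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            acc[i][j] = cost[i - 1][j - 1] + min(
                acc[i - 1][j - 1], acc[i - 1][j], acc[i][j - 1]
            )
    # Backtrack from the corner, always stepping to the cheapest predecessor.
    path, i, j = [], n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        i, j = min(
            [(i - 1, j - 1), (i - 1, j), (i, j - 1)],
            key=lambda p: acc[p[0]][p[1]],
        )
    return path[::-1]

# Two tokens, two frames, cheapest along the diagonal:
print(dtw_path([[0, 1], [1, 0]]))  # [(0, 0), (1, 1)]
```

In the PR, the cost matrix is the negated cross-attention from the alignment heads, so the path assigns each token the audio frame it attends to most; frame indices then convert to seconds, and punctuation tokens are merged into their neighboring words.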
  19. 20 Jun, 2023 1 commit
  20. 14 Jun, 2023 1 commit
  21. 09 Jun, 2023 1 commit
  22. 22 May, 2023 1 commit
  23. 18 May, 2023 1 commit
  24. 04 May, 2023 1 commit
  25. 24 Apr, 2023 1 commit
  26. 21 Apr, 2023 1 commit
  27. 20 Apr, 2023 1 commit
  28. 17 Apr, 2023 1 commit
  29. 04 Apr, 2023 1 commit
  30. 23 Mar, 2023 2 commits
  31. 22 Mar, 2023 1 commit
      Chunkable token classification pipeline (#21771) · d62e7d88
      Luc CAILLIAU authored
      
      
      * Chunkable classification pipeline 
      
      The TokenClassificationPipeline can now process sequences longer than 512 tokens, regardless of the framework, model, or tokenizer: just pass process_all=True and, optionally, a stride. The behavior is unchanged when these optional parameters are omitted. For the overlapping regions produced when stride is above 0, each overlapped token keeps only its maximum score across all chunks that contain it.
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * update with latest black format
      
      * update black format
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * format correction
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update comments
      
      * Update src/transformers/pipelines/token_classification.py
      Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
      
      * Update token_classification.py
      
      Correct spaces, remove process_all and keep only stride. If stride is provided, the pipeline is applied to the whole text.
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update chunk aggregation
      
      Update the chunk aggregation strategy based on entities aggregation.
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      Remove unnecessary pop from outputs dict
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update token_classification.py
      
      * Update src/transformers/pipelines/token_classification.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * add chunking tests
      
      * correct formatting
      
      * correct formatting
      
      * correct model id for test chunking
      
      * update scores with nested simplify
      
      * Update test_pipelines_token_classification.py
      
      * Update test_pipelines_token_classification.py
      
      * update model to a tiny one
      
      * Update test_pipelines_token_classification.py
      
      * Adding smaller test for chunking.
      
      * Fixup
      
      * Update token_classification.py
      
      * Update src/transformers/pipelines/token_classification.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/pipelines/token_classification.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      ---------
      Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
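The aggregation rule this PR describes for overlapping chunks ("we consider only the max scores for each overlapped token") can be sketched as follows. This is an illustrative reduction, not the pipeline's actual data structures: each chunk maps token positions to a (label, score) prediction, and overlaps are resolved by keeping the highest-scoring one.

```python
# Hypothetical reduced representation: real pipeline chunks carry offsets,
# logits, and entity groups; here each chunk is {token_index: (label, score)}.

def aggregate_chunks(chunks):
    """Keep, for each token, the prediction from the chunk where it scored highest."""
    best = {}
    for chunk in chunks:
        for idx, (label, score) in chunk.items():
            if idx not in best or score > best[idx][1]:
                best[idx] = (label, score)
    return best

# Two chunks produced with a stride overlap on token 1; the higher score wins.
chunk_a = {0: ("B-PER", 0.9), 1: ("I-PER", 0.4)}
chunk_b = {1: ("I-PER", 0.8), 2: ("O", 0.7)}
print(aggregate_chunks([chunk_a, chunk_b]))
```

Taking the per-token maximum favors the chunk where the token sits with the most context, which is the rationale for striding the chunks in the first place.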
  32. 16 Mar, 2023 1 commit
  33. 15 Mar, 2023 1 commit
  34. 09 Mar, 2023 1 commit
  35. 08 Mar, 2023 1 commit