• Yoach Lacombe's avatar
    Add Text-To-Speech pipeline (#24952) · b8f69d0d
    Yoach Lacombe authored
    
    
    * add AutoModelForTextToSpeech class
    
    * add TTS pipeline and tessting
    
    * add docstrings to text_to_speech pipeline
    
    * fix torch dependency
    
    * corrector 'processor is None' case in Pipeline
    
    * correct repo id
    
    * modify text-to-speech -> text-to-audio
    
    * remove processor
    
    * rename text_to_speech pipelines files to text_audio
    
    * add textToWaveform and textToSpectrogram instead of textToAudio classes
    
    * update TTS pipeline to the bare minimum
    
    * update tests TTS pipeline
    
    * make style and erase useless import torch in TTS pipeline tests
    
    * modify how to check if generate or forward in TTS pipeline
    
    * remove unnecessary extra new lines
    
    * Apply suggestions from code review
    Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
    
    * refactor input_texts -> text_inputs
    
    * correct docstrings of TTS.__call__
    
    * correct the shape of generated waveform
    
    * take care of Bark tokenizer special case
    
    * correct run_pipeline_test TTS
    
    * make style
    
    * update TTS docstrings
    
    * address Sylvain nit refactors
    
    * make style
    
    * refactor into one liners
    
    * correct squeeze
    
    * correct way to test if forward or generate
    
    * Update output audio waveform shape
    
    * make style
    
    * correct import
    
    * modify how the TTS pipeline test if a model can generate
    
    * align shape output of TTS pipeline with consistent shape
    
    ---------
    Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
    b8f69d0d
update_metadata.py 14.1 KB