    feat: Whisper prompting (#22496) · 2acedf47
    Connor Henderson authored
    * initial working additions
    
    * clean and rename, add cond stripping initial prompt to decode
    
    * cleanup, edit create_initial_prompt_ids, add tests
    
    * repo consistency, flip order of conditional
    
    * fix error, move the processor fn to the tokenizer
    
    * repo consistency, update test ids to corresponding tokenizer
    
    * use convert_tokens_to_ids not get_vocab...
    
    * use actual conditional in generate
    
    * make style
    
    * initial address comments
    
    * initial working add new params to pipeline
    
    * first draft of sequential generation for condition_on_previous_text
    
    * add/update tests, make compatible with timestamps
    
    * make compatible with diff. input kwargs and max length
    
    * add None check
    
    * add temperature check
    
    * flip temp check operand
    
    * refocusing to prev pr scope
    
    * remove the params too
    
    * make style
    
    * edits, move max length incorporating prompt to whisper
    
    * address comments
    
    * remove asr pipeline prompt decoding, fix indexing
    
    * address comments (more tests, validate prompt)
    
    * un-comment out tests (from debug)
    
    * remove old comment
    
    * address comments
    
    * fix typo
    
    * remove timestamp token from test
    
    * make style
    
    * cleanup
    
    * copy method to fast tokenizer, set max_new_tokens for test
    
    * prompt_ids type just pt
    
    * address Amy's comments
    
    * make style
test_modeling_whisper.py 72.6 KB