1. 28 Jun, 2023 7 commits
  2. 27 Jun, 2023 4 commits
  3. 26 Jun, 2023 6 commits
  4. 23 Jun, 2023 3 commits
  5. 22 Jun, 2023 3 commits
  6. 21 Jun, 2023 3 commits
    • Matthijs Hollemans's avatar
      add word-level timestamps to Whisper (#23205) · cd927a47
      Matthijs Hollemans authored
      * let's go!
      
      * initial implementation of token-level timestamps
      
      * only return a single timestamp per token
      
      * remove token probabilities
      
      * fix return type
      
      * fix doc comment
      
      * strip special tokens
      
      * rename
      
      * revert to not stripping special tokens
      
      * only support models that have alignment_heads
      
      * add integration test
      
      * consistently name it token-level timestamps
      
      * small DTW tweak
      
      * initial support for ASR pipeline
      
      * fix pipeline doc comments
      
      * resolve token timestamps in pipeline with chunking
      
      * change warning when no final timestamp is found
      
      * return word-level timestamps
      
      * fixup
      
      * fix bug that skipped final word in each chunk
      
      * fix failing unit tests
      
      * merge punctuations into the words
      
      * also return word tokens
      
      * also return token indices
      
      * add (failing) unit test for combine_tokens_into_words
      
      * make combine_tokens_into_words private
      
      * restore OpenAI's punctuation rules
      
      * add pipeline tests
      
      * make requested changes
      
      * PR review changes
      
      * fix failing pipeline test
      
      * small stuff from PR
      
      * only return words and their timestamps, not segments
      
      * move alignment_heads into generation config
      
      * forgot to set alignment_heads in pipeline tests
      
      * tiny comment fix
      
      * grr
      cd927a47
    • Younes Belkada's avatar
      Fix gradient checkpointing + fp16 autocast for most models (#24247) · 285a4801
      Younes Belkada authored
      
      
      * fix gc bug
      
      * continue PoC on OPT
      
      * fixes
      
      * :exploding_head:
      
      * fix tests
      
      * remove pytest.mark
      
      * fixup
      
      * forward contrib credits from discussions
      
      * forward contrib credits from discussions
      
      * reverting changes on untouched files.
      
      ---------
      Co-authored-by: default avatarzhaoqf123 <zhaoqf123@users.noreply.github.com>
      Co-authored-by: default avatar7eu7d7 <7eu7d7@users.noreply.github.com>
      285a4801
    • Joao Gante's avatar
      5f0801d1
  7. 20 Jun, 2023 10 commits
  8. 16 Jun, 2023 4 commits
    • Matt's avatar
      Add test for proper TF input signatures (#24320) · 91389950
      Matt authored
      * Add test for proper input signatures
      
      * No more signature pruning
      
      * Test the dummy inputs are valid too
      
      * fine-tine -> fine-tune
      
      * Fix indent in test_dataset_conversion
      91389950
    • Sylvain Gugger's avatar
      Tied weights load (#24310) · 096f2cf1
      Sylvain Gugger authored
      * Use tied weight keys
      
      * More
      
      * Fix tied weight missing warning
      
      * Only give info on unexpected keys with different classes
      
      * Deal with empty archs
      
      * Fix tests
      
      * Refine test
      096f2cf1
    • Matt's avatar
      Big TF test cleanup (#24282) · 34037129
      Matt authored
      * Fix one BLIP arg not being optional, remove misspelled arg
      
      * Remove the lxmert test overrides and just use the base test_saved_model_creation
      
      * saved_model_creation fixes and re-enabling tests across the board
      
      * Remove unnecessary skip
      
      * Stop caching sinusoidal embeddings in speech_to_text
      
      * Fix transfo_xl compilation
      
      * Fix transfo_xl compilation
      
      * Fix the conditionals in xglm
      
      * Set the save spec only when building
      
      * Clarify comment
      
      * Move comment correctly
      
      * Correct embeddings generation for speech2text
      
      * Mark RAG generation tests as @slow
      
      * Remove redundant else:
      
      * Add comment to clarify the save_spec line in build()
      
      * Fix size tests for XGLM at last!
      
      * make fixup
      
      * Remove one band_part operation
      
      * Mark test_keras_fit as @slow
      34037129
    • Yih-Dar's avatar
      Byebye pytorch 1.9 (#24080) · 896a58de
      Yih-Dar authored
      
      
      byebye
      
      ---------
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      896a58de