1. 04 Aug, 2023 1 commit
  2. 13 Jul, 2023 1 commit
  3. 11 Jul, 2023 1 commit
  4. 07 Jul, 2023 1 commit
  5. 29 Jun, 2023 1 commit
  6. 27 Jun, 2023 1 commit
    • Clean load keys (#24505) · 8e5d1619
      Sylvain Gugger authored
      * Preliminary work on some models
      
      * Fix test load missing and make sure nonpersistent buffers are tested
      
      * Always ignore nonpersistent buffers if in state_dict
      
      * Treat models
      
      * More models
      
      * Treat remaining models
      
      * Fix quality
      
      * Fix tests
      
      * Remove draft
      
      * This test is not needed anymore
      
      * Fix copies
      
      * Fix last test
      
      * Newly added models
      
      * Fix last tests
      
      * Address review comments
      8e5d1619
  7. 26 Jun, 2023 1 commit
  8. 23 Jun, 2023 1 commit
  9. 22 Jun, 2023 1 commit
  10. 21 Jun, 2023 2 commits
    • add word-level timestamps to Whisper (#23205) · cd927a47
      Matthijs Hollemans authored
      * let's go!
      
      * initial implementation of token-level timestamps
      
      * only return a single timestamp per token
      
      * remove token probabilities
      
      * fix return type
      
      * fix doc comment
      
      * strip special tokens
      
      * rename
      
      * revert to not stripping special tokens
      
      * only support models that have alignment_heads
      
      * add integration test
      
      * consistently name it token-level timestamps
      
      * small DTW tweak
      
      * initial support for ASR pipeline
      
      * fix pipeline doc comments
      
      * resolve token timestamps in pipeline with chunking
      
      * change warning when no final timestamp is found
      
      * return word-level timestamps
      
      * fixup
      
      * fix bug that skipped final word in each chunk
      
      * fix failing unit tests
      
      * merge punctuations into the words
      
      * also return word tokens
      
      * also return token indices
      
      * add (failing) unit test for combine_tokens_into_words
      
      * make combine_tokens_into_words private
      
      * restore OpenAI's punctuation rules
      
      * add pipeline tests
      
      * make requested changes
      
      * PR review changes
      
      * fix failing pipeline test
      
      * small stuff from PR
      
      * only return words and their timestamps, not segments
      
      * move alignment_heads into generation config
      
      * forgot to set alignment_heads in pipeline tests
      
      * tiny comment fix
      
      * grr
      cd927a47
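
The "merge punctuations into the words" step above can be pictured with a small sketch. This is not the code from the PR: the function name `merge_punctuation`, the `(text, start, end)` tuple layout, and the two punctuation sets are assumptions chosen for illustration. Opening punctuation is attached to the word that follows it, closing punctuation to the word before it, and the timestamps are widened to cover the merged span.

```python
# Hypothetical punctuation sets, chosen for illustration only.
OPENING = "“¿([{"
CLOSING = ".,!?:;”)]}"


def merge_punctuation(words):
    """Fold punctuation-only entries into neighbouring words.

    `words` is a list of (text, start, end) tuples. Returns a new list in
    which opening punctuation is prefixed to the next word and closing
    punctuation is appended to the previous word.
    """
    merged = []
    pending = None  # opening punctuation waiting for the next word
    for text, start, end in words:
        if text and all(ch in OPENING for ch in text):
            # Hold opening punctuation until the next real word arrives.
            if pending is None:
                pending = (text, start, end)
            else:
                pending = (pending[0] + text, pending[1], end)
        elif text and all(ch in CLOSING for ch in text) and merged and pending is None:
            # Attach closing punctuation to the previous word, extend its end time.
            prev_text, prev_start, _ = merged[-1]
            merged[-1] = (prev_text + text, prev_start, end)
        else:
            if pending is not None:
                # Prefix the held punctuation and start at its timestamp.
                text, start = pending[0] + text, pending[1]
                pending = None
            merged.append((text, start, end))
    if pending is not None:
        # Trailing opening punctuation with no word after it is kept as-is.
        merged.append(pending)
    return merged


words = [
    ("Hello", 0.0, 0.4), (",", 0.4, 0.5),
    ("(", 0.5, 0.6), ("world", 0.6, 1.0), (")", 1.0, 1.1), ("!", 1.1, 1.2),
]
result = merge_punctuation(words)
```

With the sample input, the six entries collapse into two words, `Hello,` and `(world)!`, each spanning the timestamps of the pieces it absorbed.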
    • Fix gradient checkpointing + fp16 autocast for most models (#24247) · 285a4801
      Younes Belkada authored
      
      
      * fix gc bug
      
      * continue PoC on OPT
      
      * fixes
      
      * :exploding_head:
      
      * fix tests
      
      * remove pytest.mark
      
      * fixup
      
      * forward contrib credits from discussions
      
      * forward contrib credits from discussions
      
      * reverting changes on untouched files.
      
      ---------
      Co-authored-by: zhaoqf123 <zhaoqf123@users.noreply.github.com>
      Co-authored-by: 7eu7d7 <7eu7d7@users.noreply.github.com>
      285a4801
  11. 13 Jun, 2023 1 commit
    • Tied params cleanup (#24211) · 695928e1
      Sylvain Gugger authored
      * First test
      
      * Add info for all models
      
      * style
      
      * Repo consistency
      
      * Fix last model and cleanup prints
      
      * Repo consistency
      
      * Use consistent function for detecting tied weights
      695928e1
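
One common way to detect tied weights is to group parameter names by the identity of the object they point to. The sketch below illustrates that idea only. It is not the function the commit refers to: `find_tied_names` is a hypothetical name, and plain Python lists stand in for tensors.

```python
from collections import defaultdict


def find_tied_names(named_params):
    """Return groups of names that refer to the same underlying object.

    `named_params` is an iterable of (name, param) pairs. Two names are
    considered tied when they point at the very same object.
    """
    groups = defaultdict(list)
    for name, param in named_params:
        groups[id(param)].append(name)
    # Keep only the groups in which more than one name shares an object.
    return [names for names in groups.values() if len(names) > 1]


shared = [0.0, 0.0]  # stand-in for a weight shared by two layers
params = [
    ("embed.weight", shared),
    ("lm_head.weight", shared),
    ("proj.weight", [0.0, 0.0]),  # equal values, but a distinct object
]
tied = find_tied_names(params)
```

Comparing identity rather than values matters here: `proj.weight` holds equal numbers but is a separate object, so it is not reported as tied.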
  12. 12 Jun, 2023 1 commit
  13. 08 Jun, 2023 1 commit
  14. 24 May, 2023 1 commit
  15. 19 May, 2023 1 commit
    • feat: Whisper prompting (#22496) · 2acedf47
      Connor Henderson authored
      * initial working additions
      
      * clean and rename, add cond stripping initial prompt to decode
      
      * cleanup, edit create_initial_prompt_ids, add tests
      
      * repo consistency, flip order of conditional
      
      * fix error, move the processor fn to the tokenizer
      
      * repo consistency, update test ids to corresponding tokenizer
      
      * use convert_tokens_to_ids not get_vocab...
      
      * use actual conditional in generate
      
      * make style
      
      * initial address comments
      
      * initial working add new params to pipeline
      
      * first draft of sequential generation for condition_on_previous_text
      
      * add/update tests, make compatible with timestamps
      
      * make compatible with diff. input kwargs and max length
      
      * add None check
      
      * add temperature check
      
      * flip temp check operand
      
      * refocusing to prev pr scope
      
      * remove the params too
      
      * make style
      
      * edits, move max length incorporating prompt to whisper
      
      * address comments
      
      * remove asr pipeline prompt decoding, fix indexing
      
      * address comments (more tests, validate prompt)
      
      * un-comment out tests (from debug)
      
      * remove old comment
      
      * address comments
      
      * fix typo
      
      * remove timestamp token from test
      
      * make style
      
      * cleanup
      
      * copy method to fast tokenizer, set max_new_tokens for test
      
      * prompt_ids type just pt
      
      * address Amy's comments
      
      * make style
      2acedf47
  16. 05 May, 2023 2 commits
  17. 14 Apr, 2023 1 commit
  18. 04 Apr, 2023 1 commit
  19. 28 Mar, 2023 1 commit
  20. 14 Mar, 2023 1 commit
  21. 13 Mar, 2023 1 commit
  22. 11 Mar, 2023 1 commit
  23. 08 Mar, 2023 1 commit
  24. 07 Mar, 2023 1 commit
  25. 02 Mar, 2023 1 commit
  26. 01 Mar, 2023 1 commit
    • Change the way tensor is reshaped in BartAttention (from .view to .reshape) (#21860) · ebd52589
      raghavanone authored
      * Change the .view call to .reshape
      
      * Change the .view call to .reshape to all the copies from bart attention
      
      * Fix copies and style
      
      * Fix copies and style
      
      * Fix copies and style
      
      * Fix copies and style
      
      * Fix copies and style
      
      * Revert unnecessary changes
      
      * Revert unnecessary changes
      
      * Revert unnecessary changes
      
      * Revert unnecessary changes
      ebd52589
  27. 24 Feb, 2023 1 commit
  28. 16 Feb, 2023 2 commits
  29. 07 Feb, 2023 1 commit
    • [CI] Remove `past` in favor of `past_key_values` (#21443) · 12eb528b
      Arthur authored
      * fix past renamed to past_key_value
      
      * update more `past` that were skipped
      
      * fixup
      
      * remove changes made to rag
      
      * refactor `_reorder_cache` to use `past_key_values`
      
      * fix git `prepare_inputs_for_generation` to pass tests when false is needed in use_cache
      12eb528b
  30. 06 Feb, 2023 1 commit
    • Update quality tooling for formatting (#21480) · 6f79d264
      Sylvain Gugger authored
      * Result of black 23.1
      
      * Update target to Python 3.7
      
      * Switch flake8 to ruff
      
      * Configure isort
      
      * Configure isort
      
      * Apply isort with line limit
      
      * Put the right black version
      
      * adapt black in check copies
      
      * Fix copies
      6f79d264
  31. 25 Jan, 2023 1 commit
    • [Whisper] Refactor whisper (#21252) · 255257f3
      Arthur authored
      * update whisper logit processor
      
      * add generate for whisper
      
      * remove part of the whisper specific code from pipeline
      
      * update logit processes
      
      * major update
      
      * enforce first timestamp
      
      * update generate
      
      * add more tests
      
      * update new decoding strategy
      
      * Apply suggestions from code review
      
      * update docstring
      
      * fixup
      
      * default config will not have multilingual ar
      
      * update expected tokenizer size, see pull on the hub for whisper-tiny
      255257f3
  32. 23 Jan, 2023 1 commit
  33. 08 Jan, 2023 1 commit
    • Replace `past` with `past_key_values` (#20944) · f0577df6
      Arthur authored
      * start cleanup
      
      * more updates
      
      * more models are affected
      
      * more updates
      
      * update generation utils
      
      * style
      
      * revert change that removed reorder cache
      
      * update generation utils
      
      * style
      
      * style
      
      * remove reorder cache
      f0577df6
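
A keyword rename such as `past` to `past_key_values` is often handled with a small compatibility shim while callers migrate. The sketch below shows that general pattern only. It does not describe what this commit did, and `resolve_past_key_values` is a hypothetical helper, not part of the library.

```python
import warnings


def resolve_past_key_values(past_key_values=None, **kwargs):
    """Accept the legacy `past` keyword and map it onto `past_key_values`.

    Returns the resolved cache and the remaining keyword arguments.
    """
    if "past" in kwargs:
        legacy = kwargs.pop("past")
        if past_key_values is not None:
            # Both names were supplied: refuse to guess which one wins.
            raise ValueError("Pass either `past` or `past_key_values`, not both.")
        warnings.warn(
            "`past` is deprecated; use `past_key_values` instead.",
            FutureWarning,
        )
        past_key_values = legacy
    return past_key_values, kwargs


with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    cache, rest = resolve_past_key_values(past=("k", "v"), use_cache=True)
```

The old keyword still works but emits a `FutureWarning`, and supplying both names raises an error instead of silently preferring one.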
  34. 20 Dec, 2022 1 commit
  35. 06 Dec, 2022 1 commit
  36. 05 Dec, 2022 1 commit
    • Fix whisper and speech to text doc (#20595) · 9763f829
      Arthur authored
      * Fix whisper and speech to text doc
      # What does this PR do?
      Previously the documentation was badly indented for both models and indicated that
      > If `decoder_input_ids` and `decoder_inputs_embeds` are both unset, `decoder_inputs_embeds` takes the value of `inputs_embeds`.
      which is only valid for the forward pass of `ForConditionalGeneration`, not for the model alone.
      
      * other fixes
      9763f829
  37. 30 Nov, 2022 1 commit