1. 08 Aug, 2024 1 commit
  2. 01 Aug, 2024 1 commit
  3. 29 Jul, 2024 2 commits
      Whisper tokenizer word level timestamps (#32197) · 3fbaaaa6
      Kamil Akesbi authored
      * fix _fix_key in PreTrainedModel
      
      * fix _find_longest_common_sequence
      
      * add test
      
      * remove result.json
      
      * nit
      
      * update test
      Generate: end-to-end compilation (#30788) · 7ffe25f2
      Joao Gante authored
      * mvp
      
      * added test (a few models need fixes)
      
      * fix a few test cases
      
      * test nits
      
      * harder test 😈
      
      * revert changes in stablelm
      
      * test with improved condition
      
      * add todo
      
      * tmp commit
      
      * merged with main
      
      * nits
      
      * add todo
      
      * final corrections
      
      * add docs for generation compilation
      
      * docs nits
      
      * add  tip
      
      * PR suggestions
      
      * add more details to the compilation docs
      
      * fix cache positions
      
      * cache is now init in generate; update docs
      
      * tag test as flaky
      
      * docs
      
      * post rebase make fixup and other nits
      
      * remove unintended changes
      
      * whisper (encoder-decoder) not supported
      
      * move token default updates to ; add tests for token defaults
      
      * push changes
      
      * manual rebase
      
      * chameleon doesn't support this
      
      * fix test_static_cache_mha_mqa_gqa (broken in another PR)
      
      * docs: dynamic is better with end-to-end compilation
  4. 25 Jul, 2024 1 commit
  5. 23 Jul, 2024 2 commits
  6. 19 Jul, 2024 2 commits
  7. 17 Jul, 2024 1 commit
  8. 15 Jul, 2024 1 commit
  9. 12 Jul, 2024 1 commit
  10. 02 Jul, 2024 1 commit
      [whisper] static kv cache (#31166) · a9701953
      Sanchit Gandhi authored
      
      
      * make work with cache abstraction
      
      * correct for static cache
      
      * hacks for compile
      
      * make fast
      
      * fix
      
      * fix pos ids
      
      * generate
      
      * fix sdpa
      
      * fix sdpa cache pos
      
      * fix fa2
      
      * clean fa2
      
      * integrate cache into generate
      
      * make style
      
      * copies
      
      * more copies
      
      * update eager
      
      * update sdpa
      
      * update fa2
      
      * simplify
      
      * use cache pos
      
      * always compute cross-cache for debug
      
      * avoid recompiles
      Co-authored-by: Arthur Zucker <arthur@huggingface.co>
      
      * fix fix
      
      * fix fix fix
      
      * more fix
      
      * try encoder-decoder cache (too messy)
      
      * revert encoder-decoder cache
      
      * check cross-attn cache
      
      * use enc-dec dataclass
      
      * use richer enc-dec dataclass
      
      * clean-up
      
      * revert static cache changes
      
      * small fixes
      
      * revert to cpu flag
      
      * fix copies
      
      * add static slow test
      
      * past k/v docstring
      
      * more docstrings
      
      * cache_position docstrings
      
      * add to docs
      
      * add enc-dec cache to docs
      
      * make style
      
      * fix after rebase
      
      * fix beam
      
      * style
      
      * fix generation strategies
      
      * fix most decoder-only tests
      
      * style
      
      * skip test
      
      * more clean up
      
      * small docstrings
      
      * Apply suggestions from code review
      Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
      
      * add todo
      
      * only crop self-attn
      
      * check cache in mixin
      
      * style
      
      * fix re-compile after rebase
      
      * move `is_updated` logic to enc-dec wrapper
      
      * revert back
      
      * revert cache back
      
      * finalise design
      
      * fix
      
      * fix fix
      
      * style
      
      * Update src/transformers/cache_utils.py
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * deprecate
      
      * updates
      
      * final updates
      
      * style
      
      * style
      
      ---------
      Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
  11. 26 Jun, 2024 1 commit
  12. 17 Jun, 2024 1 commit
      Pass datasets trust_remote_code (#31406) · a14b055b
      Albert Villanova del Moral authored
      * Pass datasets trust_remote_code
      
      * Pass trust_remote_code in more tests
      
      * Add trust_remote_dataset_code arg to some tests
      
      * Revert "Temporarily pin datasets upper version to fix CI"
      
      This reverts commit b7672826.
      
      * Pass trust_remote_code in librispeech_asr_dummy docstrings
      
      * Revert "Pin datasets<2.20.0 for examples"
      
      This reverts commit 833fc17a.
      
      * Pass trust_remote_code to all examples
      
      * Revert "Add trust_remote_dataset_code arg to some tests" to research_projects
      
      * Pass trust_remote_code to tests
      
      * Pass trust_remote_code to docstrings
      
      * Fix flax examples tests requirements
      
      * Pass trust_remote_dataset_code arg to tests
      
      * Replace trust_remote_dataset_code with trust_remote_code in one example
      
      * Fix duplicate trust_remote_code
      
      * Replace args.trust_remote_dataset_code with args.trust_remote_code
      
      * Replace trust_remote_dataset_code with trust_remote_code in parser
      
      * Replace trust_remote_dataset_code with trust_remote_code in dataclasses
      
      * Replace trust_remote_dataset_code with trust_remote_code arg
  13. 07 Jun, 2024 1 commit
  14. 27 May, 2024 1 commit
  15. 22 May, 2024 1 commit
  16. 20 May, 2024 1 commit
  17. 15 May, 2024 1 commit
  18. 09 May, 2024 1 commit
  19. 19 Apr, 2024 3 commits
  20. 09 Apr, 2024 1 commit
  21. 08 Apr, 2024 1 commit
  22. 03 Apr, 2024 2 commits
  23. 01 Apr, 2024 1 commit
  24. 13 Mar, 2024 1 commit
  25. 12 Mar, 2024 1 commit
  26. 08 Mar, 2024 1 commit
  27. 01 Mar, 2024 1 commit
  28. 27 Feb, 2024 1 commit
  29. 31 Jan, 2024 1 commit
  30. 19 Jan, 2024 1 commit
  31. 18 Jan, 2024 1 commit
  32. 10 Jan, 2024 1 commit
  33. 22 Dec, 2023 1 commit
  34. 21 Dec, 2023 1 commit