1. 06 Aug, 2024 1 commit
  2. 05 Aug, 2024 4 commits
  3. 01 Aug, 2024 2 commits
  4. 31 Jul, 2024 4 commits
  5. 30 Jul, 2024 1 commit
    • Fix slow GemmaTokenizer and improve SPM slow -> fast conversion process (#32191) · 6e2d04e4
      Joshua Lochner authored
      * Remove user-defined tokens which can be obtained through merges
      
      * Remove debug line
      
      * formatting
      
      * Refactor spm slow -> fast converter
      
      * revert unnecessary refactor
      
      * set comprehension
      
      * remove test files
      
      * Use `vocab_scores`
      
      * Always replace spiece underline with space in decode
      
      * we no longer need token filtering
      
      * Add save fast load slow unit test
      
      * Remove tokenizers version check
      
      * Remove duplicate code
      
      * Make `<start_of_turn>` and `<end_of_turn>` special tokens
      
      * Bias merge priority with length if score is the same
      
      * Add unit test for merge priority
      
      * CI
  6. 29 Jul, 2024 2 commits
    • Whisper tokenizer word level timestamps (#32197) · 3fbaaaa6
      Kamil Akesbi authored
      * fix _fix_key in PreTrainedModel
      
      * fix _find_longest_common_sequence
      
      * add test
      
      * remove result.json
      
      * nit
      
      * update test
    • Generate: end-to-end compilation (#30788) · 7ffe25f2
      Joao Gante authored
      * mvp
      
      * added test (a few models need fixes)
      
      * fix a few test cases
      
      * test nits
      
      * harder test 😈
      
      * revert changes in stablelm
      
      * test with improved condition
      
      * add todo
      
      * tmp commit
      
      * merged with main
      
      * nits
      
      * add todo
      
      * final corrections
      
      * add docs for generation compilation
      
      * docs nits
      
      * add  tip
      
      * PR suggestions
      
      * add more details to the compilation docs
      
      * fix cache positions
      
      * cache is now init in generate; update docs
      
      * tag test as flaky
      
      * docs
      
      * post rebase make fixup and other nits
      
      * remove unintended changes
      
      * whisper (encoder-decoder) not supported
      
      * move token default updates to ; add tests for token defaults
      
      * push changes
      
      * manual rebase
      
      * chameleon doesn't support this
      
      * fix test_static_cache_mha_mqa_gqa (broken in another PR)
      
      * docs: dynamic is better with end-to-end compilation
  7. 26 Jul, 2024 2 commits
  8. 25 Jul, 2024 3 commits
  9. 24 Jul, 2024 3 commits
  10. 23 Jul, 2024 9 commits
  11. 22 Jul, 2024 4 commits
  12. 19 Jul, 2024 3 commits
  13. 18 Jul, 2024 2 commits
    • Add torch.compile Support For Mamba (#31247) · c75969ee
      Longjie Zheng authored
      * modify mamba cache
      
      * set up cache
      
      * add test
      
      * [run-slow] mamba
      
      * [run-slow] mamba
      
      * address comments
      
      * [run-slow] mamba
      
      * use_cache_position
      
      * [run-slow] mamba
      
      * [run-slow] mamba
      
      * [run-slow] mamba
      
      * [run-slow] mamba
      
      * fix
      
      * cache in generate
      
      * [run-slow] mamba
      
      * address comments
      
      * [run-slow] mamba
      
      * [run-slow] mamba
      
      * address comments
      
      * [run-slow] mamba
      
      * fix
      
      * [run-slow] mamba
      
      * fix
      
      * [run-slow] mamba
      
      * fix cache name
      
      * [run-slow] mamba
    • Chameleon: minor fixes after shipping (#32037) · 673d30b8
      Raushan Turganbay authored
      * fix merging
      
      * make chameleon conditional