1. 29 Jul, 2024 2 commits
    • Joao Gante's avatar
      Generate: end-to-end compilation (#30788) · 7ffe25f2
      Joao Gante authored
      * mvp
      
      * added test (a few models need fixes)
      
      * fix a few test cases
      
      * test nits
      
      * harder test 馃槇
      
      * revert changes in stablelm
      
      * test with improved condition
      
      * add todo
      
      * tmp commit
      
      * merged with main
      
      * nits
      
      * add todo
      
      * final corrections
      
      * add docs for generation compilation
      
      * docs nits
      
      * add  tip
      
      * PR suggestions
      
      * add more details to the compilation docs
      
      * fix cache positions
      
      * cache is now init in generate; update docs
      
      * tag test as flaky
      
      * docs
      
      * post rebase make fixup and other nits
      
      * remove unintended changes
      
      * whisper (encoder-decoder) not supported
      
      * move token default updates to ; add tests for token defaults
      
      * push changes
      
      * manual rebase
      
      * chameleon doesn't support this
      
      * fix test_static_cache_mha_mqa_gqa (broken in another PR)
      
      * docs: dynamic is better with end-to-end compilation
      7ffe25f2
    • Raushan Turganbay's avatar
      馃毃 Bloom support for cache class (#31445) · f7396876
      Raushan Turganbay authored
      
      
      * bloom dynamic cache
      
      * bloom follows standard cache format
      
      * no skips for bloom anymore
      
      * use cache position when possible
      
      * clean up
      
      * codestyle
      
      * Update src/transformers/models/bloom/modeling_bloom.py
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Update src/transformers/models/bloom/modeling_bloom.py
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * Update src/transformers/models/bloom/modeling_bloom.py
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * pr comments
      
      * isinstance fix
      
      * address comments
      
      * make musicgen test happy
      
      * [run-slow] bloom
      
      ---------
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      f7396876
  2. 26 Jul, 2024 5 commits
  3. 25 Jul, 2024 3 commits
  4. 24 Jul, 2024 5 commits
  5. 23 Jul, 2024 12 commits
  6. 22 Jul, 2024 9 commits
  7. 19 Jul, 2024 4 commits