1. 29 Jul, 2024 1 commit
    • Joao Gante's avatar
      Generate: end-to-end compilation (#30788) · 7ffe25f2
      Joao Gante authored
      * mvp
      
      * added test (a few models need fixes)
      
      * fix a few test cases
      
      * test nits
      
      * harder test 馃槇
      
      * revert changes in stablelm
      
      * test with improved condition
      
      * add todo
      
      * tmp commit
      
      * merged with main
      
      * nits
      
      * add todo
      
      * final corrections
      
      * add docs for generation compilation
      
      * docs nits
      
      * add  tip
      
      * PR suggestions
      
      * add more details to the compilation docs
      
      * fix cache positions
      
      * cache is now init in generate; update docs
      
      * tag test as flaky
      
      * docs
      
      * post rebase make fixup and other nits
      
      * remove unintended changes
      
      * whisper (encoder-decoder) not supported
      
      * move token default updates to ; add tests for token defaults
      
      * push changes
      
      * manual rebase
      
      * chameleon doesn't support this
      
      * fix test_static_cache_mha_mqa_gqa (broken in another PR)
      
      * docs: dynamic is better with end-to-end compilation
      7ffe25f2
  2. 26 Jun, 2024 1 commit
  3. 23 May, 2024 1 commit
    • Marc Sun's avatar
      Fix accelerate failing tests (#30836) · 8366b572
      Marc Sun authored
      * Fix accelerate tests
      
      * fix clip
      
      * skip dbrx tests
      
      * fix GPTSan
      
      * fix M2M100Model
      
      * same fix as jamba
      
      * fix mt5
      
      * Fix T5Model
      
      * Fix umt5 model
      
      * fix switch_transformers
      
      * fix whisper
      
      * fix gptsan again
      
      * fix siglip recent test
      
      * skip siglip tests
      
      * wrong place fixed
      8366b572
  4. 22 May, 2024 1 commit
  5. 08 May, 2024 1 commit
  6. 18 Apr, 2024 1 commit