1. 09 Jul, 2024 2 commits
  2. 08 Jul, 2024 10 commits
  3. 05 Jul, 2024 6 commits
    • Daniël de Kok's avatar
      Consistently take `prefix` in model constructors (#2191) · 05c094fc
      Daniël de Kok authored
      * Consistently take `prefix` in model constructors
      
      * Release test check fix
      
      * Misc refactor-related fixes
      05c094fc
    • Daniël de Kok's avatar
      GPTQ CI improvements (#2151) · 67ef0649
      Daniël de Kok authored
      * Add more representative Llama GPTQ test
      
      The Llama GPTQ test is updated to use a model with the commonly-used
      quantizer config format and activation sorting. The old test is
      kept around (but renamed) since it tests the format produced by
      `text-generation-server quantize`.
      
      * Add support for manually triggering a release build
      67ef0649
    • Daniël de Kok's avatar
      Fix Starcoder2 after refactor (#2189) · b67d4633
      Daniël de Kok authored
      b67d4633
    • Nicolas Patry's avatar
      Hotfixing after refactor. · 853d4eb9
      Nicolas Patry authored
      853d4eb9
    • Nicolas Patry's avatar
      Refactor dead code - Removing all `flash_xxx.py` files. (#2166) · fb2f74e2
      Nicolas Patry authored
      * Refactor dead code.
      
      * First working step.
      
      * Remove a lot of duplicated code.
      
      * More dead code.
      
      * More cleanup.
      
      * Fix Santacoder test.
      
      * Fixing the simple tests.
      
      * Fixing sharding.
      
      * Fixes for VLM.
      
      * Fixing santacoder (num_kv_heads hardcoded).
      
      * Removing more dead code.
      
      * Fixing `config.n_head`.
      
      * Stopping earlier because of `<end_of_utterance>` in idefics2.
      
      * Addresses comments.
      
      * Removing the dead code.
      
      * Fuse back mistral into FlashCausalLM.
      
      * Finish removal.
      
      * Fixing docs + causal_lm `batch_class`.
      
      * Fixing docs + causal.lm.
      
      * Add default to Gemma Causality.
      
      * Default value for gemma/gemma2.
      
      * Wrong default.
      fb2f74e2
    • Aaron Mihalik's avatar
      Adding "longrope" for Phi-3 (#2172) (#2179) · c6bcadf8
      Aaron Mihalik authored
      Adding "longrope" for phi-3
      c6bcadf8
  4. 04 Jul, 2024 1 commit
  5. 03 Jul, 2024 5 commits
  6. 02 Jul, 2024 6 commits
  7. 01 Jul, 2024 10 commits