    Refactor dead code - Removing all `flash_xxx.py` files. (#2166) · fb2f74e2
    Nicolas Patry authored
    * Refactor dead code.
    
    * First working step.
    
    * Remove a lot of duplicated code.
    
    * More dead code.
    
    * More cleanup.
    
    * Fix Santacoder test.
    
    * Fixing the simple tests.
    
    * Fixing sharding.
    
    * Fixes for VLM.
    
    * Fixing santacoder (`num_kv_heads` hardcoded).
    
    * Removing more dead code.
    
    * Fixing `config.n_head`.
    
    * Stopping earlier because of `<end_of_utterance>` in idefics2.
    
    * Addresses comments.
    
    * Removing the dead code.
    
    * Fuse back mistral into FlashCausalLM.
    
    * Finish removal.
    
    * Fixing docs + causal_lm `batch_class`.
    
    * Fixing docs + causal.lm.
    
    * Add default to Gemma Causality.
    
    * Default value for gemma/gemma2.
    
    * Wrong default.
flash_causal_lm.py 64.8 KB