    Refactor: Use Llama RoPE implementation for Falcon (#26933) · 05ea7b79
    Tom Aarsen authored
    * Use Llama RoPE implementation for Falcon
    
    + Add copy functionalities
    
    * Use standard cache format for Falcon
    
    * Simplify apply_rotary_pos_emb, copy from Llama
    
    * Remove unnecessary cache conversion test
    
    We don't need to convert any caches anymore!
    
    * Resolve copy complaint
test_modeling_falcon.py 21 KB