• feat: support phi3.5 moe (#2479) · 93a7042d
    drbh authored
    
    
    * feat: support phi3.5 moe model loading
    
    * fix: prefer llama base model and improve rotary logic
    
    * feat: return reasonable generation and add integration test
    
    * fix: run lint and update docs
    
    * fix: rerun lint for openapi docs
    
    * fix: prefer do_sample false unless temp is set by user, and update chat tests
    
    * fix: small typo adjustments
    
    * fix: consolidate long rope paths
    
    * fix: revert greedy by default and test changes
    
    * Vendor configuration so that we don't have to `trust_remote_code`
    
    * Use SparseMoELayer
    
    * Add support for dense MoE
    
    * Some type annotations
    
    * Add the usual model tests
    
    * Ruff.
    
    ---------
    Co-authored-by: Daniël de Kok <me@danieldk.eu>
    Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
supported_models.md 2.81 KB