• feat: add support for flash_attn (#4120) · e15307fd
    Sam authored
    * feat: enable flash attention if supported
    
    * feat: add flash_attn support
server.cpp 126 KB