    [Example] Add example (#894) · 4424fa9a
    Lei Wang authored
    * [Refactor] Enhance CopyNode Lower method to support disable_tma flag and improve flash attention implementation
    
    * Updated the CopyNode Lower method to correctly include the disable_tma flag in the GetCopyInst call.
    * Refactored the flash attention implementation to selectively disable TMA for specific copy operations while allowing it for others.
    * Addressed linting issues for improved code quality.
    
    * Sparse MLA kernels
    
    * Remove deprecated sparse MLA and utility files to streamline the codebase.
README.md 14 Bytes