• Lei Wang's avatar
    [Examples] Implement NSA Backward kernels (#180) · 6891d3ec
    Lei Wang authored
    
    * Update native sparse attention example with scale parameter handling
    
    - Add scale parameter processing in native_sparse_attention function
    - Modify example script to include custom scale value
    - Update function calls to pass scale parameter
    - Enhance flexibility of sparse attention implementation
    
    * Refactor Triton Native Sparse Attention Example
    
    - Improve code formatting and readability in example_triton_nsa_bwd.py
    - Standardize function and parameter alignment
    - Remove unnecessary whitespaces and optimize imports
    - Enhance code style consistency with previous commits
    6891d3ec
example_tilelang_nsa_fwd.py 6.46 KB