• [Enhancement] Add flash attn example for AMD MI300 series (#671) · e8cc372f
    alex_xiao authored
    
    
    * [Enhancement] Refactor buffer index handling for improved precision and clarity (#668)
    
    - Enhanced buffer index handling to address precision issues by removing redundant operations.
    - Streamlined the logic for determining buffer overlaps, ensuring more accurate conflict detection.
    - Updated related documentation to reflect changes in buffer management practices.
    
    * Remove obsolete test script for AMD example, streamlining the examples directory.
    
    * Remove unused dtype_size variable in AMD example script to streamline code.
    
    ---------
    Co-authored-by: xinxyxiao <xinyxiao@amd.com>
example_amd_flash_attn_fwd.py 9.46 KB