    [Enhancement] Update warp specialization checking (#580) · 6cede73d
    Lei Wang authored
    * Fix L2 cache size calculation to handle symbolic expressions and ensure float conversion of hit ratios in annotation
    
    * [Enhancement] Update warp specialization check in phase.py
    
    * lint fix
    
    * [Enhancement] Add ContainsSeqStmt method to improve statement handling in merge_shared_memory_allocations.cc
    
    * [Refactor] Simplify memory copy operations in GEMM kernel tests
    
    - Updated memory copy operations in `test_tilelang_kernel_gemm.py` to use shared memory allocations for both A and B matrices, improving clarity and performance.
    - Adjusted the main execution block to include a new `run_gemm_rs` function call for testing, enhancing the test structure.
    
    * revert memory reuse pass.
    
    * revert the memory reuse and thread sync passes.
    
    * Update test_tilelang_kernel_gemm.py
    
    * Update test_tilelang_kernel_mha_bwd.py
test_tilelang_kernel_gemm.py 11 KB