• Lei Wang's avatar
    [Bugfix] Fix safe memory legalization for fragment store (#446) · e46653ac
    Lei Wang authored
    * [Enhancement] Improve layout inference accuracy in ParallelOp (#441)
    
    * Added logic to use non-replicated buffers as source buffers for more accurate layout inference.
    * Enhanced comments to clarify the rationale behind buffer selection in layout inference process.
    
    * [Enhancement] Add error handling macros and refactor loop partitioning logic
    
    * Introduced TILELANG_CHECK macro for improved error handling in CUDA and HIP code, providing detailed error messages for kernel launches.
    * Enhanced loop partitioning logic to handle fragment buffers more effectively, ensuring correct replication based on thread extent.
    * Added logging for thread range in PlanLoopPartition to aid in debugging and performance analysis.
    * Updated pass configuration management to streamline vectorization control in the optimization process.
    
    * lint fix
    
    * remove debug print
    
    * [Refactor] Update legalize_safe_memory_access.cc to improve memory access handling
    
    * Replaced Apache License header with MIT License.
    * Added logic to handle local buffer conditions in memory access.
    * Introduced IsLocalBuffer function to check buffer scope.
    * Enhanced comments for clarity on memory access operations.
    e46653ac
legalize_safe_memory_access.cc 9.48 KB