    [Enhancement] Enhance let binding handling in layout inference and warp specialized pass (#1484) · 7e8d1f82
    Lei Wang authored
    * [Feature] Add FullyReplicated Fragment Layout and Enhance Layout Inference
    
    * Introduced a new static method `FullyReplicated` in the `Fragment` class to create fully replicated fragment layouts, ensuring all threads hold identical copies of the buffer.
    * Updated `CopyNode` to collect fragment layouts and mark them as fully replicated during layout inference.
    * Enhanced `ParallelOpNode` to expand let bindings for fragment buffer accesses, improving layout inference accuracy.
    * Added documentation for new methods and updated existing methods to support the new layout features.
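
To illustrate what "fully replicated" means in the first bullet above, here is a minimal toy sketch. It is not the actual `Fragment` class from `layout.h`: `ToyFragment`, `LocalExtent`, `Owners`, and the blocked partitioning used for the non-replicated case are all hypothetical names and assumptions made for illustration only.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical toy model of a fragment layout (NOT the real Fragment class).
// A fragment distributes `extent` logical elements across `num_threads` threads.
struct ToyFragment {
  std::size_t num_threads;
  std::size_t extent;
  bool fully_replicated;

  // Assumed factory, loosely mirroring the idea of Fragment::FullyReplicated:
  // every thread holds an identical copy of the whole buffer.
  static ToyFragment FullyReplicated(std::size_t extent, std::size_t num_threads) {
    assert(num_threads > 0);
    return ToyFragment{num_threads, extent, true};
  }

  // Number of elements each thread stores locally.
  // Replicated: the whole buffer. Otherwise: a blocked partition (assumption).
  std::size_t LocalExtent() const {
    if (fully_replicated) return extent;
    return (extent + num_threads - 1) / num_threads;
  }

  // Threads that hold logical element `i`.
  std::vector<std::size_t> Owners(std::size_t i) const {
    std::vector<std::size_t> owners;
    if (fully_replicated) {
      for (std::size_t t = 0; t < num_threads; ++t) owners.push_back(t);
    } else {
      owners.push_back(i / LocalExtent());
    }
    return owners;
  }
};
```

In this toy model, a partitioned layout gives each element exactly one owning thread, while the replicated layout makes every thread an owner, which is why a reader of the buffer never needs data from another thread.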
    
    * lint fix
    
    * Remove debug logging statements from layout inference process to streamline output and improve performance.
layout.h 9.51 KB