    [Enhancement] Enhance let binding handling in layout inference and warp specialized pass (#1484) · 7e8d1f82
    Lei Wang authored
    * [Feature] Add FullyReplicated Fragment Layout and Enhance Layout Inference
    
    * Introduced a new static method `FullyReplicated` in the `Fragment` class to create fully replicated fragment layouts, ensuring all threads hold identical copies of the buffer.
    * Updated `CopyNode` to collect fragment layouts and mark them as fully replicated during layout inference.
    * Enhanced `ParallelOpNode` to expand let bindings for fragment buffer accesses, improving layout inference accuracy.
    * Added documentation for new methods and updated existing methods to support the new layout features.
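
To illustrate what "fully replicated" means in the bullets above, here is a minimal, self-contained sketch. It is a toy model, not TileLang's `Fragment` class: the struct `ToyFragment`, its fields, and the cyclic partitioning used for the non-replicated case are all assumptions made for illustration only.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical toy model of a 1-D fragment buffer shared by a group of threads.
// This is NOT the TileLang Fragment API; names and fields are illustrative.
struct ToyFragment {
    std::size_t num_threads;   // threads cooperating on the buffer
    std::size_t extent;        // number of logical elements in the buffer
    bool fully_replicated;     // true: every thread holds the whole buffer

    // Number of elements each thread stores locally.
    std::size_t LocalExtent() const {
        if (fully_replicated) {
            return extent;  // identical full copy per thread
        }
        // Assumed cyclic partition: ceil(extent / num_threads) per thread.
        return (extent + num_threads - 1) / num_threads;
    }

    // Threads that hold logical element i.
    std::vector<std::size_t> Owners(std::size_t i) const {
        std::vector<std::size_t> owners;
        if (fully_replicated) {
            for (std::size_t t = 0; t < num_threads; ++t) {
                owners.push_back(t);  // every thread owns every element
            }
        } else {
            owners.push_back(i % num_threads);  // exactly one owner
        }
        return owners;
    }
};
```

In this sketch a replicated fragment trades per-thread storage for locality: any thread can read any element without communication, which is the property the commit message describes as "all threads hold identical copies of the buffer".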
    
    * lint fix
    
    * Remove debug logging statements from layout inference process to streamline output and improve performance.
parallel.cc 28.1 KB