• Lei Wang's avatar
    [Enhancement] Introduce wrapper util `pythonic_expr` to transform a PrimExpr... · 916ee60e
    Lei Wang authored
    [Enhancement] Introduce wrapper util `pythonic_expr` to transform a PrimExpr into python string (#577)
    
    * [Feature] Add Quarter Bank Swizzle Layout and Update GEMM Layout Logic
    
    - Introduced a new `makeQuarterBankSwizzleLayout` function for layout swizzling of 32 bytes.
    - Updated `makeGemmABLayout` to include an `enable_padding` parameter, allowing for conditional layout selection between padded and quarter bank swizzle layouts.
    - Adjusted layout inference in GEMM operations to utilize the new quarter bank swizzle layout when appropriate.
    - Enhanced bulk copy operations to recognize and handle the new layout type, improving memory access patterns.
    
    * lint fix
    
    * lint fix
    
    * rebase
    
    * rebase
    
    * typo
    
    * requirement fix
    
    * revert flash atten requirenemts
    916ee60e
bulk_copy.cc 15.6 KB