• Adam Osewski's avatar
    CK-Tile first draft of universal block gemm with interwave & intrawave scheduler (#1676) · b6bcd76d
    Adam Osewski authored
    * Block universal gemm.
    
    * Universal block gemm with interwave scheduler - draft.
    
    * Refactoring
    
    * Move a/b_warp_tiles into BlockGemmImpl
    * set BlockGemmImpl as a class member
    
    * Change tile size for more suitable to memory bound cases.
    
    * Introduce kKPerThread to WarpGemm
    
    * Add documentation comment.
    
    * Fix Interwave scheduler block gemm.
    
    * Add compute/memory friendly tile configuration.
    
    * Clean
    
    * New tile configurations in gemm mem example.
    
    * Add more static checks and fix loop order in block gemm.
    
    * Add more static checks and use warp gemm mfma dispatcher.
    
    * Add default scheduler block gemm.
    
    * Remove logging in example.
    b6bcd76d
gemm.hpp 2.81 KB