• Split k f16 (#97) · e221d11e
    zjing14 authored
    
    
    * init for splitk f16
    
    * a working prototype
    
    * debug
    
    * perf debug
    
    * update example
    
    * instances for mk kn
    
    * add instances for all layers
    
    * clean
    
    * clean
    
    * add tuning
    
    * format
    
    * add mn_padding into irregular tile
    
    * clean
    Co-authored-by: Chao Liu <chao.liu2@amd.com>
profile_gemm.cpp 8.08 KB