"vscode:/vscode.git/clone" did not exist on "ebe38f3d480b5f6ebec59d6f89fbbcec692073fb"
  • Chao Liu's avatar
    Gemm+Reduce Fusion (#128) · f95267f1
    Chao Liu authored
    * add gridwise gemm v4r1
    
    * rename
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * use sfc in shuffling
    
    * remove hardcode
    
    * remove hardcode
    
    * refactor
    
    * fix build
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * format
    
    * clean
    
    * adding gemm+reduce
    
    * adding profiler for gemm+reduce
    
    * adding gemm+reduce profiler
    
    * fix build
    
    * clean up
    
    * gemm+reduce
    
    * fix build
    
    * update DeviceGemm_Xdl_CShuffle; update enum to enum class
    
    * clean up
    
    * add test for gemm+reduce
    
    * clean up
    
    * refactor
    
    * fix build
    
    * fix build
    f95267f1
profile_batched_gemm.cpp 15.1 KB