    Gemm+Reduce Fusion (#128) · f95267f1
    Chao Liu authored
    * add gridwise gemm v4r1
    
    * rename
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * use sfc in shuffling
    
    * remove hardcode
    
    * remove hardcode
    
    * refactor
    
    * fix build
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * adding gemm+reduce
    
    * format
    
    * clean
    
    * adding gemm+reduce
    
    * adding profiler for gemm+reduce
    
    * adding gemm+reduce profiler
    
    * fix build
    
    * clean up
    
    * gemm+reduce
    
    * fix build
    
    * update DeviceGemm_Xdl_CShuffle; update enum to enum class
    
    * clean up
    
    * add test for gemm+reduce
    
    * clean up
    
    * refactor
    
    * fix build
    
    * fix build
profile_gemm_bias_relu.cpp 5.23 KB