"src/include/blockwise_3d_tensor_op.hpp" did not exist on "96ee9571e2c96ba6eb6972da1be75453d6c6e9fa"
  • Chao Liu's avatar
    Tweak GEMM kernel (#38) · b3e8d57d
    Chao Liu authored
    * add parameters
    
    * tweak gemm
    
    * tweak
    
    * update conv
    
    * update script
    
    * adding bwd 1x1
    
    * update script
    
    * adding 1x1 bwd
    
    * debugging bwd 1x1 failure
    
    * update script
    
    * update script
    
    * test
    
    * test v100
    
    * clean up
    b3e8d57d
host_gemm.hpp 4.55 KB