"src/geometry/cuda/edge_coarsening_impl.hip" did not exist on "f1689ad0e12c2d6f4b00b7564b9b81dcc1301a39"
  • Jianfeng Yan's avatar
    Batched gemm and reduction (#156) · 34c661e7
    Jianfeng Yan authored
    * adding batched_gemm_and_reduction
    
    * batched_gemm_reduce works with bactch_count=1
    
    * fix a bug in grid_size; batched_gemm_reduce works for batch_count > 1
    
    * adding profiler for batched_gemm_fp16
    
    * fixed a bug in declaration of d1 and d0; both example and profiler work
    
    * clang-format
    
    * cleanup
    
    * batched_gemm_reduce: add test
    
    * minor change
    
    * fixed some typo in function names
    34c661e7
gemm_xdl_int8.cpp 8.4 KB