  1. 18 Sep, 2019 1 commit
    • Remove gemm copy and simplify rocblas call (#356) · a0f9b785
      Shucai Xiao authored
      * Remove extra copy in gemm
      
      * combine rocblas gemm call
      
      * clang format
      
      * fix a bug in calling rocblas function
      
      * clang format
      
      * backup of temporary changes
      
      * clang format
      
      * unify the gemm call to avoid multiple GPU implementations
      
      * clang format
      
      * remove unnecessary code
      
      * backup temp changes
      
      * clang format
      
      * fix cppcheck error
      
      * code backup
      
      * clang format
      
      * remove unnecessary synchronization function
      
      * clang format
      
      * fix bugs
      
      * clang format
      
      * more optimization related to gemm
      
      * clang format
      
      * code cleanup
      
      * implementation that achieves better performance
      
      * clang format
      
      * temp changes to try performance
      
      * clang format
      
      * revert to previous commits
      
      * fixed review comments
      
      * clang format
      
      * fix review comments
  2. 02 May, 2019 1 commit
  3. 27 Nov, 2018 1 commit
  4. 14 Nov, 2018 2 commits
  5. 06 Nov, 2018 11 commits
  6. 18 Oct, 2018 2 commits
  7. 26 Sep, 2018 2 commits
  8. 19 Sep, 2018 2 commits
  9. 16 Sep, 2018 2 commits
  10. 13 Sep, 2018 1 commit
  11. 11 Sep, 2018 1 commit
  12. 01 Sep, 2018 1 commit
  13. 27 Aug, 2018 1 commit
  14. 23 Aug, 2018 1 commit
  15. 22 Aug, 2018 2 commits
  16. 25 Jul, 2018 2 commits
  17. 18 Jul, 2018 1 commit
  18. 16 Jul, 2018 1 commit