1. 27 Sep, 2019 1 commit
    • Ceil floor operators (#375) · 7d06cdbd
      Shucai Xiao authored
      * add two operators ceil and floor
      
      * clang format
      
      * add unit test for the ceil and floor operators
      
      * remove unintended code
  2. 25 Sep, 2019 1 commit
    • Reduce_min/max operators (#363) · 3962c2ad
      Shucai Xiao authored
      * first version of refactoring reduce operators.
      
      * clang format
      
      * refactor the gpu implementation of the reduce_mean operator
      
      * clang format
      
      * refactor gpu implementation of the reduce_sum operator
      
      * fix cppcheck error
      
      * fix cppcheck error
      
      * fix cppcheck error
      
      * fix review comments
      
      * clang format
      
      * fix a Jenkins error
      
      * fixed review comments
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * add implementation of reduce_min and reduce_max
      
      * clang format
      
      * add unit test for reduce_min/max operator
      
      * clang format
      
      * add more unit tests
      
      * clang format
      
      * fix review comments
  3. 18 Sep, 2019 1 commit
    • Remove gemm copy and simplify rocblas call (#356) · a0f9b785
      Shucai Xiao authored
      * Remove extra copy in gemm
      
      * combine rocblas gemm call
      
      * clang format
      
      * fix a bug in calling rocblas function
      
      * clang format
      
      * backup of temporary changes
      
      * clang format
      
      * unify the gemm call to avoid multiple gpu implementations
      
      * clang format
      
      * remove unnecessary code
      
      * backup temp changes
      
      * clang format
      
      * fix cppcheck error
      
      * code backup
      
      * clang format
      
      * remove unnecessary synchronization function
      
      * clang format
      
      * fix bugs
      
      * clang format
      
      * more optimization related to gemm
      
      * clang format
      
      * code cleanup
      
      * implementation that can achieve better performance
      
      * clang format
      
      * temp changes to try performance
      
      * clang format
      
      * revert to previous commits
      
      * fixed review comments
      
      * clang format
      
      * fix review comments
  4. 27 Aug, 2019 2 commits
  5. 14 Aug, 2019 1 commit
  6. 06 Aug, 2019 2 commits
  7. 05 Aug, 2019 2 commits
  8. 03 Aug, 2019 1 commit
  9. 02 Aug, 2019 1 commit
  10. 01 Aug, 2019 1 commit
  11. 12 Jul, 2019 2 commits
  12. 11 Jul, 2019 2 commits
  13. 10 Jul, 2019 2 commits
  14. 09 Jul, 2019 2 commits
  15. 03 Jul, 2019 3 commits
  16. 02 Jul, 2019 1 commit
  17. 25 Jun, 2019 1 commit
  18. 24 Jun, 2019 1 commit
  19. 20 Jun, 2019 1 commit
  20. 19 Jun, 2019 1 commit
  21. 04 Jun, 2019 2 commits
  22. 03 Jun, 2019 4 commits
  23. 24 May, 2019 1 commit
  24. 23 May, 2019 1 commit
  25. 10 May, 2019 3 commits