1. 17 Jun, 2022 6 commits
  2. 13 Jun, 2022 4 commits
  3. 10 Jun, 2022 1 commit
  4. 09 Jun, 2022 2 commits
  5. 07 Jun, 2022 1 commit
  6. 03 Jun, 2022 1 commit
    • Paul Fultz II's avatar
      Group code objects by kernel name in perf report summary (#1234) · 7271ddbc
      Paul Fultz II authored
      Break up the gpu::code_object  print to show the actual kernels...
      
      gpu::code_object::add_kernel: 0.646121ms, 5%
      gpu::code_object::mul_kernel: 0.623822ms, 5%
      gpu::code_object::add_mul_erf_add_mul_mul_kernel: 0.498902ms, 4%
      gpu::code_object::mul_add_kernel: 0.478352ms, 4%
      7271ddbc
  7. 02 Jun, 2022 1 commit
  8. 26 May, 2022 1 commit
  9. 25 May, 2022 2 commits
  10. 24 May, 2022 5 commits
  11. 20 May, 2022 1 commit
    • kahmed10's avatar
      Rename pointwise ops (#1145) · 4a312201
      kahmed10 authored
      For clarity on kernel names found when profiling. The new names are set to the order of the ops being compiled. For example: add + relu = add_relu_kernel.
      4a312201
  12. 18 May, 2022 3 commits
  13. 17 May, 2022 1 commit
  14. 11 May, 2022 1 commit
  15. 09 May, 2022 1 commit
  16. 06 May, 2022 3 commits
  17. 05 May, 2022 6 commits
    • Paul Fultz II's avatar
      Cppcheck fixes (#1195) · d582425b
      Paul Fultz II authored
      Fixes the #error when using cppcheck. This no longer suppresses cppcheck errors when including those errors. This fixes the cppcheck errors that was there already.
      d582425b
    • Paul's avatar
      Format · fd313588
      Paul authored
      fd313588
    • Paul's avatar
      Add namespace · 9ff87ee1
      Paul authored
      9ff87ee1
    • Paul's avatar
      Format · bf6cf5b0
      Paul authored
      bf6cf5b0
    • Paul's avatar
      Whitelist operators · f7a59edb
      Paul authored
      f7a59edb
    • Paul's avatar
      Format · 561456e7
      Paul authored
      561456e7