1. 10 Jun, 2022 1 commit
  2. 07 Jun, 2022 1 commit
  3. 03 Jun, 2022 1 commit
    • Paul Fultz II's avatar
      Group code objects by kernel name in perf report summary (#1234) · 7271ddbc
      Paul Fultz II authored
      Break up the gpu::code_object  print to show the actual kernels...
      
      gpu::code_object::add_kernel: 0.646121ms, 5%
      gpu::code_object::mul_kernel: 0.623822ms, 5%
      gpu::code_object::add_mul_erf_add_mul_mul_kernel: 0.498902ms, 4%
      gpu::code_object::mul_add_kernel: 0.478352ms, 4%
      7271ddbc
  4. 02 Jun, 2022 3 commits
  5. 30 May, 2022 1 commit
    • shivadbhavsar's avatar
      Improve eliminate contiguous pass (#1223) · 86061b4d
      shivadbhavsar authored
      Following up on issue #1166 and PR #1220. Using the same approach as in #1220 for parallelizing the eval calls, we can significantly reduce the time spent on eliminate_contiguous pass.
      86061b4d
  6. 26 May, 2022 2 commits
    • shivadbhavsar's avatar
      Parallelize evaluations in propagate_constant (#1220) · bf603a76
      shivadbhavsar authored
      Addressing issue #1166 - propagate_constant pass currently uses a recursive approach to find all instructions in a module that can be evaluated to a literal and performs the replacement in the same call.
      
      New approach:
      
      Perform single pass though instructions in the module to determine which instructions can be evaluated
      Evaluate selected instructions in parallel
      Replace the selected instructions with the corresponding literal
      bf603a76
    • Paul Fultz II's avatar
      Upgrade to cppcheck 2.8 and fix new issues found (#1225) · a401e72a
      Paul Fultz II authored
      * Upgrade to cppcheck 2.8
      a401e72a
  7. 25 May, 2022 3 commits
  8. 24 May, 2022 4 commits
  9. 23 May, 2022 2 commits
  10. 20 May, 2022 2 commits
  11. 19 May, 2022 1 commit
  12. 18 May, 2022 1 commit
  13. 17 May, 2022 3 commits
  14. 12 May, 2022 3 commits
  15. 11 May, 2022 5 commits
  16. 10 May, 2022 3 commits
  17. 09 May, 2022 1 commit
  18. 06 May, 2022 1 commit
  19. 05 May, 2022 1 commit
    • Paul Fultz II's avatar
      Cppcheck fixes (#1195) · d582425b
      Paul Fultz II authored
      Fixes the #error when using cppcheck. This no longer suppresses cppcheck errors when including those errors. This fixes the cppcheck errors that was there already.
      d582425b
  20. 03 May, 2022 1 commit
    • Paul's avatar
      Format · bb0fff52
      Paul authored
      bb0fff52