  1. 08 May, 2020 1 commit
    • Horizontal fusions of gemms and convolutions (#472) · 1a4ff504
      Paul Fultz II authored
      
      
      * Add decompose pass
      
      * Add decompose test
      
      * Formatting
      
      * Add remap
      
      * Formatting
      
      * Add compute method for dot
      
      * Formatting
      
      * Add finder for horizontal fusion
      
      * Formatting
      
      * Formatting
      
      * Reuse predicate
      
      * Add gemm fusions
      
      * Formatting
      
      * Add some fixes for convolution
      
      * Formatting
      
      * Fix shape tests
      
      * Formatting
      
      * Reuse axis equal
      
      * Add initial split fusion
      
      * Formatting
      
      * Update offset
      
      * Workaround outputs that can't accept nonstandard shapes
      
      * Formatting
      
      * Add check for split concat
      
      * Formatting
      
      * Add missing headers
      
      * Formatting
      
      * Add tests
      
      * Formatting
      
      * Add more testing
      
      * Formatting
      
      * Fix when there are duplicate splits in inputs
      
      * Formatting
      
      * Fix mismatch iterators
      
      * Add tests for dot fusions
      
      * Formatting
      
      * Add test for convolution
      
      * Formatting
      
      * Fix tidy issues
      
      * Add more tests
      
      * Formatting
      
      * Ignore build directory for codecov
      
      * Add test for groups
      
      * Formatting
      
      * Add more tests for groups
      
      * Formatting
      
      * Add test for missing end slice
      
      * Add newline
      
      * Remove unused function
      
      * Add support for when beta is not 1
      
      * Formatting
      
      * Add test for scalar
      
      * Add one more scalar test
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  2. 15 Nov, 2019 1 commit
    • Add option to do offload copying automatically (#403) · 81b0ff5d
      Paul Fultz II authored
      * Add compiler options
      
      * Add copy operators
      
      * Formatting
      
      * Use run_passes in tests
      
      * Formatting
      
      * Use run_pass in schedule test
      
      * Formatting
      
      * Add compile_options to get_passes in target
      
      * Formatting
      
      * Offload copy option
      
      * Formatting
      
      * Copy using pinned memory
      
      * Formatting
      
      * Improve performance of gpu copying
      
      * Formatting
      
      * Don't copy
      
      * Formatting
      
      * Always make an extra copy
      
      * Formatting
      
      * Remove unused write op
      
      * Add missing include
      
      * Remove copy_to_gpu function in python api
      
      * Make offload copy disabled by default on C++
      
      * Formatting
      
      * Fix tidy issues
      
      * Formatting
      
      * Fix namespace
      
      * Fix python tests
      
      * Turn clang format off since it's broken
      
      * Fix compile error on gcc 5
      
      * Remove commented code
  3. 04 Nov, 2019 1 commit
  4. 30 Oct, 2019 1 commit
    • Enable scheduler for 1 stream (#399) · ca17bcd6
      Paul Fultz II authored
      * Enable scheduler for 1 stream
      
      * Formatting
      
      * Improve performance of sorting
      
      * Formatting
      
      * Adjust the weight calculation
      
      * Formatting
      
      * Simplify formula
      
      * Formatting
      
      * Avoid division by zero
      
      * Fix scheduler test
      
      * Check for either 1 or 2
      
      * Check for waits when order may change
      
      * Formatting
  5. 28 Aug, 2019 1 commit
  6. 26 Aug, 2019 2 commits
  7. 16 Aug, 2019 1 commit
  8. 15 Aug, 2019 1 commit
  9. 12 Aug, 2019 1 commit
  10. 06 Aug, 2019 1 commit
  11. 10 Jul, 2019 1 commit
  12. 09 Jul, 2019 1 commit
  13. 28 Jun, 2019 1 commit
  14. 10 May, 2019 1 commit
  15. 09 May, 2019 1 commit
  16. 06 May, 2019 1 commit
  17. 17 Apr, 2019 2 commits
  18. 16 Apr, 2019 1 commit
  19. 13 Apr, 2019 1 commit
  20. 28 Mar, 2019 1 commit
  21. 26 Mar, 2019 2 commits
  22. 19 Mar, 2019 1 commit
  23. 13 Mar, 2019 1 commit
  24. 12 Mar, 2019 1 commit
  25. 09 Mar, 2019 2 commits
  26. 04 Mar, 2019 1 commit
  27. 02 Mar, 2019 1 commit
  28. 01 Mar, 2019 2 commits
  29. 28 Feb, 2019 1 commit
  30. 18 Feb, 2019 1 commit
  31. 17 Feb, 2019 1 commit
  32. 12 Feb, 2019 1 commit
  33. 07 Feb, 2019 1 commit
  34. 04 Feb, 2019 2 commits