1. 08 Jul, 2020 1 commit
  2. 03 Jun, 2020 1 commit
    • Shucai Xiao's avatar
      Bert fuse slice reshape trans contiguous (#542) · 93be5e2b
      Shucai Xiao authored
      
      
      * fix pad calc
      
      * Add decompose pass
      
      * Add decompose test
      
      * Formatting
      
      * bert tf passes correctness
      
      * formatting
      
      * Add remap
      
      * Formatting
      
      * add test
      
      * formatting
      
      * remove comment
      
      * Add compute method for dot
      
      * Formatting
      
      * add inline
      
      * Add finder for horizontal fusion
      
      * Formatting
      
      * Formatting
      
      * Reuse predicate
      
      * formatting
      
      * fix order for literal
      
      * formatting
      
      * add test for gelu
      
      * formatting
      
      * added add_gelu fusion
      
      * Add gemm fusions
      
      * Formatting
      
      * add files
      
      * formatting
      
      * test no mul_add
      
      * formatting
      
      * progress on div
      
      * formatting
      
      * continue work on pass
      
      * remove layernorm opt
      
      * revert reduce file
      
      * Add some fixes for convolution
      
      * Formatting
      
      * Fix shape tests
      
      * Formatting
      
      * Reuse axis equal
      
      * Add initial split fusion
      
      * Formatting
      
      * Update offset
      
      * Workaround outputs that cant accept nonstandard shapes
      
      * Formatting
      
      * Add check for split concat
      
      * Formatting
      
      * Add missing headers
      
      * Formatting
      
      * Add tests
      
      * Formatting
      
      * add optimization for bert
      
      * code backup for bert optimization
      
      * continue testing
      
      * formatting
      
      * fix matcher
      
      * formatting
      
      * add gelu_fn and tests
      
      * formatting
      
      * fix matcher, remove extra tests
      
      * formatting
      
      * fix matcher
      
      * add missing files
      
      * add find_layernorm
      
      * add add_transpose to cmake file
      
      * code backup for the contigous fusion
      
      * refine ops fusion
      
      * clang format
      
      * fixed bug in previous optimization
      
      * clang format
      
      * add more optimization
      
      * remove unnecessary code
      
      * refinement of the fustion code
      
      * clang format
      
      * fixed a bug
      
      * add used_once
      
      * formatting
      
      * start on new gelu
      
      * formatting
      
      * add matchers in fuse_ops
      
      * formatting
      
      * add dce to fix add_gelu
      
      * add simplify_rsqrt and test
      
      * formatting
      
      * debugging value for matcher
      
      * formatting
      
      * add more to matchers
      
      * formatting
      
      * fix errors
      
      * remove onnx gen
      
      * add any_arg, change matchers to use either_arg
      
      * formatting
      
      * clang format
      
      * formatting
      
      * add used_once
      
      * formatting
      
      * code cleanup
      
      * clang format
      
      * fixed a bug
      
      * remove unnecessary code
      
      * refine comments
      
      * optimize bert to remove more contiguous
      
      * clang format
      
      * remove unnecessary code
      
      * add unit tests for bert optimization
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * refine a fusion of reshape and slice
      
      * clang format
      
      * fix cppcheck error
      
      * fix review comments
      
      * add the fusion of slice and transpose
      
      * clang format
      
      * add another optimization to fuse slice and transpose
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments
      Co-authored-by: default avatarKhalique <15948690+kahmed10@users.noreply.github.com>
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      Co-authored-by: default avatarShucai Xiao <scxiao@prj47-rack-99.local.lan>
      93be5e2b
  3. 28 May, 2020 1 commit
  4. 21 May, 2020 1 commit
  5. 15 May, 2020 1 commit
  6. 08 May, 2020 1 commit
    • Paul Fultz II's avatar
      Horizontal fusions of gemms and convolutions (#472) · 1a4ff504
      Paul Fultz II authored
      
      
      * Add decompose pass
      
      * Add decompose test
      
      * Formatting
      
      * Add remap
      
      * Formatting
      
      * Add compute method for dot
      
      * Formatting
      
      * Add finder for horizontal fusion
      
      * Formatting
      
      * Formatting
      
      * Reuse predicate
      
      * Add gemm fusions
      
      * Formatting
      
      * Add some fixes for convolution
      
      * Formatting
      
      * Fix shape tests
      
      * Formatting
      
      * Reuse axis equal
      
      * Add initial split fusion
      
      * Formatting
      
      * Update offset
      
      * Workaround outputs that cant accept nonstandard shapes
      
      * Formatting
      
      * Add check for split concat
      
      * Formatting
      
      * Add missing headers
      
      * Formatting
      
      * Add tests
      
      * Formatting
      
      * Add more testing
      
      * Formatting
      
      * Fix when there is duplicate splits in inputs
      
      * Formatting
      
      * Fix mismatch iterators
      
      * Add tests for dot fusions
      
      * Formatting
      
      * Add test for convolution
      
      * Formatting
      
      * Fix tidy issues
      
      * Add more tests
      
      * Formatting
      
      * Ignore build directory for codecov
      
      * Add test for groups
      
      * Formatting
      
      * Add more tests for groups
      
      * Formatting
      
      * Add test for missing end slice
      
      * Add newline
      
      * Remove unused function
      
      * Add support for when beta is not 1
      
      * Formatting
      
      * Add test for scalar
      
      * Add one more scalar test
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      1a4ff504
  7. 06 May, 2020 1 commit
  8. 08 Apr, 2020 1 commit
  9. 02 Apr, 2020 1 commit
  10. 04 Mar, 2020 1 commit
  11. 15 Feb, 2020 1 commit
    • Shucai Xiao's avatar
      Change api to multiple prog outputs (only API change) (#433) · 1b692d0f
      Shucai Xiao authored
      
      
      * Add initial api
      
      * Formatting
      
      * Add more api
      
      * Formatting
      
      * Add auto api generation
      
      * Formatting
      
      * Fix some compilation errors
      
      * Change handle struct
      
      * Formatting
      
      * Fix reamining compilation errors
      
      * Formatting
      
      * Simplify using ctype
      
      * Formatting
      
      * Initial c++ generation
      
      * Formatting
      
      * Add C++header
      
      * Formatting
      
      * Add test
      
      * Formatting
      
      * Add initial tests
      
      * Formatting
      
      * Try to fix formatting
      
      * Cleanup formatting
      
      * Formatting
      
      * Fix constructors on the same line
      
      * Fix tests
      
      * Formatting
      
      * Fix tidy issues
      
      * Fix tidy issues
      
      * Fix naming issue
      
      * Add onnx API to parse buffer
      
      * Formatting
      
      * Add arguments api
      
      * Formatting
      
      * Fix verify parameters
      
      * Fix cppcheck issues
      
      * Formatting
      
      * Add method to get output shapes and bytes
      
      * Formatting
      
      * Try formatting
      
      * Formatting
      
      * Improve the test coverage
      
      * Formatting
      
      * Add print method
      
      * Formatting
      
      * Fix cppcheck issue
      
      * Fix package dependency
      
      * change migraphx api to support multiple program outputs
      
      * clang format
      
      * change api implementation
      
      * clang format
      
      * fix a build error
      
      * change api for correct automatic generation
      
      * clang format
      
      * Add nolint
      
      * Try fix formatting
      
      * Formatting
      
      * formatting
      
      * formatting
      
      * Fix formatting
      
      * code cleanup
      
      * clang format
      
      * fix cppcheck error
      
      * fix review comments
      
      * clang format
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      Co-authored-by: default avatarkahmed10 <15948690+kahmed10@users.noreply.github.com>
      1b692d0f
  12. 15 Nov, 2019 1 commit
    • Paul Fultz II's avatar
      Add option to do offload copying automatically (#403) · 81b0ff5d
      Paul Fultz II authored
      * Add compiler options
      
      * Add copy operators
      
      * Formatting
      
      * Use run_passes in tests
      
      * Formatting
      
      * Use run_pass in schedule test
      
      * Formatting
      
      * Add compile_options to get_passes in target
      
      * Formatting
      
      * Offload copy option
      
      * Formatting
      
      * Copy using pinned memory
      
      * Formatting
      
      * Improve performance of gpu copying
      
      * Formatting
      
      * Dont copy
      
      * Formatting
      
      * Always make an extra copy
      
      * Formatting
      
      * Remove unused write op
      
      * Add missing include
      
      * Remove copy_to_gpu function in python api
      
      * Make offload copy disabled by default on C++
      
      * Formatting
      
      * Fix tidy issues
      
      * Formatting
      
      * Fix namespace
      
      * Fix python tests
      
      * Turn clang format off since its broken
      
      * Fix compile error on gcc 5
      
      * Remove commented code
      81b0ff5d
  13. 16 Oct, 2019 1 commit
    • Paul Fultz II's avatar
      Fuse the add of two convolutions (#386) · 65ea9194
      Paul Fultz II authored
      * Fuse convolution adds
      
      * Formatting
      
      * Fuse more 1x1 convs
      
      * Add some tests
      
      * Formatting
      
      * Add test for 1x1
      
      * Add verification for add-conv fusions
      
      * Fix stride calculation
      
      * Formatting
      
      * Add more tests
      
      * Rename tests
      65ea9194
  14. 20 Sep, 2019 1 commit
  15. 26 Aug, 2019 4 commits
  16. 09 Jul, 2019 2 commits
  17. 08 Jul, 2019 4 commits
  18. 22 May, 2019 2 commits
  19. 03 Apr, 2019 2 commits
  20. 14 Nov, 2018 1 commit
  21. 07 Nov, 2018 1 commit
  22. 16 Oct, 2018 2 commits