- 10 Jul, 2020 1 commit
Paul Fultz II authored
* Add initial optimization when using a mul over a sliced convolution * Formatting * Add more tests * Formatting * Convert to an assert * Check if used once * Formatting * Add test with horiz fusion * Formatting * Optimize nested slice * Formatting * Fix test * Add const refs * Remove unnecessary assert
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
- 08 Jul, 2020 1 commit
Shucai Xiao authored
* code backup * clang format * refine the algorithm to support more scenarios * clang format * fix review comments * clang format * add one more unit test to have more code change coverage
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
- 03 Jun, 2020 1 commit
Shucai Xiao authored
* fix pad calc * Add decompose pass * Add decompose test * Formatting * bert tf passes correctness * formatting * Add remap * Formatting * add test * formatting * remove comment * Add compute method for dot * Formatting * add inline * Add finder for horizontal fusion * Formatting * Formatting * Reuse predicate * formatting * fix order for literal * formatting * add test for gelu * formatting * added add_gelu fusion * Add gemm fusions * Formatting * add files * formatting * test no mul_add * formatting * progress on div * formatting * continue work on pass * remove layernorm opt * revert reduce file * Add some fixes for convolution * Formatting * Fix shape tests * Formatting * Reuse axis equal * Add initial split fusion * Formatting * Update offset * Workaround outputs that can't accept nonstandard shapes * Formatting * Add check for split concat * Formatting * Add missing headers * Formatting * Add tests * Formatting * add optimization for bert * code backup for bert optimization * continue testing * formatting * fix matcher * formatting * add gelu_fn and tests * formatting * fix matcher, remove extra tests * formatting * fix matcher * add missing files * add find_layernorm * add add_transpose to cmake file * code backup for the contiguous fusion * refine ops fusion * clang format * fixed bug in previous optimization * clang format * add more optimization * remove unnecessary code * refinement of the fusion code * clang format * fixed a bug * add used_once * formatting * start on new gelu * formatting * add matchers in fuse_ops * formatting * add dce to fix add_gelu * add simplify_rsqrt and test * formatting * debugging value for matcher * formatting * add more to matchers * formatting * fix errors * remove onnx gen * add any_arg, change matchers to use either_arg * formatting * clang format * formatting * add used_once * formatting * code cleanup * clang format * fixed a bug * remove unnecessary code * refine comments * optimize bert to remove more contiguous *
clang format * remove unnecessary code * add unit tests for bert optimization * clang format * fix review comments * clang format * refine a fusion of reshape and slice * clang format * fix cppcheck error * fix review comments * add the fusion of slice and transpose * clang format * add another optimization to fuse slice and transpose * clang format * fix review comments * clang format * fix review comments * clang format * fix review comments
Co-authored-by: Khalique <15948690+kahmed10@users.noreply.github.com>
Co-authored-by: Paul <pfultz2@yahoo.com>
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
Co-authored-by: Shucai Xiao <scxiao@prj47-rack-99.local.lan>
- 28 May, 2020 1 commit
Paul Fultz II authored
* Fuse operators that partially match in a concat operator * Formatting * Remove unused matcher * Properly calculate the output_lens * Formatting * Formatting * Formatting * Fix tidy issue
Co-authored-by: Shucai Xiao <shucai.xiao@amd.com>
- 21 May, 2020 1 commit
Paul Fultz II authored
* Skip fusing group convolutions * Formatting * Fix ICE on gcc 5 * Formatting * Fix gcc check * Formatting
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
- 15 May, 2020 1 commit
Paul Fultz II authored
* Ensure the slices are in order before removing them in concat * Formatting
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
- 08 May, 2020 1 commit
Paul Fultz II authored
* Add decompose pass * Add decompose test * Formatting * Add remap * Formatting * Add compute method for dot * Formatting * Add finder for horizontal fusion * Formatting * Formatting * Reuse predicate * Add gemm fusions * Formatting * Add some fixes for convolution * Formatting * Fix shape tests * Formatting * Reuse axis equal * Add initial split fusion * Formatting * Update offset * Workaround outputs that can't accept nonstandard shapes * Formatting * Add check for split concat * Formatting * Add missing headers * Formatting * Add tests * Formatting * Add more testing * Formatting * Fix when there are duplicate splits in inputs * Formatting * Fix mismatch iterators * Add tests for dot fusions * Formatting * Add test for convolution * Formatting * Fix tidy issues * Add more tests * Formatting * Ignore build directory for codecov * Add test for groups * Formatting * Add more tests for groups * Formatting * Add test for missing end slice * Add newline * Remove unused function * Add support for when beta is not 1 * Formatting * Add test for scalar * Add one more scalar test
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
- 06 May, 2020 1 commit
kahmed10 authored
* add simplify_rsqrt and test * formatting * add used_once check * move used_once * formatting * add multi_use test
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
- 08 Apr, 2020 1 commit
Paul Fultz II authored
* Add test with odd stride * Use same logic as convolution for stride calculation * Formatting
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
- 02 Apr, 2020 1 commit
kahmed10 authored
* fix pad calc * progress on div * formatting * continue work on pass * continue testing * formatting * add recip and sub matcher * formatting * add tests * formatting * fix review comments * remove unnecessary header * remove headers
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
- 04 Mar, 2020 1 commit
mvermeulen authored
* Horizontal fusion of unary and binary ops * Formatting * Fix bugs in matcher * Add tests * Formatting * Add the op generically * Formatting * Rename test for more detail
Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
- 15 Feb, 2020 1 commit
Shucai Xiao authored
* Add initial api * Formatting * Add more api * Formatting * Add auto api generation * Formatting * Fix some compilation errors * Change handle struct * Formatting * Fix remaining compilation errors * Formatting * Simplify using ctype * Formatting * Initial c++ generation * Formatting * Add C++ header * Formatting * Add test * Formatting * Add initial tests * Formatting * Try to fix formatting * Cleanup formatting * Formatting * Fix constructors on the same line * Fix tests * Formatting * Fix tidy issues * Fix tidy issues * Fix naming issue * Add onnx API to parse buffer * Formatting * Add arguments api * Formatting * Fix verify parameters * Fix cppcheck issues * Formatting * Add method to get output shapes and bytes * Formatting * Try formatting * Formatting * Improve the test coverage * Formatting * Add print method * Formatting * Fix cppcheck issue * Fix package dependency * change migraphx api to support multiple program outputs * clang format * change api implementation * clang format * fix a build error * change api for correct automatic generation * clang format * Add nolint * Try fix formatting * Formatting * formatting * formatting * Fix formatting * code cleanup * clang format * fix cppcheck error * fix review comments * clang format
Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
Co-authored-by: kahmed10 <15948690+kahmed10@users.noreply.github.com>
- 15 Nov, 2019 1 commit
Paul Fultz II authored
* Add compiler options * Add copy operators * Formatting * Use run_passes in tests * Formatting * Use run_pass in schedule test * Formatting * Add compile_options to get_passes in target * Formatting * Offload copy option * Formatting * Copy using pinned memory * Formatting * Improve performance of gpu copying * Formatting * Don't copy * Formatting * Always make an extra copy * Formatting * Remove unused write op * Add missing include * Remove copy_to_gpu function in python api * Make offload copy disabled by default on C++ * Formatting * Fix tidy issues * Formatting * Fix namespace * Fix python tests * Turn clang format off since it's broken * Fix compile error on gcc 5 * Remove commented code
- 16 Oct, 2019 1 commit
Paul Fultz II authored
* Fuse convolution adds * Formatting * Fuse more 1x1 convs * Add some tests * Formatting * Add test for 1x1 * Add verification for add-conv fusions * Fix stride calculation * Formatting * Add more tests * Rename tests
- 20 Sep, 2019 1 commit
Paul authored
- 26 Aug, 2019 4 commits
- 09 Jul, 2019 2 commits
- 08 Jul, 2019 4 commits
- 22 May, 2019 2 commits
- 03 Apr, 2019 2 commits
Shucai Xiao authored
-
Shucai Xiao authored
- 14 Nov, 2018 1 commit
Paul authored
- 07 Nov, 2018 1 commit
Paul authored
- 16 Oct, 2018 2 commits