- 10 Oct, 2023 1 commit
-
-
Khalique Ahmed authored
-
- 02 Oct, 2023 5 commits
-
-
Khalique Ahmed authored
-
Khalique Ahmed authored
-
Khalique Ahmed authored
-
Khalique Ahmed authored
-
Khalique Ahmed authored
-
- 19 Sep, 2023 1 commit
-
-
shivadbhavsar authored
This resolves an edge case found in multiple huggingface models in some cases the find_split_reshape matcher will match with reshape2, but vec_rsp will consist of reshape1 dims causing a shape mismatch error. Solution is to include rsp when checking all reshapes are the same.
-
- 06 Sep, 2023 1 commit
-
-
Umang Yadav authored
-
- 08 Aug, 2023 1 commit
-
-
kahmed10 authored
* add quant_dot fusion, clip literal opt
-
- 30 Jun, 2023 1 commit
-
-
Paul Fultz II authored
-
- 17 Jun, 2023 1 commit
-
-
Ted Themistokleous authored
* Add trace for SIMPLIFY_ALGEBRA matches * Fix format * handle review comments from Umang -int to size_t for trace -move env arg to top of simplify_algebra.cpp -handle overload beter for find_matches * Rename trace_mod param to trace_pass More representative naming for what this trace flag does
-
- 28 May, 2023 1 commit
-
-
Paul Fultz II authored
* Allow quantizing for both int8 and fp16
-
- 04 May, 2023 1 commit
-
-
Paul Fultz II authored
When multiplying either the input or output across the K dimensions then the multiple can be applied to the constant which can then be folded with propagate_const.
-
- 02 May, 2023 1 commit
-
-
Paul Fultz II authored
Improves the constant propagation for bert models. Larger batch size no longer use as large of constants. Also improves the speed of model compilation
-
- 05 Apr, 2023 1 commit
-
-
Paul Fultz II authored
This will replace conv(x+a, w) with conv(x, w) + conv(a, w) where a is a constant so conv(a, w) can be replaced with a constant.
-
- 13 Feb, 2023 1 commit
-
-
kahmed10 authored
Using add_instruction for the neg op was causing issues on replace_instruction. Changed to use insert_instruction. Tests and added a new one that is failing without the change.
-
- 06 Feb, 2023 1 commit
-
-
Paul Fultz II authored
* Fuse layernorm with different patterns * Only match when using the last axis Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com> Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com>
-
- 13 Jan, 2023 1 commit
-
-
shivadbhavsar authored
This PR resolves the bug addressed in #1496.
-
- 27 Oct, 2022 1 commit
-
-
Chris Austen authored
Upgraded Dockerfiles and fixed tidy issues to make Ubuntu 20.04 and ROCm 5.3.0 the default
-
- 07 Oct, 2022 1 commit
-
-
Ted Themistokleous authored
Simplified algebraic operations (x*1), x*(-1), x/1, 0+x & x+0, x-0, 0-x, 0*x, x*0, and 0/x operations
-
- 29 Sep, 2022 1 commit
-
-
Paul Fultz II authored
* Fix invalid program from find_splits
-
- 21 Sep, 2022 1 commit
-
-
Charlie Lin authored
Change find_mul_conv to work with multibroadcast also. Checks the strides instead of the broadcast axis.
-
- 14 Sep, 2022 1 commit
-
-
Umang Yadav authored
* fix slice_dim1 for case
-
- 06 Sep, 2022 1 commit
-
-
Paul Fultz II authored
Using not and or improves readability. The cppcheck rule will help ensure we are doing it consistently.
-
- 29 Aug, 2022 1 commit
-
-
Umang Yadav authored
reshape op requires standard shape. During simplify_algebra, it inserts reshapes without checking for this requirement.
-
- 27 Aug, 2022 1 commit
-
-
Paul Fultz II authored
This will rewrite dot operators like X(Y + b) to XY + Xb when b is constant as we can fold the add away. This improves handling pointwise with broadcasted operators, this helps improves const propagation. Improve gemm fusion with a mul_add Improve support for broadcast shapes in gemm
-
- 22 Jun, 2022 1 commit
-
-
Ted Themistokleous authored
Updated each source file in the repo with the existing license.
-
- 17 May, 2022 1 commit
-
-
shivadbhavsar authored
Updated variable names according to #1193
-
- 11 May, 2022 1 commit
-
-
Paul Fultz II authored
Fuse layernorm and added triadd_layernorm fusion. This is a prep performance booster
-
- 05 May, 2022 1 commit
-
-
Paul Fultz II authored
Fixes the #error when using cppcheck. This no longer suppresses cppcheck errors when including those errors. This fixes the cppcheck errors that was there already.
-
- 02 Mar, 2022 1 commit
-
-
bpickrel authored
Update the base version of clang-format from 5.0 to 10.0
-
- 28 Oct, 2021 1 commit
-
-
Umang Yadav authored
In migraphx, DepthToSpace (d2s) is implemented as reshape --> transpose --> contiguous --> reshape. This PR adds matcher to find d2s + unary pointwise ops. Application of the matcher moves the pointwise unary operation before the contiguous and reshape of the d2s. So it becomes reshape --> transpose --> unary --> contiguous --> reshape. Motivation is that, later pointwise module would be created out of unary --> contiguous --> reshape. Codegen for this pointwise module can write out buffer such that explicit contiguous and reshape wouldn't be required. This transformation is not always guaranteed to improve performance, since unary op will operate on non-standard shape. So, we would need some tuning mechanism to make decision. #905 pending PR for binary operations.
-
- 19 Oct, 2021 1 commit
-
-
Paul Fultz II authored
Adds a pass to fuse pointwise operators into one "pointwsie" op that has a submodule which does the calculation.
-
- 24 Aug, 2021 1 commit
-
-
Umang Yadav authored
* rename broadcast and multibroadcast output_lens attribute to out_lens attribute, and change tests and source code to reflect the same * change the reshape attribute from dims to out_lens * change transpose attribute's name from dims to perm to reflect better meaning * use permutation instead of perm for transpose clang formaating * use dims instead of out_lens for reshape clang formatting
-
- 09 Jun, 2021 1 commit
-
-
Paul Fultz II authored
* Enable libstdc++ debug mode * Add is_end function * Compare addresses in a map or set * Formatting * Check end * Fix comparision of instruction_ref * Formatting * Some more iterator fixes * Formatting * Fix assert * Fix invalid iterators * Fix debug print in program * Remove debug flag for now * Set correct bool type Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 26 May, 2021 1 commit
-
-
Shucai Xiao authored
* add the operator step * clang formatJ * add unit tests * clang format * add more unit test for step op * clang format * add more unit tests * clang format * fix review comments * clang format * rename two unit tests Co-authored-by:Paul Fultz II <pfultz2@yahoo.com>
-
- 24 May, 2021 1 commit
-
-
Shucai Xiao authored
* backup implementation of resize enhancement * clang format * code backup for the resize * clang format * fix build error for resize operator * clang format * tmp code backup * clang format * remove changes in parse_resize * remove unnecessary changes * clang format * add unit test for the bug * clang format * remove print code * remove a semi-colon * clang format * fix a tidy error * fix review comments * clang format Co-authored-by:
Paul Fultz II <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 23 Apr, 2021 1 commit
-
-
Shucai Xiao authored
* code backup * clang format * add a matcher related to the special resize case for optimization * clang format * code backup * clang format * code backup * remove unnecessary code * add optimization for the where op * clang format * fix cppcheck error * add a unit test for optimize resize * clang format * remove unnecessary header include * code backup * clang format * add unit tests for optimizing resize * clang format * add more unit test for optimizing where op * clang format * remove unnecessary code * add one more optimzation to remove contiguous * clang format * add a pointwise requirement * clang format * fix cppcheck error * add one more unit test * fixed a bug * clang format * remove unnecessary code * clang format * fix a build error * fix review comments * clang format * fix a review comments * clang format * code refinement * clang format * refine more code * refine more code * fix a bug related to reshape_cont optimization * clang format * fix a review comment * removed an unnecessary comment * refine code according to comments * clang format Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 03 Mar, 2021 1 commit
-
-
Shucai Xiao authored
* fix issue#727 * clang format * refine unit tests * fix cppcheck error * fix review comments * refine a unit test to cover more code changes * fix cppcheck error * remove unnecessary include file * fix review comments * clang format Co-authored-by:
Paul Fultz II <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 08 Dec, 2020 1 commit
-
-
Paul Fultz II authored
* Load op when serializing * Formatting * Add missing clip field * Use make_op almost everywhere * Formatting * More make ops for rnns * Get rid of spaces * Formatting * Remove operators headers * Formatting * Remove unused op headers * Increase line threshold
-