- 08 Jun, 2020 1 commit
-
-
kahmed10 authored
* initial progress * formatting * add pooling changes * formatting * change eliminate_pad * formatting * rename var * fomratting * update op shape test and compute * formatting * revert conv constructor * formatting * change initializer * formatting * fix tidy * change quant conv and shape check * add tests and fixes * formatting * fix type * fix conv test * formatting * add pooling and bn tests * formatting * add inconsistent attr tests * fix padding issue * formatting * fix review comments, remove duplicate test * formatting * fix variable * fix assert bug * fix attr check * remove std Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 03 Jun, 2020 1 commit
-
-
Shucai Xiao authored
* fix pad calc * Add decompose pass * Add decompose test * Formatting * bert tf passes correctness * formatting * Add remap * Formatting * add test * formatting * remove comment * Add compute method for dot * Formatting * add inline * Add finder for horizontal fusion * Formatting * Formatting * Reuse predicate * formatting * fix order for literal * formatting * add test for gelu * formatting * added add_gelu fusion * Add gemm fusions * Formatting * add files * formatting * test no mul_add * formatting * progress on div * formatting * continue work on pass * remove layernorm opt * revert reduce file * Add some fixes for convolution * Formatting * Fix shape tests * Formatting * Reuse axis equal * Add initial split fusion * Formatting * Update offset * Workaround outputs that cant accept nonstandard shapes * Formatting * Add check for split concat * Formatting * Add missing headers * Formatting * Add tests * Formatting * add optimization for bert * code backup for bert optimization * continue testing * formatting * fix matcher * formatting * add gelu_fn and tests * formatting * fix matcher, remove extra tests * formatting * fix matcher * add missing files * add find_layernorm * add add_transpose to cmake file * code backup for the contigous fusion * refine ops fusion * clang format * fixed bug in previous optimization * clang format * add more optimization * remove unnecessary code * refinement of the fustion code * clang format * fixed a bug * add used_once * formatting * start on new gelu * formatting * add matchers in fuse_ops * formatting * add dce to fix add_gelu * add simplify_rsqrt and test * formatting * debugging value for matcher * formatting * add more to matchers * formatting * fix errors * remove onnx gen * add any_arg, change matchers to use either_arg * formatting * clang format * formatting * add used_once * formatting * code cleanup * clang format * fixed a bug * remove unnecessary code * refine comments * optimize bert to remove more contiguous * clang format * remove unnecessary code * add unit tests for bert optimization * clang format * fix review comments * clang format * refine a fusion of reshape and slice * clang format * fix cppcheck error * fix review comments * add the fusion of slice and transpose * clang format * add another optimization to fuse slice and transpose * clang format * fix review comments * clang format * fix review comments * clang format * fix review comments Co-authored-by:
Khalique <15948690+kahmed10@users.noreply.github.com> Co-authored-by:
Paul <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com> Co-authored-by:
Shucai Xiao <scxiao@prj47-rack-99.local.lan>
-
- 02 Jun, 2020 1 commit
-
-
Paul authored
-
- 29 May, 2020 1 commit
-
-
mhbliao authored
-
- 22 May, 2020 1 commit
-
-
Paul Fultz II authored
-
- 20 May, 2020 1 commit
-
-
Shucai Xiao authored
* code backup * clang format * fix compiling errors * clang format * rename a few files * rename a few files * fix variable bugs * clang format * add an operator to shift input sequences * clang format * fixed a bug * clang format * fixed a bug * clang format * code backup * clang format * code backup * clang format * code backup * clang format * refine code related lstm operator optimization * clang format * fix various bugs * clang format * fixed a bug in rewrite_lstm * clang format * fixed another bug * refine two operator names * clang format * refine file names * fix cppcheck error * clang format * fix cppcheck error * clang format * fix cppcheck error * fixed review comments * clang format * add unit tests * clang format * add unit tests * clang format * refine unit tests for better coverage * clang format * fixed a bug * fix cppcheck error * fix review comments * clang format * rename two operators according to review comments * clang format * fix review comments * clang format * fix review comments * clang format * fix review comments * fix a cppcheck error * clang format * fix review comments * clang format Co-authored-by:
Shucai Xiao <scxiao@prj47-rack-99.local.lan> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 15 May, 2020 1 commit
-
-
kahmed10 authored
* fix pad calc * bert tf passes correctness * formatting * add test * formatting * remove comment * add inline * formatting * fix order for literal * formatting * add test for gelu * formatting * added add_gelu fusion * add files * formatting * remove layernorm opt * revert reduce file * add gelu_fn and tests * formatting * fix matcher, remove extra tests * formatting * fix matcher * add used_once * formatting * start on new gelu * formatting * add matchers in fuse_ops * formatting * add dce to fix add_gelu * add simplify_rsqrt and test * formatting * debugging value for matcher * formatting * add more to matchers * formatting * fix errors * remove onnx gen * add any_arg, change matchers to use either_arg * formatting * formatting * add used_once * formatting Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 11 May, 2020 1 commit
-
-
Paul Fultz II authored
* Fix handling of lowest values in pad operator * Formatting * Formatting * Formatting * Add cpu test for lowest padding * Add test for max Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 08 May, 2020 1 commit
-
-
Paul Fultz II authored
* Add decompose pass * Add decompose test * Formatting * Add remap * Formatting * Add compute method for dot * Formatting * Add finder for horizontal fusion * Formatting * Formatting * Reuse predicate * Add gemm fusions * Formatting * Add some fixes for convolution * Formatting * Fix shape tests * Formatting * Reuse axis equal * Add initial split fusion * Formatting * Update offset * Workaround outputs that cant accept nonstandard shapes * Formatting * Add check for split concat * Formatting * Add missing headers * Formatting * Add tests * Formatting * Add more testing * Formatting * Fix when there is duplicate splits in inputs * Formatting * Fix mismatch iterators * Add tests for dot fusions * Formatting * Add test for convolution * Formatting * Fix tidy issues * Add more tests * Formatting * Ignore build directory for codecov * Add test for groups * Formatting * Add more tests for groups * Formatting * Add test for missing end slice * Add newline * Remove unused function * Add support for when beta is not 1 * Formatting * Add test for scalar * Add one more scalar test Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 04 May, 2020 1 commit
-
-
Yaxun (Sam) Liu authored
-
- 21 Apr, 2020 1 commit
-
-
Yaxun (Sam) Liu authored
-
- 17 Apr, 2020 1 commit
-
-
Paul Fultz II authored
* Fix bug with transpose and pad * Formatting Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 14 Apr, 2020 1 commit
-
-
Shucai Xiao authored
* code changes for small bugs * clang format * remove standard shape requirement for transpose. * add a unit test * clang format Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 08 Apr, 2020 1 commit
-
-
kahmed10 authored
* add recip gpu and tests * formatting * remove to_hip_type Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 06 Apr, 2020 1 commit
-
-
Paul Fultz II authored
* Upgrade compiler to hcc * Add missing perl package for hcc * Update hip and rocblas version * Add flag to speedup build * Remove symlink * Add no-gpu-rdc to linking * Add missing define * Disable no unused argument * Suppress warning * Add definition to all targets * Fix compiler flags * Refer to rocblas
-
- 29 Mar, 2020 1 commit
-
-
kahmed10 authored
* fix pad calc * modify clip for more args * formatting * add test, flip order, revert to unary * fix error msg * add min and max args to clip * formatting * fixes to quantization * formatting * fix logic and add extra test * formatting * fix logic, add extra test * formatting * fix bug in test Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com> Co-authored-by:
Paul Fultz II <pfultz2@yahoo.com>
-
- 07 Mar, 2020 1 commit
-
-
Shucai Xiao authored
* add prelu operator * clang format * add prelu to gpu lowering * add unit tests for the PRelu operator * clang format * add missing onnx file for PRelu operator * update unit tests for prelu operator * clang format Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com> Co-authored-by:
Paul Fultz II <pfultz2@yahoo.com>
-
- 06 Mar, 2020 1 commit
-
-
Shucai Xiao authored
* Add initial api * Formatting * Add more api * Formatting * Add auto api generation * Formatting * Fix some compilation errors * Change handle struct * Formatting * Fix reamining compilation errors * Formatting * fixed a bug related to number of outputs * Simplify using ctype * Formatting * Initial c++ generation * Formatting * Add C++header * Formatting * Add test * Formatting * Add initial tests * Formatting * Try to fix formatting * Cleanup formatting * Formatting * Fix constructors on the same line * Fix tests * Formatting * Fix tidy issues * Fix tidy issues * Fix naming issue * Add onnx API to parse buffer * Formatting * Add arguments api * Formatting * Fix verify parameters * Fix cppcheck issues * Formatting * Add method to get output shapes and bytes * Formatting * Try formatting * Formatting * Improve the test coverage * Formatting * Add print method * Formatting * Fix cppcheck issue * Fix package dependency * code backup for support multiple outputs * clang format * change migraphx api to support multiple program outputs * clang format * change api implementation * clang format * clang format * fix a build error * additional changes * clang format * change api for correct automatic generation * clang format * fix unit test error * fix unit test error * fix unit tests error * support multiple program outputs * clang format * remove @ from the add_return name * Add nolint * Try fix formatting * Formatting * formatting * formatting * Fix formatting * code cleanup * clang format * fix cppcheck error * fix a cppcheck error * clang format * fix review comments * clang format * fix cppcheck error * clang format * record graph output name * clang format * refine print the add_return instruction * clang format * fix cppcheck error * clang format * refine the name of the add_return instruction * fixed a bug related to workspace * fixed two small bugs * clang format * add more unit tests for multiple program outputs * clang format * change an error info * clang format * fix cppcheck error * add unit test for better code coverage * change to reduce code change * clang format * remove storing program output * fix cppcheck error * fix review comments * clang format * clang format * remove unnecessary change * resolve an assert error * clang format * change the output name with prefix '#' * changes in quantization function to support the returns instructin * clang format * refine unit tests * clang format * refine profiling print out report Co-authored-by:
Paul <pfultz2@yahoo.com> Co-authored-by:
Khalique <15948690+kahmed10@users.noreply.github.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 26 Feb, 2020 1 commit
-
-
Paul authored
-
- 24 Feb, 2020 1 commit
-
-
Paul authored
-
- 12 Feb, 2020 1 commit
-
-
Aaron Enye Shi authored
* Fix HIP-Clang GPU build issues Add missing device attributes for GPU functions. GPU functions must be annotated with __device__ in HIP. * Use HIP device function max and min * Fix clang-format-5.0 issues * Undo change that breaks on HIP-HCC Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 10 Feb, 2020 1 commit
-
-
Shucai Xiao authored
* Add initial api * Formatting * Add more api * Formatting * add more operators (asinh, acosh, atanh, MatMulInteger, ConvInteger) * clang format * add unit tests for new operators * clang format * Add auto api generation * Formatting * Fix some compilation errors * Change handle struct * Formatting * Fix reamining compilation errors * Formatting * Simplify using ctype * Formatting * Initial c++ generation * Formatting * Add C++header * Formatting * Add test * Formatting * Add initial tests * Formatting * Try to fix formatting * Cleanup formatting * Formatting * Fix constructors on the same line * Fix tests * Formatting * Fix tidy issues * Fix tidy issues * Fix naming issue * Add onnx API to parse buffer * Formatting * Add arguments api * Formatting * Fix verify parameters * Fix cppcheck issues * Formatting * Add method to get output shapes and bytes * Formatting * Try formatting * Formatting * Improve the test coverage * Formatting * Add print method * Formatting * Fix cppcheck issue * Fix package dependency * Add nolint * Try fix formatting * Formatting * formatting * formatting * Fix formatting Co-authored-by:
Paul Fultz II <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com> Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com>
-
- 24 Jan, 2020 1 commit
-
-
kahmed10 authored
* initial testing * add shape op * formatting * add env variable for batch sizes * formatting * progress on driver * progress on driver * cleanup * cleanup * add and modified prev tests * formatting * remove comment * add shape op test * formatting * manually insert shape op in test * formatting * create options struct for parsers * formatting * Add documentation for python * Fix c++ documentaion * add documentation to parser * formatting * add argmin and tests * fix doc and definitions * formatting * revert test functions * formatting * cpu impl of conv_transpose * more work on conv_transpose * rename files, added extratests * formatting * add more tests * formatting * changes * fix tests * fix tidy * formatting * fixed function parameter * fix function parameter * add cpu ops test * formatting Co-authored-by:
Paul Fultz II <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 17 Jan, 2020 1 commit
-
-
Shucai Xiao authored
* add reduce operators as required by onnxruntime * clang format * remove a test since it can cause overflow * resolve cppcheck error * clang format * fix cppcheck error * clang format
-
- 20 Dec, 2019 1 commit
-
-
Shucai Xiao authored
* improve unsqueeze to support negative axis and parsing scalar * clang format * add a test example for the negative axis of unsqueeze * improve the squeeze operator to support negative axis * clang format * fixed a small bug in the lrn implementation * clang format * support negative axis in argmax and argmin * clang format * improve flatten to support negative axis * clang format * change softmax/logsoftmax to support negative axis * clang format * improve transpose by adding default perm * clang format * add one more dimens for tensor size * add one more dimens for tensor size * disable conv ops fusion for non-symmetric cases * clang format * fixed review comments * move computing axis from the device function to the compute function * clang format * move computing axis from device function to the operator computing function * clang format Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 27 Nov, 2019 1 commit
-
-
Paul Fultz II authored
* Add experimental support for c++ output * Format * Fix syntax errors * Add resnet50 model * Formatting * Add inceptionv3 model * Formatting * Add alexnet * Formatting * Fix name of pooling mode * Formatting * Fix tidy issues * Ignore driver directory * Show accetable values
-
- 18 Nov, 2019 1 commit
-
-
Shucai Xiao authored
* improve gather implementation to handle negative input indices * clang format * clang format * improve concat to support neg axis input * clang format * fix cppcheck error * clang format * code cleanup * clang format * fix review comments * clang format
-
- 17 Nov, 2019 1 commit
-
-
Paul authored
-
- 15 Nov, 2019 1 commit
-
-
Paul Fultz II authored
* Add compiler options * Add copy operators * Formatting * Use run_passes in tests * Formatting * Use run_pass in schedule test * Formatting * Add compile_options to get_passes in target * Formatting * Offload copy option * Formatting * Copy using pinned memory * Formatting * Improve performance of gpu copying * Formatting * Dont copy * Formatting * Always make an extra copy * Formatting * Remove unused write op * Add missing include * Remove copy_to_gpu function in python api * Make offload copy disabled by default on C++ * Formatting * Fix tidy issues * Formatting * Fix namespace * Fix python tests * Turn clang format off since its broken * Fix compile error on gcc 5 * Remove commented code
-
- 04 Nov, 2019 2 commits
-
-
Paul Fultz II authored
* Add functions to do multi-index for local strides as well * Formatting * Use same multi-index path for block_reduce * Formatting * Use multi-index calc in reduce * Formatting * Fix warning * Fix compiler warning * Disable some tidy checks
-
Paul Fultz II authored
* Fix bug in eliminate_concat * Formatting * Skip context_free operators * Formatting * Fix unit test * Formatting
-
- 30 Oct, 2019 1 commit
-
-
Paul Fultz II authored
* Enable scheduler for 1 stream * Formatting * Improve performance of sorting * Formatting * Adjust the weight calculation * Formatting * Simplify formula * Formatting * Avoid division by zero * Fix scheduler test * Check for either 1 or 2 * Check for waits when order may change * Formatting
-
- 25 Oct, 2019 1 commit
-
-
Shucai Xiao authored
* simplify cpu implementation of the convolution, softmax, and logsoftmax * clang format * fix cppcheck error * improve code coverage
-
- 15 Oct, 2019 1 commit
-
-
Paul Fultz II authored
* use 32bit integers for indices * Formatting * Update more index types * Formatting
-
- 09 Oct, 2019 1 commit
-
-
Paul Fultz II authored
* Fix bug in bert accuraccy * Formatting * add another test * Fix add and overflow * Formatting * Fix bug in shape_for_each * Use front instead of iterator * Use result.front() * Split add_unary files * Formatting * Fix incorrect last index * Remove comment * Inline function * Fix carry check * Fix metadata errors * Formatting * Reflow * Reflow
-
- 07 Oct, 2019 1 commit
-
-
Paul Fultz II authored
* Implement fast-div for index calculations * Formatting * Use fast_div for broadcasts * Formatting * Add remiander function * Compute mult-index using lens instead of strides * Formatting * Simplify equation * Formatting
-
- 04 Oct, 2019 1 commit
-
-
kahmed10 authored
* initial testing of add_clip fusion * formatting * clipped relu fusion * formatting * remove some executables, add fusion test * formatting * remove clipped_relu code * fix clang-tidy * revert changes to cmake files * remove fusion from weight map * formatting * fix syntax error * formatting * fix syntax error * fix syntax error * formatting
-
- 03 Oct, 2019 2 commits
-
-
Shucai Xiao authored
* fixed a bug related to removing gemm copy * clang format * fix review comments * clang format * fix unit test failure * fix review comments * clang format
-
Paul Fultz II authored
* Add env to trace nary device functions * Formatting * Improve contiguous and concat performance * Formatting * Remove unused variable * Formatting * Fix gpu tests * Formatting * Add more test for transposed concat * Formatting * Compute offset and not index * Compute multi-index once * Formatting * Fix transposed inputs * Formatting * Use product order for comparisons of hip_array * Formatting * Add missing s parameter * Formatting * Dont invert permutation * Fix tidy warnings * Formatting * Remove incorrect license * Use a single integer for stride * Formatting * Fix tidy issue
-
- 27 Sep, 2019 1 commit
-
-
Shucai Xiao authored
* add two operators ceil and floor * clang format * add unit test for the ceil and floor operators * remove unintended code
-