- 17 Mar, 2021 4 commits
-
-
Paul authored
-
Paul Fultz II authored
-
Shucai Xiao authored
* code backup * clang format * code backup * change the print function to support print instruction from other modules * clang format * fix cppcheck error * fix cppcheck error * chang to make submodule to be owned by program instead of modules * clang format * add an unit test for copy of a program with sub_modules * clang format * remove the parent_module member variable from the module class * clang format * add unit test for serialization of program with submodules * clang format * Fix bug where instructions were not printed when doing TRACE_EVAL * clang storage of modules from map to list * clang format * Formatting * change the program assign function * clang format * code cleanup * clang format * backup code * clang format * remove unnecessary code * clang format * add module print function * code backup * refine the module::print function * refine the module:to_value() function * code backup * backup code changes * code backup * remove to_value and from_value function from the module class * rename a function * rename the if operator * refine the if operator * refine the print function of module and program * code backup * code backup * fix a build warning * fix overload of compute_shape function * code backup * fix unit test error * fix cppcheck error * fix the issue related to the overload of compute_shape * fix review comments * fix cppcheck error * change the return name of if_op to be if * clang format * fix two unit tests * clang format * remove the unused compute_op function * clang format * fix clang tidy format * clang format * enhance the validate function and uncomment a unit test * clang format * remove unnecessary code * clang format * fix a hang issue related to the valid function * fix an issue in replace_refs * clang format * fix review comments * clang format * fix cppcheck error * add a unit test for more code coverage * clang format * fix review comments and add test for more code coverage * clang format * fix cppcheck error * fix a cppcheck error * clang format * fix cppcheck error * clang format * fix review comments * clang format Co-authored-by:
Paul <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Paul Fultz II authored
-
- 12 Mar, 2021 1 commit
-
-
Cagri Eryilmaz authored
* onnx tests for lessorequal operator * lessorequal operator implementation * added test to verify_onnx.cpp * changes to tests * formatting * fixing tidy Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 10 Mar, 2021 1 commit
-
-
Shucai Xiao authored
* fix the flag in rocblas api for int8 data type * used different flag for different rocblas versions * clang format Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 05 Mar, 2021 1 commit
-
-
kahmed10 authored
* fix relu6 * add more transposes * add multi output * formatting * add tests * formatting * fix tests * change to_nchw for outputs * add python api * fix cppcheck * remove variable * fix lambda * add multi_output test * add more tests and merge * fix help message * debugging work * fix valid op string * formatting * manual merge * mark function as const Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com> Co-authored-by:
Shucai Xiao <shucai@gmail.com>
-
- 04 Mar, 2021 1 commit
-
-
Cagri Eryilmaz authored
* adding changes for resnet50 inference: opencv version problem wit qt + headless server support * added histogram output instead of text Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 03 Mar, 2021 1 commit
-
-
Shucai Xiao authored
* fix issue#727 * clang format * refine unit tests * fix cppcheck error * fix review comments * refine a unit test to cover more code changes * fix cppcheck error * remove unnecessary include file * fix review comments * clang format Co-authored-by:
Paul Fultz II <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 27 Feb, 2021 1 commit
-
-
kahmed10 authored
* fix relu6 * add more transposes * add multi output * formatting * add tests * formatting * fix tests * change to_nchw for outputs * add python api * fix cppcheck * remove variable * fix lambda * add multi_output test * add more tests and merge * fix help message Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com> Co-authored-by:
Paul Fultz II <pfultz2@yahoo.com>
-
- 26 Feb, 2021 4 commits
-
-
Cagri Eryilmaz authored
* changes for not operator * changed name of the op from unary_not to not * Added tests for op and onnx parsing * reordering not_test in onnx_test.cpp * not operator -- gpu implementation * added bool test for not operator * Added test and missing links for not operator on GPU * typo fix * adding .onnx test files for not operator * formatting Co-authored-by:
Shucai Xiao <shucai@gmail.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Paul Fultz II authored
* Add eliminate_data_type pass * Formatting * Auto convert quant ops * Formatting * Flip the order of decompose * Compute max size differently * Formatting * Clamp values in convert * Formatting * Fix loss of precision in reduce * Formatting * Fix bugs in reduction * Fix accumulator type in reference softmax implementation * Formatting * Update convert test * Remove unused variables * Remove unnecessary quant_dot check * Formatting * Add tests * Formatting * Remove unused code * Remove duplicate ops * Remove blaze dependency * Use set since shape::type_t is no hashable on gcc 5 * Formatting * Add dnnl binary op * Formatting * Add binary and eltwise * Formatting * Add softmax * Formatting * Remove unused operators * Add missing files * Formatting * Add lrn * Formatting * Add deconvolution * Formatting * Change allocate default * Add reorder * Formatting * Add reductions * Formatting * Sort lines * Change literals in another loop * Add pow operator * Formatting * Add pow operator * Formatting * Make sure shapes are packed * Allow broadcasted inputs * Remove unused operators * Simplify functions * Remove softmax * Add sub and erf functions * Formatting * Fix bug * Formatting * Improve parallism * Formatting * Allow multiple batch dimensions * Formatting * Move literal transforms out of lowering * Formatting * Add gather operator * Sort lines * Add early exit for carry * Formatting * Add missing concat * Rename macro * Fix deep nesting * Formatting * Fix cppcheck issues * Remov else * Move attribute to typedef * Formatting * Disable maybe-uninitialized warning since its broken on gcc * Add constexpr default constructor * Formatting * Fix compiler warnings * Fix adjust_allocation test Co-authored-by:
Shucai Xiao <shucai@gmail.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Shucai Xiao authored
* update README.md file * update README.md file * update README.md file * update README.md file * update README.md file * update dependencies * update dependencies * update readme file * format issue * edit * edit text * update script to install all dependencies * remove unnecessary libs * refine readme file * refine readme file * refine readme file * refine readme file * refine README * refine readme.md file * fix a error in the README.md file * change to use Dockerfile to call install_prereqs.sh * refine readme file * minor changes * minor changes * minor changes * minor changes * minor changes * minor changes * fixed a bug * refine readme file * minor change * minor changes * minor changes * minor changes * some additional refinement * fix review comments * some refinement * fix review comments * refine readme file * fix minor issue * change the miopen versioin back to 2.4.0 * fix an error in find_package * remove an unnecessary change * some additional refinement of the readme file * fix readme file * fix a review comments * added one more note for rbuild * add more informat for rbuild not found * add a period * Update README.md Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com> * Update README.md Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com> * Update README.md Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com> * Update README.md Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com> * Update README.md Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com> * Update README.md Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com> * Update README.md Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com> * Update README.md Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com> * refine a link in the readme file * some additional refinement * minor changes * minor changes * minor changes Co-authored-by:
kahmed10 <15948690+kahmed10@users.noreply.github.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Shucai Xiao authored
* if operator support with constant condition input * clang format * add a missing file * clang format * add an onnx verifcation unit test for the if operator * clang format' * fix review comments * temp version to try jenkin build * remove unnecessary changes * unit tests refinement for more code coverage * clang format * try a mutex to fix possible race condition in onnxruntime tests * tmp changes to try jenkins build * remove unnecessary code * fix review comments Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 25 Feb, 2021 3 commits
-
-
kahmed10 authored
* fix relu6 * add more transposes * add tests * formatting Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Paul Fultz II authored
* Fix bug where instructions were not printed when doing TRACE_EVAL * Formatting Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Paul Fultz II authored
* Add code object op * Formattting * Add more value tests * Formatting * Fix from_value conversion from binary * Formatting * Dont use offload copy * Remove iostream header Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 23 Feb, 2021 2 commits
-
-
kahmed10 authored
Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Paul Fultz II authored
* Add doc for c++ api * Formatting * Add driver documentation * Change the way base class is documented
-
- 12 Feb, 2021 1 commit
-
-
Paul authored
-
- 11 Feb, 2021 1 commit
-
-
kahmed10 authored
* refactor files * formatting * fix add_bcast test * fix some tidy errors * add transpose field and fix more tidy * formatting * add pad test and fix more tidy * formatting * fix conv parser * fix depthwiseconv * remove unsed functions * remove includes and functions Co-authored-by:Paul Fultz II <pfultz2@yahoo.com>
-
- 08 Feb, 2021 1 commit
-
-
Paul Fultz II authored
* Add eliminate_data_type pass * Formatting * Auto convert quant ops * Formatting * Flip the order of decompose * Compute max size differently * Formatting * Clamp values in convert * Formatting * Fix loss of precision in reduce * Formatting * Fix bugs in reduction * Fix accumulator type in reference softmax implementation * Formatting * Update convert test * Remove unused variables * Remove unnecessary quant_dot check * Formatting * Add tests * Formatting * Remove unused code * Remove duplicate ops * Remove blaze dependency * Use set since shape::type_t is no hashable on gcc 5 * Formatting Co-authored-by:
Shucai Xiao <shucai@gmail.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 06 Feb, 2021 1 commit
-
-
Shucai Xiao authored
* fix a bug that softmax/logsoftmax cannot handle nonstd input shape * clang format * fix review comments * clang format * refine test to have more code coverage * clang format Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 05 Feb, 2021 3 commits
-
-
Paul Fultz II authored
* Update module name * Formatting * Add const ref * Fix tidy warning Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com> Co-authored-by:
Shucai Xiao <shucai@gmail.com>
-
Paul Fultz II authored
* Normalize compute functions * Formatting * Save normalization flag to the file * Formatting * Remove tuned functions * Formatting * Use in_index Co-authored-by:
Shucai Xiao <shucai@gmail.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
turneram authored
* Added initial examples * Added python example from wiki * Edited readme * Added cpp interface files * Made changes to readmes * Added jupyter notebook for tf2 ex, added readme for tf1 ex * Added dockerfile * Re-structured driver example * Removed unnecessary files * Changed include path * Removed cpp_interface to rewrite * Added example of parsing, loading, saving with C++ API * Updated readme * Small code change, altered docker invocation, formatiing * Formatting * Added newline to end of dockerfile * Formatting * Formatting * Added C++ API inference example program * Formatting * Added README to cpp inference example * DeepCode suggested changed * DeepCode suggested change * Redesign python inference example * Address review comments * Address review comments * Address review comments * Added check for half version; conditionally ignore struct specialized for deprecated type * Moved new block after find_package(ROCM) * Rename HAVE_EXPR to HAVE_HALF_EXPR; relocated definitions Co-authored-by:
Paul Fultz II <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 04 Feb, 2021 1 commit
-
-
Shucai Xiao authored
* disable some ort tests to pass jenkins build * update onnxruntime to a later version and run disabled tests sequentially * remove the command line to turn on some unit tests
-
- 01 Feb, 2021 1 commit
-
-
Paul Fultz II authored
* Update test for passes * Formatting * Rewrite simplify_reshapes * Formatting * Rewrite normalize pass * Formatting * Rewrite pooling * Formatting * Rewrite schedule tests * Formatting
-
- 29 Jan, 2021 2 commits
- 28 Jan, 2021 1 commit
-
-
Paul Fultz II authored
* Add binary type to serialization * Formatting * Fix tidy warning * Size of value type is 1 * Formatting Co-authored-by:Shucai Xiao <shucai@gmail.com>
-
- 20 Jan, 2021 1 commit
-
-
turneram authored
* Added initial examples * Added python example from wiki * Edited readme * Added cpp interface files * Made changes to readmes * Added jupyter notebook for tf2 ex, added readme for tf1 ex * Added dockerfile * Re-structured driver example * Removed unnecessary files * Changed include path * Removed cpp_interface to rewrite * Added example of parsing, loading, saving with C++ API * Updated readme * Small code change, altered docker invocation, formatiing * Formatting * Added newline to end of dockerfile * Formatting * Formatting * Added C++ API inference example program * Formatting * Added README to cpp inference example * DeepCode suggested changed * DeepCode suggested change * Redesign python inference example * Address review comments * Address review comments * Address review comments
-
- 19 Jan, 2021 1 commit
-
-
Shucai Xiao authored
* add the and operator * clang format * add unit tests for the and operator * clang format * change the and name to logical_and and add the logical_or, logical_xor * clang format * add onnx unit tests for or and xor * add more unit tests
-
- 18 Jan, 2021 2 commits
-
-
Paul Fultz II authored
Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
kahmed10 authored
* initial testing * initial testing * add dequantize * formatting * add tests * formatting * revert file * add parse files * formatting * add axis tuning and fix tests * formatting * add tests and fix int8 * formatting * fix tidy * test with int32 * add default name and change string to upper * formatting * remove boost call * refactor to use tune_axis) * formatting
-
- 13 Jan, 2021 2 commits
-
-
Paul Fultz II authored
* Add initial iterator implementation * Formatting * Access index with bracket * Add cppcheck suppression Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
kahmed10 authored
* initial testing * initial testing * add dequantize * formatting * add tests * formatting * revert file * add parse files * formatting * add axis tuning and fix tests * formatting * add tests and fix int8 * formatting * fix tidy * test with int32 * add default name and change string to upper * formatting * remove boost call
-
- 08 Jan, 2021 2 commits
-
-
Shucai Xiao authored
* add support of the different input data_type for the pow operator * clang format * fix cppcheck error * clang format * add unit test for the pow operator with different input data types * clang format * remove unnecessary comments * fix review comments * clang format * fix a issue related to hash table key type * clang format Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
Paul Fultz II authored
* Add build and test github workflow * Fix cget command * Remove def-requirements.txt * Add tmate session to debug workflow * Run tmate session after installing dependencies * Print date periodically * Add clang tidy action * Seperate build and run container in two different jobs * Run bash script * Remove interactive flag * Try to mount the files * Try to use the github workspace * WIthout double braces * Use env variable * Pipe bash script in * Run using hip-clang * Use correct path * Add verbose * Remove j flag * Only run for onnx file to debug * Manually run clang-tidy * Remove quiet flag * Print header file * Printout environment * Remove extra defines * Remove fixits and config flag * Show ldd * Add tmate session * Run onnx protobuf first * Generate proto for tensorflow * Update cppcheck version * Fix some cppcheck issues * Add const * Cppcheck fixes * Formatting * Fix more cppcheck issues * Run two jobs * Cache analysis and run format checking * Fix yaml issues * Fix yaml issues * Fix indentation * Switch to hip-clang for main docker file * Use hip-clang in the readme * Fixes for jenkins * Use ccache to build * Combine file * Set restore keys * Change stage name * Build with ccache * Add missing dependency for ccache * Build debug with codecov * Fix workflow syntax * Fix list * Use quotes * Got to correct build path * Install lcov * Use sudo * Echo all commands * Setup tmate * Add verbose output * Build with cmake directly * Add pthread flag * Remove python config * Continue on error * Use on or off for cmake flag * Use always upload cache * Verbose output * Verbose output from build * Build one target * Reduce debug symbols * Increase garbage collection * Remove dmesg * Increase it to 20 * Update rocm cmake version * Remove jobs from jenkins * Run on all 3 ubuntus * Remove gcc 5 jobs * Dont add flag on 16.04 * Only upload coverage on 18.04 * Dont build for ubuntu 20.04 * Use matrix.os * Use O2 for hip-clang since lower optimizations are broken * Use rocm 3.0 * Pass ccache as cmake variable instead of env variable * Build miopen from source * Show ccache statistics * Print log information * Set compression level * Use hash dir * Set hashdir * Install clang ocl from system * Up compression level * Add locale * Increase cache size to 1G * Lower compression level to 9 * Remove split dwarf * Remove Og * Add back Og * Seperate debug and codecov * Add missing backlash * Garbage collect more often * Add missing locales package * Use Os * Install onednn in docker and run tests * Include target headers in tests * Increase timeout * Remove if condtion * Make flag public * Suppress memory leaks in onednn * Use equal * Add gh annotations * Update rocm-cmake version * Add ldconfig Co-authored-by:Shucai Xiao <shucai@gmail.com>
-
- 07 Jan, 2021 1 commit
-
-
Paul Fultz II authored
-