- 05 Apr, 2021 1 commit
-
-
Shucai Xiao authored
* code cleanup * clang format * backup code * clang format * remove unnecessary code * clang format * add module print function * code backup * refine the module::print function * refine the module:to_value() function * code backup * backup code changes * code backup * remove to_value and from_value function from the module class * rename a function * rename the if operator * refine the if operator * refine the print function of module and program * code backup * code backup * fix a build warning * fix overload of compute_shape function * code backup * fix unit test error * fix cppcheck error * fix the issue related to the overload of compute_shape * fix review comments * fix cppcheck error * change the return name of if_op to be if * clang format * fix two unit tests * clang format * rename variables * clang format * remove the unused compute_op function * clang format * add lowering of if operator and compute_op function * clang format * add parsing if operator in onnx file * clang format * fix clang tidy format * clang format * add the gpu implementation of the if operator * enhance the validate function and uncomment a unit test * clang format * remove unnecessary code * add sub_module processing in ref passes * clang format * clang format * fix a hang issue related to the valid function * fix an issue in replace_refs * clang format * fix review comments * clang format * fix cppcheck error * clang format * add a unit test for more code coverage * clang format * fix review comments and add test for more code coverage * clang format * fix cppcheck error * clang format * fix cppcheck error * fix a cppcheck error * clang format * backup code * clang format * fix cppcheck error * clang format * some code refinement * clang format * code backup to handle submodules in module compilation * clang format * code backup * clang format * code backup * clang format * fix a bug related to literal id * fix a bug in gpu execution * change the way of compiling a graph * clang format * backup more changes * clang format * refine pass log information * remove unnecessary code * clang format * temp changes backup * clang format * add module name prefix to scratch memory id in hip_memory_allocation * clang format * change to copy the cond input by inserting a copy instruction * clang format * change to use the if output argument as the submodule output so can remove a gpu_copy * clang format * consider submodule in some compile passes * clang format * fix review comments * clang format * fix issues related to scratch memory * clang format * remove unnecessary code * fix cppcheck error * clang format * reslove the implicit dependencies issue related to submodule * clang format * fix cppcheck error * clang format * backup temp changes * clang format * fixed an bug in the has_instruction function * clang format * fix the return value of the gpu implementation of the if operator * fix a bug in the compute_shape function in the gpu implementation * add an if onnx unit test * clang format * add more unit tests * clang format * tmp code backup * clang format * fix a sync problem related to copy cond argument from gpu to cpu * clang format * change the compile offload copy flag setting * clang format * enable copy from cpu to be able to do synchronous copy * clang format * add more unit tests * add more unit tests * add more ref unit tests * clang format * fixed a bug error * tmp code backup * clang format * fixed an onnx verify unit test * add more unit tests * clang format * reverse a change * fix cppcheck error * fix cppcheck error * fix to print all instructions in program execution * clang format * fix bugs related to memory coloring and offload copy to be true * clang format * remove unnecessary include header file * sort test cases in ref_cpu_ops alphabetically * clang format * add a flag to disable cpu target in verification test * change the way to disable some tests * clang format * disable verify unit test of the if operators * add a function call to have more code coverage * fix a build error * fix review comments * fix review comments * clang format * add a api gpu unit test for more code coverage * clang format * change to use instruction.size() as node index * move the calc_implicit_deps function to module class as a member function * clang format * move the offload_copy flag setting to lowering * clang format * assign the module_eval lambda function to a variable to simplify code * clang format * move the compute function from ref/gpu implementation to the main if operator * clang format * fix cpp check error * add a unit test for more code coverage * clang format * add unit test to calculate implicit deps * add a python unit test * clang format * refine a unit test to have more code coverage * clang format * chang the way of wrap up arguments for sub modules * clang format * fix some build errors * code cleanup * refine unit tests to have more code coverage * clang format * refine unit test to have more code coverage * code backup * clang format * add memory coloring test * refine memory coloring unit test * clang format * remove an unnecessary line * remove an unused line * remove an unnecessary parameter in the lambda function * clang format * refine a unit test * remove an unnecessary line * refine unit tests to have more code coverage * clang format * combine two lines * add one more unit test for more code coverage * clang format * add one more unit test * clang format * fix review comments * refine a print out information * fix review comments * clang format * change the sync copy to using a gpu device sync * clang format * remove unnecessary code Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 01 Feb, 2021 1 commit
-
-
Paul Fultz II authored
* Update test for passes * Formatting * Rewrite simplify_reshapes * Formatting * Rewrite normalize pass * Formatting * Rewrite pooling * Formatting * Rewrite schedule tests * Formatting
-
- 06 Jan, 2021 1 commit
-
-
Shucai Xiao authored
* add an api get_main_module * clang format * modify onnx unit test for module * clang format * refactor ops unit test with the get_main_module * clang format * code backup * clang format * refine module c api * add python api for module * clang format * fix a python api issue * clang format * fix cppcheck error * clang format * refine unit tests changes * clang format * code backup * code backup * clang format * defer some changes to later PRs * change return of get_main_module from ref to pointer * clang format * add unit tests for the get_main_module_api * clang format * fix cppcheck error * clang format * fix cppcheck error * clang format * add more unit tests for more code change coverage * clang format * fixed a unit test error * clang format * fix unit test * clang format * code backup * code change for more code coverage * change program to module in various passes and matcher * clang format * modify the pass API * code backup * code backup * clang format * code backup * clang format * Add option to no generate a destroy method * Formatting * fix some review comments * clang format * fix review comments * clang format * clang format * code backup * code backup * clang format * fix cppcheck errors * clang format * clang format * fix build errors * clang format * modify gpu unit tests to using module * clang format * fix cppcheck error * clang format * Add flag to enable cpu backend * Make buffers shared * Enable optimizations * Formatting * fix review comments * code backup * clang format * code backup * clang format * fix a bug related to a unit test * clang format * clang format * fix a build error * remove unnecessary code * remove unnecessary files * code backup * clang format * remove the compile function from the module class * clang format * clang format * remove the context parameter from the from_value method of the module class * code refinement * clang format * merge changes from develop branch * clang format * fix cppcheck error * clang format * fix a build error * fixed a merge error * fix cppcheck error * fixed review comments * clang format * fix cppcheck error * fix a cppcheck error * fix cppcheck error * fix build error caused by merge * Add missing has_op function * Formatting * merge changes from develop branch * fix a cppcheck error * fixed some review comments * clang format * remove the begin/end function of the program class * clang format * refine code and fix cppcheck error * clang format * fix review comments * clang format * fix review comments * clang format * add unit tests for more code coverage * clang format * fix review comments * clang format * fix review comments * clang format * fix a build error in debug mode * clang format Co-authored-by:Paul <pfultz2@yahoo.com>
-
- 08 Dec, 2020 1 commit
-
-
Paul Fultz II authored
* Load op when serializing * Formatting * Add missing clip field * Use make_op almost everywhere * Formatting * More make ops for rnns * Get rid of spaces * Formatting * Remove operators headers * Formatting * Remove unused op headers * Increase line threshold
-
- 11 Nov, 2020 1 commit
-
-
Shucai Xiao authored
* code backup * clang format * change corresponding tool files * clang format Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 30 Sep, 2020 1 commit
-
-
Paul Fultz II authored
* Make global variables const * Tidy fixes * Disable some lints * Formatting * Fix tidy const * Formatting * Add missing const keywords * Formatting * More fixes * Fix remaining tidy issues * Formatting * Fix rocblas function call * Formatting * Fix nodiscard warnings * Formatting * Use named parameters * Remove overload * Add overload * Remove noncps * Use named param for node * Add auto register header * Use named parameters * Refactor jenkinsfile * Fix shadow * Add missing body variable * Add more const methods * Add hip-clang docker builds * Remove comments * Add clang-format * Add more const * Formatting * Rename stage * Disable check * Add another const * Add python 2 dev packages * Add sphinx to dockerfile
-
- 15 Nov, 2019 1 commit
-
-
Paul Fultz II authored
* Add compiler options * Add copy operators * Formatting * Use run_passes in tests * Formatting * Use run_pass in schedule test * Formatting * Add compile_options to get_passes in target * Formatting * Offload copy option * Formatting * Copy using pinned memory * Formatting * Improve performance of gpu copying * Formatting * Dont copy * Formatting * Always make an extra copy * Formatting * Remove unused write op * Add missing include * Remove copy_to_gpu function in python api * Make offload copy disabled by default on C++ * Formatting * Fix tidy issues * Formatting * Fix namespace * Fix python tests * Turn clang format off since its broken * Fix compile error on gcc 5 * Remove commented code
-
- 30 Oct, 2019 1 commit
-
-
Paul Fultz II authored
* Enable scheduler for 1 stream * Formatting * Improve performance of sorting * Formatting * Adjust the weight calculation * Formatting * Simplify formula * Formatting * Avoid division by zero * Fix scheduler test * Check for either 1 or 2 * Check for waits when order may change * Formatting
-
- 03 Apr, 2019 1 commit
-
-
Shucai Xiao authored
-
- 13 Mar, 2019 1 commit
-
-
Paul authored
-
- 12 Mar, 2019 5 commits
- 11 Mar, 2019 3 commits
- 10 Mar, 2019 3 commits
- 09 Mar, 2019 10 commits
- 08 Mar, 2019 4 commits
- 07 Mar, 2019 3 commits
- 06 Mar, 2019 2 commits