- 16 Sep, 2021 1 commit
-
-
Shucai Xiao authored
Add Loop operator for opset version 13. Notes: 1) Default max iteration number is 10 if no max iteration number is provided 2) To change the max iter number, a user can set the max_loop_iterations in the onnx_option struct when parsing a model. 3) The returned shape of the scan output is from the max_loop_iterations even the actual loop num is less than that. This issue also applies to other operators like NonZero and NonMaxSuppression. A issue #948 is created to track this and to be resolved later. Co-authored-by:
Paul <pfultz2@yahoo.com> Co-authored-by:
mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 06 May, 2021 1 commit
-
-
Paul Fultz II authored
* Use hipStreamSynchronize instead of device sync * Formatting * Suppress FPs * Use sync_stream instead of device * Formatting * Fix python bindings * Formatting
-
- 30 Sep, 2020 1 commit
-
-
Paul Fultz II authored
* Make global variables const * Tidy fixes * Disable some lints * Formatting * Fix tidy const * Formatting * Add missing const keywords * Formatting * More fixes * Fix remaining tidy issues * Formatting * Fix rocblas function call * Formatting * Fix nodiscard warnings * Formatting * Use named parameters * Remove overload * Add overload * Remove noncps * Use named param for node * Add auto register header * Use named parameters * Refactor jenkinsfile * Fix shadow * Add missing body variable * Add more const methods * Add hip-clang docker builds * Remove comments * Add clang-format * Add more const * Formatting * Rename stage * Disable check * Add another const * Add python 2 dev packages * Add sphinx to dockerfile
-
- 18 Aug, 2020 1 commit
-
-
Paul Fultz II authored
* Register ops for main migraphx * Formatting * Register cpu ops * Formatting * Show list of operators in the driver * Formatting * Simplify regiter * Try to register gpu ops * Fix compiler errors * Register rest of the gpu operators * Add some tests * Formatting * Fix gcc compiler warnings * Formatting * Fix tidy warnings * Fix compile error * Use correct op name * Register layer norm * Use const ref * Make run const
-
- 13 Aug, 2020 1 commit
-
-
Shucai Xiao authored
* code backup * code backup Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 21 Jul, 2020 1 commit
-
-
Paul Fultz II authored
* Fix bug in eliminate_concat with negative axis * Formatting * Fix unused parameter * Formatting Co-authored-by:Shucai Xiao <shucai.xiao@amd.com>
-
- 15 Nov, 2019 1 commit
-
-
Paul Fultz II authored
* Add compiler options * Add copy operators * Formatting * Use run_passes in tests * Formatting * Use run_pass in schedule test * Formatting * Add compile_options to get_passes in target * Formatting * Offload copy option * Formatting * Copy using pinned memory * Formatting * Improve performance of gpu copying * Formatting * Dont copy * Formatting * Always make an extra copy * Formatting * Remove unused write op * Add missing include * Remove copy_to_gpu function in python api * Make offload copy disabled by default on C++ * Formatting * Fix tidy issues * Formatting * Fix namespace * Fix python tests * Turn clang format off since its broken * Fix compile error on gcc 5 * Remove commented code
-
- 18 Sep, 2019 1 commit
-
-
Shucai Xiao authored
* Remove extra copy in gemm * combine rocblas gemm call * clang format * fix a bug in calling rocblas function * clang format' * backup of temporary changes * clang format * unify the gemm call to avoid multiple gpu implemantation * clang format * remove unnecessary code * backup temp changes * clang format * fix cppcheck error * code backup * clang format * remove unnecessary synchronization function * clang format * fix bugs * clang format * more optimization related to gemm * clang format * code cleanup * implementation that can achieves better performance * clang format * temp changes to try performance * clang format * revert to previous commits * fixed review comments * clang format * fix review comments
-
- 12 Mar, 2019 1 commit
-
-
Paul authored
-
- 02 Mar, 2019 1 commit
-
-
Paul authored
-
- 01 Mar, 2019 3 commits
- 11 Dec, 2018 1 commit
-
-
Paul authored
-
- 27 Nov, 2018 1 commit
-
-
Paul authored
-
- 14 Nov, 2018 1 commit
-
-
Paul authored
-
- 06 Nov, 2018 9 commits
-
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
- 28 Oct, 2018 2 commits
- 27 Oct, 2018 1 commit
-
-
Paul authored
-
- 26 Oct, 2018 3 commits
- 18 Oct, 2018 1 commit
-
-
Paul authored
-
- 13 Sep, 2018 1 commit
-
-
mei-ye authored
-
- 11 Sep, 2018 1 commit
-
-
mei-ye authored
-
- 01 Sep, 2018 1 commit
-
-
Paul Fultz II authored
-
- 31 Aug, 2018 1 commit
-
-
mei-ye authored
-
- 27 Aug, 2018 1 commit
-
-
mei-ye authored
-
- 24 Aug, 2018 1 commit
-
-
mei-ye authored
-
- 23 Aug, 2018 1 commit
-
-
mei-ye authored
-
- 19 Aug, 2018 1 commit
-
-
Paul authored
-