- 09 Oct, 2019 1 commit
-
-
Paul Fultz II authored
* Fix bug in bert accuracy * Formatting * add another test * Fix add and overflow * Formatting * Fix bug in shape_for_each * Use front instead of iterator * Use result.front() * Split add_unary files * Formatting * Fix incorrect last index * Remove comment * Inline function * Fix carry check * Fix metadata errors * Formatting * Reflow * Reflow
-
- 07 Oct, 2019 1 commit
-
-
Paul Fultz II authored
* Implement fast-div for index calculations * Formatting * Use fast_div for broadcasts * Formatting * Add remainder function * Compute multi-index using lens instead of strides * Formatting * Simplify equation * Formatting
-
- 04 Oct, 2019 1 commit
-
-
kahmed10 authored
* initial testing of add_clip fusion * formatting * clipped relu fusion * formatting * remove some executables, add fusion test * formatting * remove clipped_relu code * fix clang-tidy * revert changes to cmake files * remove fusion from weight map * formatting * fix syntax error * formatting * fix syntax error * fix syntax error * formatting
-
- 03 Oct, 2019 2 commits
-
-
Shucai Xiao authored
* fixed a bug related to removing gemm copy * clang format * fix review comments * clang format * fix unit test failure * fix review comments * clang format
-
Paul Fultz II authored
* Add env to trace nary device functions * Formatting * Improve contiguous and concat performance * Formatting * Remove unused variable * Formatting * Fix gpu tests * Formatting * Add more tests for transposed concat * Formatting * Compute offset and not index * Compute multi-index once * Formatting * Fix transposed inputs * Formatting * Use product order for comparisons of hip_array * Formatting * Add missing s parameter * Formatting * Don't invert permutation * Fix tidy warnings * Formatting * Remove incorrect license * Use a single integer for stride * Formatting * Fix tidy issue
-
- 02 Oct, 2019 1 commit
-
-
kahmed10 authored
* test hook * fix format * fix format * ignore doc dir * fix regex * fix jenkins error * exclude another dir * formatting test_array * fix version of yapf * test hook * formatting * reinclude dirs
-
- 30 Sep, 2019 1 commit
-
-
Paul authored
-
- 27 Sep, 2019 1 commit
-
-
Shucai Xiao authored
* add two operators ceil and floor * clang format * add unit test for the ceil and floor operators * remove unintended code
-
- 26 Sep, 2019 1 commit
-
-
Paul Fultz II authored
* Fix compiler crash in TF inceptionv4 * Formatting * Remove else
-
- 25 Sep, 2019 1 commit
-
-
Shucai Xiao authored
* first version of refactoring reduce operators. * clang format * refactor the gpu implementation of the reduce_mean operator * clang format * refactor gpu implementation of the reduce_sum operator * fix cppcheck error * fix cppcheck error * fix cppcheck error * fix review comments * clang format * fix a Jenkins error * fixed review comments * clang format * fix review comments * clang format * fix review comments * clang format * add implementation of reduce_min and reduce_max * clang format * add unit test for reduce_min/max operator * clang format * add more unit tests * clang format * fix review comments
-
- 20 Sep, 2019 2 commits
- 19 Sep, 2019 1 commit
-
-
Paul Fultz II authored
* Disable fusion when winograd is used except for 3x3 * Formatting
-
- 18 Sep, 2019 2 commits
-
-
Paul authored
-
Shucai Xiao authored
* Remove extra copy in gemm * combine rocblas gemm call * clang format * fix a bug in calling rocblas function * clang format * backup of temporary changes * clang format * unify the gemm call to avoid multiple gpu implementations * clang format * remove unnecessary code * backup temp changes * clang format * fix cppcheck error * code backup * clang format * remove unnecessary synchronization function * clang format * fix bugs * clang format * more optimization related to gemm * clang format * code cleanup * implementation that can achieve better performance * clang format * temp changes to try performance * clang format * revert to previous commits * fixed review comments * clang format * fix review comments
-
- 16 Sep, 2019 4 commits
-
-
Paul Fultz II authored
* Add flags to quantize in driver * Formatting * Fix compile error
-
kahmed10 authored
* add tests, fix bug in ternary op * formatting * uncomment fusion
-
Paul Fultz II authored
-
Shucai Xiao authored
* first version of refactoring reduce operators. * clang format * refactor the gpu implementation of the reduce_mean operator * clang format * refactor gpu implementation of the reduce_sum operator * fix cppcheck error * fix cppcheck error * fix cppcheck error * fix review comments * clang format * fix a Jenkins error * fixed review comments * clang format * fix review comments * clang format * fix review comments * clang format
-
- 10 Sep, 2019 1 commit
-
-
kahmed10 authored
* rename test_gpu_miopen * remove gpu prefix
-
- 05 Sep, 2019 2 commits
-
-
Paul authored
-
mvermeulen authored
int8_quantization bug fix related to imagenet models
-
- 04 Sep, 2019 8 commits
-
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Paul authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
mvermeulen authored
Int8 quantize
-
Shucai Xiao authored
-
- 03 Sep, 2019 10 commits
-
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
-
Shucai Xiao authored
Merge branch 'int8_quantize' of https://github.com/ROCmSoftwarePlatform/AMDMIGraphX into int8_quantize
-
Shucai Xiao authored
-
Shucai Xiao authored
-
-
mvermeulen authored
-