- 17 Jan, 2020 1 commit
-
-
Shucai Xiao authored
* add reduce operators as required by onnxruntime * clang format * remove a test since it can cause overflow * resolve cppcheck error * clang format * fix cppcheck error * clang format
-
- 13 Jan, 2020 1 commit
-
-
Shucai Xiao authored
* change char to upper case * clang format * minor changes * improve implementation of the pooling operator * clang format * improve implementation of the pooling operator * clang format * improving parsing the pooling operators * clang format * continue improve pooling operators * clang format * correct the explicit pad values for maxpool * clang format * add unit test for parsing poolig operators * clang format * fix review comments * refine the padding processing in parsing pooling operator * clang format * add more onnx unit test to have better code coverage * clang format * refine code to fix review comments * clang format * fix onnx unit test Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 10 Jan, 2020 1 commit
-
-
Shucai Xiao authored
* change to support optional outputs * clang format * add onnx test for better code coverage * add corresponding onnx file * fix review comments of handling optional program outputs * clang format * change onnx unit test to pass * clang format * refine onnx unit tests * clang format * remove unnecessary code
-
- 20 Dec, 2019 1 commit
-
-
Shucai Xiao authored
* improve unsqueeze to support negative axis and parsing scalar * clang format * add a test example for the negative axis of unsqueeze * improve the squeeze operator to support negative axis * clang format * fixed a small bug in the lrn implementation * clang format * support negative axis in argmax and argmin * clang format * improve flatten to support negative axis * clang format * change softmax/logsoftmax to support negative axis * clang format * improve transpose by adding default perm * clang format * add one more dimens for tensor size * add one more dimens for tensor size * disable conv ops fusion for non-symmetric cases * clang format * fixed review comments * move computing axis from the device function to the compute function * clang format * move computing axis from device function to the operator computing function * clang format Co-authored-by:mvermeulen <5479696+mvermeulen@users.noreply.github.com>
-
- 19 Dec, 2019 1 commit
-
-
kahmed10 authored
* initial testing * add shape op * formatting * add env variable for batch sizes * formatting * progress on driver * progress on driver * cleanup * cleanup * add and modified prev tests * formatting * remove comment * add shape op test * formatting * manually insert shape op in test * formatting * create options struct for parsers * formatting * Add documentation for python * Fix c++ documentaion * add documentation to parser * formatting * add argmin and tests * fix doc and definitions * formatting * revert test functions * formatting Co-authored-by:Paul Fultz II <pfultz2@yahoo.com>
-
- 28 Nov, 2019 1 commit
-
-
Paul authored
-
- 27 Nov, 2019 1 commit
-
-
Paul Fultz II authored
* Add experimental support for c++ output * Format * Fix syntax errors * Add resnet50 model * Formatting * Add inceptionv3 model * Formatting * Add alexnet * Formatting * Fix name of pooling mode * Formatting * Fix tidy issues * Ignore driver directory * Show accetable values
-
- 20 Nov, 2019 3 commits
- 18 Nov, 2019 1 commit
-
-
Shucai Xiao authored
* improve gather implementation to handle negative input indices * clang format * clang format * improve concat to support neg axis input * clang format * fix cppcheck error * clang format * code cleanup * clang format * fix review comments * clang format
-
- 17 Nov, 2019 1 commit
-
-
Paul authored
-
- 15 Nov, 2019 1 commit
-
-
Paul Fultz II authored
* Add compiler options * Add copy operators * Formatting * Use run_passes in tests * Formatting * Use run_pass in schedule test * Formatting * Add compile_options to get_passes in target * Formatting * Offload copy option * Formatting * Copy using pinned memory * Formatting * Improve performance of gpu copying * Formatting * Dont copy * Formatting * Always make an extra copy * Formatting * Remove unused write op * Add missing include * Remove copy_to_gpu function in python api * Make offload copy disabled by default on C++ * Formatting * Fix tidy issues * Formatting * Fix namespace * Fix python tests * Turn clang format off since its broken * Fix compile error on gcc 5 * Remove commented code
-
- 14 Nov, 2019 1 commit
-
-
kahmed10 authored
* fix pad calc * simplify ceil calc and remove extra vars * workatound for nasnet * formatting * add split and tests * formatting * fix cppcheck and clang-tidy * fix clang tidy * refactor to use vector of instruction_ref, add UNDEBUG to clang tidy * formatting * fix comment code * fix comments and tidy * formatting * fix syntax error * fix tidy * remove namespace comment
-
- 06 Nov, 2019 1 commit
-
-
Paul authored
-
- 05 Nov, 2019 1 commit
-
-
Paul authored
-
- 04 Nov, 2019 2 commits
-
-
Paul Fultz II authored
* Add functions to do multi-index for local strides as well * Formatting * Use same multi-index path for block_reduce * Formatting * Use multi-index calc in reduce * Formatting * Fix warning * Fix compiler warning * Disable some tidy checks
-
Paul Fultz II authored
* Fix bug in eliminate_concat * Formatting * Skip context_free operators * Formatting * Fix unit test * Formatting
-
- 30 Oct, 2019 1 commit
-
-
Paul Fultz II authored
* Enable scheduler for 1 stream * Formatting * Improve performance of sorting * Formatting * Adjust the weight calculation * Formatting * Simplify formula * Formatting * Avoid division by zero * Fix scheduler test * Check for either 1 or 2 * Check for waits when order may change * Formatting
-
- 28 Oct, 2019 1 commit
-
-
Paul Fultz II authored
-
- 25 Oct, 2019 2 commits
-
-
Shucai Xiao authored
* simplify cpu implementation of the convolution, softmax, and logsoftmax * clang format * fix cppcheck error * improve code coverage
-
Shucai Xiao authored
* support onnx file with initializer data not an input * fix format * fix review comments and format issue * format * reorder alphabetically
-
- 24 Oct, 2019 1 commit
-
-
kahmed10 authored
* fix pad calc * simplify ceil calc and remove extra vars * change dilation calculation, add tests * formatting * formatting
-
- 21 Oct, 2019 1 commit
-
-
Paul Fultz II authored
* Print graph from the driver * Formatting
-
- 16 Oct, 2019 3 commits
-
-
Paul Fultz II authored
* Flatten nested concats * Formatting * Rename tests
-
kahmed10 authored
* remove binaries replaced by driver * remove source files * add back env var
-
Paul Fultz II authored
* Fuse convolution adds * Formatting * Fuse more 1x1 convs * Add some tests * Formatting * Add test for 1x1 * Add verification for add-conv fusions * Fix stride calculation * Formatting * Add more tests * Rename tests
-
- 15 Oct, 2019 2 commits
-
-
Paul Fultz II authored
* Add more shape operators that can be nops * Dont remove pooling
-
Paul Fultz II authored
* use 32bit integers for indices * Formatting * Update more index types * Formatting
-
- 10 Oct, 2019 1 commit
-
-
Shucai Xiao authored
* accelerate conflict table computation * removed an unnecessary comma
-
- 09 Oct, 2019 1 commit
-
-
Paul Fultz II authored
* Fix bug in bert accuraccy * Formatting * add another test * Fix add and overflow * Formatting * Fix bug in shape_for_each * Use front instead of iterator * Use result.front() * Split add_unary files * Formatting * Fix incorrect last index * Remove comment * Inline function * Fix carry check * Fix metadata errors * Formatting * Reflow * Reflow
-
- 07 Oct, 2019 1 commit
-
-
Paul Fultz II authored
* Implement fast-div for index calculations * Formatting * Use fast_div for broadcasts * Formatting * Add remiander function * Compute mult-index using lens instead of strides * Formatting * Simplify equation * Formatting
-
- 04 Oct, 2019 1 commit
-
-
kahmed10 authored
* initial testing of add_clip fusion * formatting * clipped relu fusion * formatting * remove some executables, add fusion test * formatting * remove clipped_relu code * fix clang-tidy * revert changes to cmake files * remove fusion from weight map * formatting * fix syntax error * formatting * fix syntax error * fix syntax error * formatting
-
- 03 Oct, 2019 2 commits
-
-
Shucai Xiao authored
* fixed a bug related to removing gemm copy * clang format * fix review comments * clang format * fix unit test failure * fix review comments * clang format
-
Paul Fultz II authored
* Add env to trace nary device functions * Formatting * Improve contiguous and concat performance * Formatting * Remove unused variable * Formatting * Fix gpu tests * Formatting * Add more test for transposed concat * Formatting * Compute offset and not index * Compute multi-index once * Formatting * Fix transposed inputs * Formatting * Use product order for comparisons of hip_array * Formatting * Add missing s parameter * Formatting * Dont invert permutation * Fix tidy warnings * Formatting * Remove incorrect license * Use a single integer for stride * Formatting * Fix tidy issue
-
- 02 Oct, 2019 1 commit
-
-
kahmed10 authored
* test hook * test hook * test hook * test hook * test hook * test hook * test hook * test hook * test hook * test hook * test hook * test hook * test hook * test hook * test hook * test hook * test hook * test hook * test hook * test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook * # This is a combination of 3 commits. # The first commit's message is: # This is a combination of 2 commits. # The first commit's message is: test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook test hook # This is the 2nd commit message: test hook # This is the 2nd commit message: test hook # This is the 3rd commit message: test hook * fix format * fix format * ignore doc dir * fix regex * fix jenkins error * exclude another dir * formatting test_array * fix version of yapf * test hook * formatting * reinclude dirs
-
- 30 Sep, 2019 1 commit
-
-
Paul authored
-
- 27 Sep, 2019 1 commit
-
-
Shucai Xiao authored
* add two operators ceil and floor * clang format * add unit test for the ceil and floor operators * remove unintended code
-
- 26 Sep, 2019 1 commit
-
-
Paul Fultz II authored
* Fix compiler crash in TF inceptionv4 * Formatting * Remove else
-
- 25 Sep, 2019 1 commit
-
-
Shucai Xiao authored
* first version of refactoring reduce operators. * clang format * refactor the gpu implemantation of the reduce_mean operator * clang format * refactor gpu implementation of the resuce_sum operator * fix cpp check error * fix cppcheck error * fix cppcheck error * fix review comments * clang format * fix a jenkin error * fixed review comments * clang format * fix review comments * clang format * fix review comments * clang format * add implemenation of reduce_min and reduce_max * clang format * add unit test for reduce_min/max operator * clang format * add more unit tests * clang format * fix review comments
-