  1. 08 Apr, 2020 1 commit
  2. 29 Mar, 2020 1 commit
  3. 07 Mar, 2020 1 commit
  4. 12 Feb, 2020 1 commit
  5. 10 Feb, 2020 1 commit
    • Add additional simple operators (MatMulInteger, ConvInteger, Asinh, Acosh, and Atanh) (#431) · a023ec19
      Shucai Xiao authored
      
      
      * Add initial api
      
      * Formatting
      
      * Add more api
      
      * Formatting
      
      * add more operators (asinh, acosh, atanh, MatMulInteger, ConvInteger)
      
      * clang format
      
      * add unit tests for new operators
      
      * clang format
      
      * Add auto api generation
      
      * Formatting
      
      * Fix some compilation errors
      
      * Change handle struct
      
      * Formatting
      
      * Fix remaining compilation errors
      
      * Formatting
      
      * Simplify using ctype
      
      * Formatting
      
      * Initial c++ generation
      
      * Formatting
      
      * Add C++header
      
      * Formatting
      
      * Add test
      
      * Formatting
      
      * Add initial tests
      
      * Formatting
      
      * Try to fix formatting
      
      * Cleanup formatting
      
      * Formatting
      
      * Fix constructors on the same line
      
      * Fix tests
      
      * Formatting
      
      * Fix tidy issues
      
      * Fix tidy issues
      
      * Fix naming issue
      
      * Add onnx API to parse buffer
      
      * Formatting
      
      * Add arguments api
      
      * Formatting
      
      * Fix verify parameters
      
      * Fix cppcheck issues
      
      * Formatting
      
      * Add method to get output shapes and bytes
      
      * Formatting
      
      * Try formatting
      
      * Formatting
      
      * Improve the test coverage
      
      * Formatting
      
      * Add print method
      
      * Formatting
      
      * Fix cppcheck issue
      
      * Fix package dependency
      
      * Add nolint
      
      * Try fix formatting
      
      * Formatting
      
      * formatting
      
      * formatting
      
      * Fix formatting
      Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
      Co-authored-by: kahmed10 <15948690+kahmed10@users.noreply.github.com>
  6. 17 Jan, 2020 1 commit
    • Reduce operators (#427) · e320f89f
      Shucai Xiao authored
      * add reduce operators as required by onnxruntime
      
      * clang format
      
      * remove a test since it can cause overflow
      
      * resolve cppcheck error
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
  7. 20 Dec, 2019 1 commit
    • Improve operators for onnxruntime (#405) · 992666e6
      Shucai Xiao authored
      
      
      * improve unsqueeze to support negative axis and parsing scalar
      
      * clang format
      
      * add a test example for the negative axis of unsqueeze
      
      * improve the squeeze operator to support negative axis
      
      * clang format
      
      * fixed a small bug in the lrn implementation
      
      * clang format
      
      * support negative axis in argmax and argmin
      
      * clang format
      
      * improve flatten to support negative axis
      
      * clang format
      
      * change softmax/logsoftmax to support negative axis
      
      * clang format
      
      * improve transpose by adding default perm
      
      * clang format
      
      * add one more dimension for tensor size
      
      * add one more dimension for tensor size
      
      * disable conv ops fusion for non-symmetric cases
      
      * clang format
      
      * fixed review comments
      
      * move computing axis from the device function to the compute function
      
      * clang format
      
      * move computing axis from device function to the operator computing function
      
      * clang format
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  8. 18 Nov, 2019 1 commit
    • Improve concat gather (#402) · 0045d0b7
      Shucai Xiao authored
      * improve gather implementation to handle negative input indices
      
      * clang format
      
      * clang format
      
      * improve concat to support neg axis input
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * code cleanup
      
      * clang format
      
      * fix review comments
      
      * clang format
  9. 15 Nov, 2019 1 commit
    • Add option to do offload copying automatically (#403) · 81b0ff5d
      Paul Fultz II authored
      * Add compiler options
      
      * Add copy operators
      
      * Formatting
      
      * Use run_passes in tests
      
      * Formatting
      
      * Use run_pass in schedule test
      
      * Formatting
      
      * Add compile_options to get_passes in target
      
      * Formatting
      
      * Offload copy option
      
      * Formatting
      
      * Copy using pinned memory
      
      * Formatting
      
      * Improve performance of gpu copying
      
      * Formatting
      
      * Don't copy
      
      * Formatting
      
      * Always make an extra copy
      
      * Formatting
      
      * Remove unused write op
      
      * Add missing include
      
      * Remove copy_to_gpu function in python api
      
      * Make offload copy disabled by default on C++
      
      * Formatting
      
      * Fix tidy issues
      
      * Formatting
      
      * Fix namespace
      
      * Fix python tests
      
      * Turn clang format off since it's broken
      
      * Fix compile error on gcc 5
      
      * Remove commented code
  10. 04 Nov, 2019 1 commit
    • Use multi_index for reductions (#400) · 3e0496fb
      Paul Fultz II authored
      * Add functions to do multi-index for local strides as well
      
      * Formatting
      
      * Use same multi-index path for block_reduce
      
      * Formatting
      
      * Use multi-index calc in reduce
      
      * Formatting
      
      * Fix warning
      
      * Fix compiler warning
      
      * Disable some tidy checks
  11. 15 Oct, 2019 1 commit
  12. 09 Oct, 2019 1 commit
    • Fix bug in bert accuracy (#385) · a797f890
      Paul Fultz II authored
      * Fix bug in bert accuracy
      
      * Formatting
      
      * add another test
      
      * Fix add and overflow
      
      * Formatting
      
      * Fix bug in shape_for_each
      
      * Use front instead of iterator
      
      * Use result.front()
      
      * Split add_unary files
      
      * Formatting
      
      * Fix incorrect last index
      
      * Remove comment
      
      * Inline function
      
      * Fix carry check
      
      * Fix metadata errors
      
      * Formatting
      
      * Reflow
      
      * Reflow
  13. 07 Oct, 2019 1 commit
  14. 04 Oct, 2019 1 commit
    • Add_clip fusion (#370) · 1398bcc1
      kahmed10 authored
      * initial testing of add_clip fusion
      
      * formatting
      
      * clipped relu fusion
      
      * formatting
      
      * remove some executables, add fusion test
      
      * formatting
      
      * remove clipped_relu code
      
      * fix clang-tidy
      
      * revert changes to cmake files
      
      * remove fusion from weight map
      
      * formatting
      
      * fix syntax error
      
      * formatting
      
      * fix syntax error
      
      * fix syntax error
      
      * formatting
  15. 03 Oct, 2019 1 commit
    • Improve contiguous and concat performance (#368) · 9b55685c
      Paul Fultz II authored
      * Add env to trace nary device functions
      
      * Formatting
      
      * Improve contiguous and concat performance
      
      * Formatting
      
      * Remove unused variable
      
      * Formatting
      
      * Fix gpu tests
      
      * Formatting
      
      * Add more test for transposed concat
      
      * Formatting
      
      * Compute offset and not index
      
      * Compute multi-index once
      
      * Formatting
      
      * Fix transposed inputs
      
      * Formatting
      
      * Use product order for comparisons of hip_array
      
      * Formatting
      
      * Add missing s parameter
      
      * Formatting
      
      * Don't invert permutation
      
      * Fix tidy warnings
      
      * Formatting
      
      * Remove incorrect license
      
      * Use a single integer for stride
      
      * Formatting
      
      * Fix tidy issue
  16. 27 Sep, 2019 1 commit
    • Ceil floor operators (#375) · 7d06cdbd
      Shucai Xiao authored
      * add two operators ceil and floor
      
      * clang format
      
      * add unit test for the ceil and floor operators
      
      * remove unintended code
  17. 25 Sep, 2019 1 commit
    • Reduce_min/max operators (#363) · 3962c2ad
      Shucai Xiao authored
      * first version of refactoring reduce operators.
      
      * clang format
      
      * refactor the gpu implementation of the reduce_mean operator
      
      * clang format
      
      * refactor gpu implementation of the reduce_sum operator
      
      * fix cppcheck error
      
      * fix cppcheck error
      
      * fix cppcheck error
      
      * fix review comments
      
      * clang format
      
      * fix a Jenkins error
      
      * fixed review comments
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * add implementation of reduce_min and reduce_max
      
      * clang format
      
      * add unit test for reduce_min/max operator
      
      * clang format
      
      * add more unit tests
      
      * clang format
      
      * fix review comments
  18. 16 Sep, 2019 1 commit
  19. 20 Aug, 2019 3 commits
  20. 15 Aug, 2019 2 commits
  21. 14 Aug, 2019 1 commit
  22. 13 Aug, 2019 2 commits
  23. 12 Aug, 2019 2 commits
  24. 05 Aug, 2019 1 commit
  25. 03 Aug, 2019 1 commit
  26. 02 Aug, 2019 4 commits
  27. 01 Aug, 2019 5 commits
  28. 24 Jul, 2019 1 commit