1. 11 Apr, 2022 2 commits
    • bpickrel's avatar
      scatter operator refactoring to include reduction (#1124) · 701c2014
      bpickrel authored
      Change the "scatter" struct and op to a base/child set of three: scatter_none, scatter_add, scatter_mul to mirror Onnx' ScatterElements op. and its three reduction options. (Onnx Scatter op is deprecated and is equivalent to scatter_none.)
      
      Provides both a reference op. and update to Onnx parsing. Tests updated and new test case added.
      701c2014
    • Shucai Xiao's avatar
      fix a bug in create tensor_view with vec data type (#1155) · 3c301efa
      Shucai Xiao authored
      When create a tensor_view with vector date type, the last dimension of the shape should be divided by the vec_size.
      3c301efa
  2. 08 Apr, 2022 1 commit
  3. 06 Apr, 2022 1 commit
  4. 01 Apr, 2022 1 commit
    • Charlie Lin's avatar
      Update developer overview, fix doc CMakeLists (#1140) · 0295965d
      Charlie Lin authored
      * Fix and change doc CMakeLists
      1. Fix include directory location with hange from #1088
      2. Create a DoxygenWarningLog.txt file in <build_dir>/doc/doxygen
      3. Move compiled html or pdf files to <build_dir>/doc/[pdf, html]
      0295965d
  5. 31 Mar, 2022 1 commit
  6. 29 Mar, 2022 3 commits
  7. 28 Mar, 2022 2 commits
    • Paul Fultz II's avatar
      Use ifdef instead of comment for the auto-generated method declarations for... · 8e4d622f
      Paul Fultz II authored
      Use ifdef instead of comment for the auto-generated method declarations for type erased classes (#1138)
      
      It seems the formatting of comments are unreadable for larger methods, so instead just generate a struct with the methods in the interface and add a comment if its optional. It wraps this in #ifdef TYPE_ERASED_DECLARATION(assuming this would never be defined) instead of #if 0, so most editors can still provide syntax highlighting(although I think vscode with clangd will still gray it out unfortunately).
      8e4d622f
    • Paul Fultz II's avatar
      Use ccache for runtime compilation (#1131) · ad056b1f
      Paul Fultz II authored
      * Use ccache for runtime compilation
      ad056b1f
  8. 25 Mar, 2022 1 commit
  9. 24 Mar, 2022 1 commit
  10. 22 Mar, 2022 1 commit
  11. 21 Mar, 2022 1 commit
  12. 18 Mar, 2022 2 commits
  13. 15 Mar, 2022 2 commits
    • Umang Yadav's avatar
      Expose APIs for the MIGraphX program (#1093) · 64e79a94
      Umang Yadav authored
      API includes following
      create_module,
      get_main_module
      add_instruction without module args
      add_instruction with module args
      add_parameter
      add_return
      64e79a94
    • Paul Fultz II's avatar
      Add iterators to kernels tensor_view and fix roialign to work with non-standard shape (#1126) · 31e63991
      Paul Fultz II authored
      This adds iterators to tensor_view, which can allow kernels to work with non-standard shapes like for roialign.
      
      To improve the performance of indexing when using the iterators, the shape class was updated to use integral_constants since the compiler doesn't always fold the const values. An integral_constant will at least enforce that in the AST.
      
      Finally, since index calculations with single integers are improved, I also updated pointwise to use single index rather than multi index. There is about 4% improvement in some cases.
      31e63991
  14. 14 Mar, 2022 3 commits
  15. 11 Mar, 2022 1 commit
    • Shucai Xiao's avatar
      Improve print ins (#1096) · b3b44f5d
      Shucai Xiao authored
      The module::debug_print(ins) is very slow, which makes the trave_eval==1/2 very slow. The reason is printing an ins involves search the whole module to get the instruction, the print it.  This change is to fix that by calling module::print() to get names of all instructions of a program, then print the instruction by getting its name from a hash map.
      b3b44f5d
  16. 09 Mar, 2022 3 commits
  17. 08 Mar, 2022 1 commit
  18. 07 Mar, 2022 1 commit
  19. 04 Mar, 2022 2 commits
  20. 03 Mar, 2022 3 commits
  21. 02 Mar, 2022 3 commits
  22. 28 Feb, 2022 1 commit
  23. 25 Feb, 2022 3 commits