1. 11 Apr, 2022 1 commit
  2. 08 Apr, 2022 1 commit
  3. 06 Apr, 2022 1 commit
  4. 01 Apr, 2022 1 commit
    • Charlie Lin's avatar
      Update developer overview, fix doc CMakeLists (#1140) · 0295965d
      Charlie Lin authored
      * Fix and change doc CMakeLists
      1. Fix include directory location with hange from #1088
      2. Create a DoxygenWarningLog.txt file in <build_dir>/doc/doxygen
      3. Move compiled html or pdf files to <build_dir>/doc/[pdf, html]
      0295965d
  5. 31 Mar, 2022 1 commit
  6. 29 Mar, 2022 3 commits
  7. 28 Mar, 2022 2 commits
    • Paul Fultz II's avatar
      Use ifdef instead of comment for the auto-generated method declarations for... · 8e4d622f
      Paul Fultz II authored
      Use ifdef instead of comment for the auto-generated method declarations for type erased classes (#1138)
      
      It seems the formatting of comments are unreadable for larger methods, so instead just generate a struct with the methods in the interface and add a comment if its optional. It wraps this in #ifdef TYPE_ERASED_DECLARATION(assuming this would never be defined) instead of #if 0, so most editors can still provide syntax highlighting(although I think vscode with clangd will still gray it out unfortunately).
      8e4d622f
    • Paul Fultz II's avatar
      Use ccache for runtime compilation (#1131) · ad056b1f
      Paul Fultz II authored
      * Use ccache for runtime compilation
      ad056b1f
  8. 25 Mar, 2022 1 commit
  9. 24 Mar, 2022 1 commit
  10. 22 Mar, 2022 1 commit
  11. 21 Mar, 2022 1 commit
  12. 18 Mar, 2022 2 commits
  13. 15 Mar, 2022 2 commits
    • Umang Yadav's avatar
      Expose APIs for the MIGraphX program (#1093) · 64e79a94
      Umang Yadav authored
      API includes following
      create_module,
      get_main_module
      add_instruction without module args
      add_instruction with module args
      add_parameter
      add_return
      64e79a94
    • Paul Fultz II's avatar
      Add iterators to kernels tensor_view and fix roialign to work with non-standard shape (#1126) · 31e63991
      Paul Fultz II authored
      This adds iterators to tensor_view, which can allow kernels to work with non-standard shapes like for roialign.
      
      To improve the performance of indexing when using the iterators, the shape class was updated to use integral_constants since the compiler doesn't always fold the const values. An integral_constant will at least enforce that in the AST.
      
      Finally, since index calculations with single integers are improved, I also updated pointwise to use single index rather than multi index. There is about 4% improvement in some cases.
      31e63991
  14. 14 Mar, 2022 3 commits
  15. 11 Mar, 2022 1 commit
    • Shucai Xiao's avatar
      Improve print ins (#1096) · b3b44f5d
      Shucai Xiao authored
      The module::debug_print(ins) is very slow, which makes the trave_eval==1/2 very slow. The reason is printing an ins involves search the whole module to get the instruction, the print it.  This change is to fix that by calling module::print() to get names of all instructions of a program, then print the instruction by getting its name from a hash map.
      b3b44f5d
  16. 09 Mar, 2022 3 commits
  17. 08 Mar, 2022 1 commit
  18. 07 Mar, 2022 1 commit
  19. 04 Mar, 2022 2 commits
  20. 03 Mar, 2022 3 commits
  21. 02 Mar, 2022 3 commits
  22. 28 Feb, 2022 1 commit
  23. 25 Feb, 2022 3 commits
  24. 24 Feb, 2022 1 commit
    • Paul Fultz II's avatar
      Some cmake fixes and updates (#1088) · cd0a4aa5
      Paul Fultz II authored
      Make doc/CMakeLists.txt standalone
      Switch to use rocm-cmake modules for document generation
      Add CONFIGURE_DEPENDS to file(GLOB) so it will update without an explicit cmake run
      Add STRINGS property for build type to make it easier to switch build types with ccmake
      Various fixes and improvements
      cd0a4aa5