1. 10 Sep, 2020 1 commit
    • Paul Fultz II's avatar
      Add load/save function for program (#623) · 63c5582a
      Paul Fultz II authored
      
      
      * Add save/load functions
      
      * Formatting
      
      * Add loading and saving to the driver
      
      * Formatting
      
      * Add return
      
      * Serialize the context with the program
      
      * Formatting
      
      * Add python API
      
      * Formatting
      
      * Add c/c++ apis
      
      * Formatting
      
      * Add tests
      
      * Formatting
      
      * Fix tidy error
      
      * Fix python doc
      
      * Restore python code
      
      * Add function name to errors
      
      * Formatting
      
      * Use lvalue for writing
      
      * Serialize context
      
      * Fix convolution and pooling operator for miopen
      
      * Formatting
      
      * Add const ref
      
      * Set target name to gpu
      
      * Add target tests
      
      * Formatting
      
      * Move register target to cpp file
      
      * Fix target test
      
      * Use make_target in driver
      
      * Formatting
      
      * Use make_target for the API
      
      * Formatting
      
      * Add cpu include
      
      * Increase timeout
      
      * Add more tests
      
      * Formatting
      Co-authored-by: default avatarShucai Xiao <shucai.xiao@amd.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      63c5582a
  2. 15 Nov, 2019 1 commit
    • Paul Fultz II's avatar
      Add option to do offload copying automatically (#403) · 81b0ff5d
      Paul Fultz II authored
      * Add compiler options
      
      * Add copy operators
      
      * Formatting
      
      * Use run_passes in tests
      
      * Formatting
      
      * Use run_pass in schedule test
      
      * Formatting
      
      * Add compile_options to get_passes in target
      
      * Formatting
      
      * Offload copy option
      
      * Formatting
      
      * Copy using pinned memory
      
      * Formatting
      
      * Improve performance of gpu copying
      
      * Formatting
      
      * Dont copy
      
      * Formatting
      
      * Always make an extra copy
      
      * Formatting
      
      * Remove unused write op
      
      * Add missing include
      
      * Remove copy_to_gpu function in python api
      
      * Make offload copy disabled by default on C++
      
      * Formatting
      
      * Fix tidy issues
      
      * Formatting
      
      * Fix namespace
      
      * Fix python tests
      
      * Turn clang format off since its broken
      
      * Fix compile error on gcc 5
      
      * Remove commented code
      81b0ff5d
  3. 26 Aug, 2019 2 commits
  4. 05 Mar, 2019 1 commit
  5. 21 Feb, 2019 1 commit
  6. 07 Feb, 2019 1 commit
  7. 04 Feb, 2019 6 commits
  8. 23 Jan, 2019 2 commits
  9. 22 Jan, 2019 1 commit
  10. 27 Nov, 2018 1 commit
  11. 14 Nov, 2018 1 commit
  12. 06 Nov, 2018 9 commits
  13. 02 Nov, 2018 1 commit
    • Shucai Xiao's avatar
      Remove cpu from names (#102) · 0d0778b7
      Shucai Xiao authored
      * add the slice test example on gpu.
      
      * change the gpu slice test according to comments.
      
      * rename cpu_lowering to lowering, rename cpu_target to target, so consistent with gpu side.
      
      * fix the format of a file CMakeLists.txt.
      
      * Revert "change the gpu slice test according to comments."
      
      This reverts commit 721bbb180d11811dc914d60fd8a1c91926e3f947.
      
      * Revert "add the slice test example on gpu."
      
      This reverts commit 68dabb05adffd429e5e5d10c3a1def2b06489f63.
      
      * fix a format for the file doc/src/reference/targets.rst
      0d0778b7