"test/verify/test_slice_reverse.cpp" did not exist on "57c474ccc786a81fdfe90b675fc1634a54ad85c1"
  1. 20 Nov, 2023 1 commit
  2. 15 Nov, 2023 6 commits
  3. 13 Oct, 2023 1 commit
  4. 06 Oct, 2023 1 commit
  5. 21 Sep, 2023 1 commit
  6. 16 Sep, 2023 1 commit
  7. 10 Aug, 2023 1 commit
  8. 08 Aug, 2023 1 commit
  9. 30 Jul, 2023 1 commit
    • Paul Fultz II's avatar
      Enable tuning for MLIR (#1965) · be6ecff6
      Paul Fultz II authored
      * Add initial tuning support
      
      * Format
      
      * Add extra param
      
      * Format
      
      * Use exauhstive flag
      
      * Format
      
      * Set expected shapes
      
      * Format
      
      * Format
      
      * Fix missing symbol
      
      * Format
      
      * Add missing license header
      
      * Format
      
      * Update src/targets/gpu/include/migraphx/gpu/mlir.hpp
      be6ecff6
  10. 28 Jul, 2023 1 commit
  11. 06 Jul, 2023 1 commit
  12. 02 Jul, 2023 1 commit
    • Paul Fultz II's avatar
      Improvement to ck integration (#1859) · 3c9df3b4
      Paul Fultz II authored
      Add a CI job to test CK
      Add MIGRAPHX_TUNE_CK env variable to only do tuning for CK
      Continue tuning even when there is invalid configs
      Fix a bug with parallel compilation not using all available threads
      Add additional test for gemms using half types
      Removed int32 as supported type since it doesnt pass our test suite
      3c9df3b4
  13. 17 Jun, 2023 1 commit
    • turneram's avatar
      Update CK commit hash and add gfx940 to supported archs (#1842) · b8898d7e
      turneram authored
      * Add initial ck_gemm code
      
      * Format
      
      * Add additional src files
      
      * Format
      
      * Add include
      
      * Simplify fuse_ck
      
      * Format
      
      * Rename var
      
      * Enable pass
      
      * Update ck version
      
      * Fix include
      
      * Add group stride
      
      * Disable warnings for ck headers
      
      * Format
      
      * Add unpack array
      
      * Add interface to enable tuning
      
      * Format
      
      * Update compile_ops to handle tuning config
      
      * Format
      
      * Add some comments
      
      * Move time_op to migraphx_gpu
      
      * Add banchmarking
      
      * Refactor
      
      * Format
      
      * Add lift class macro
      
      * Use device name
      
      * Format
      
      * Generate configs
      
      * Format
      
      * Pass tuning parameter
      
      * Move data type to is_ck_gemm matcher
      
      * Format
      
      * Add problem_cache to avoid retuning same configs
      
      * Format
      
      * Format
      
      * Mark the problems
      
      * Format
      
      * Use is_null
      
      * Format
      
      * Resize vector
      
      * Only tune with exaustive tuning
      
      * Format
      
      * Use assert
      
      * FOrmat
      
      * Tidy fixes
      
      * More tidy fixes
      
      * Format
      
      * Add license to missing files
      
      * Format
      
      * Use transform
      
      * Format
      
      * Fix tidy
      
      * Format
      
      * Fix cppcheck issues
      
      * Format
      
      * Add static_assert
      
      * Add ops header
      
      * Add assertion in batcher
      
      * Format
      
      * Improve the batch fold check
      
      * Format
      
      * Add where op workaround for CK
      
      * Skip if any input is not a supported ck type
      
      * Format
      
      * Check batch is standard
      
      * Format
      
      * Remove redundant static keyword
      
      * Update commit hash
      
      * Fix error when running without --exhaustive-tune
      
      * Formatting
      
      * Formatting
      
      * Remove fuse_ck_gemm_softmax_gemm
      
      * Update ck hash
      
      * Correct spelling mistake
      
      * Remove commented out logic from fuse_ck
      
      * Remove unused include and add comment
      
      * Formatting
      
      * Remove redundant get_shape and remove ck_gemm from names
      
      * Formatting
      
      * Allow for mixed types with int8 gemms
      
      * Formatting
      
      * Add back find_package from merge
      
      * Update CK commit hash and add gfx940 to fuse_ops supported archs
      
      * Formatting
      
      * Update CK hash
      b8898d7e
  14. 09 Jun, 2023 1 commit
  15. 08 Jun, 2023 1 commit
  16. 24 May, 2023 1 commit
  17. 06 Apr, 2023 1 commit
  18. 29 Mar, 2023 1 commit
  19. 27 Mar, 2023 1 commit
  20. 10 Mar, 2023 1 commit
  21. 16 Feb, 2023 1 commit
  22. 17 Jan, 2023 1 commit
  23. 09 Jan, 2023 1 commit
  24. 06 Dec, 2022 1 commit
  25. 02 Nov, 2022 2 commits
  26. 27 Oct, 2022 1 commit
    • kahmed10's avatar
      Add JIT pad (#1411) · 0d841ded
      kahmed10 authored
      updated GPU pad to now use JIT version.
      added range functions for JIT kernels.
      0d841ded
  27. 19 Oct, 2022 1 commit
    • Charlie Lin's avatar
      Refactor dynamic compute; Dynamic ref unary functions (#1407) · 693cb5d8
      Charlie Lin authored
      Refactor dynamic compute
      - add a compute_output_shape object that implicitly converts to a new dyn_output or shape object
      - dyn_output object can handle computing the static output shape of an operator given the input arguments shapes
        change an operator's compute function to argument compute(const dyn_output& dyn_out, std::vector<argument> args) to 
        use dyn_output object
      
      Dynamic ref unary functions
      -  Included these changes to have an example of the refactored dynamic compute being used
      -  Changes to unary base class to handle dynamic shapes
      -  Changed elu and leaky_relu to use unary base class and pointwise JIT
      693cb5d8
  28. 18 Oct, 2022 1 commit
  29. 04 Oct, 2022 1 commit
  30. 29 Sep, 2022 1 commit
  31. 26 Sep, 2022 1 commit
  32. 21 Sep, 2022 1 commit
  33. 14 Sep, 2022 1 commit
  34. 08 Sep, 2022 1 commit