1. 29 Jun, 2023 2 commits
  2. 28 Jun, 2023 2 commits
  3. 26 Jun, 2023 1 commit
  4. 23 Jun, 2023 1 commit
  5. 22 Jun, 2023 1 commit
  6. 21 Jun, 2023 2 commits
  7. 17 Jun, 2023 3 commits
    • Add trace for SIMPLIFY_ALGEBRA matches (#1838) · a0fa3742
      Ted Themistokleous authored
      * Add trace for SIMPLIFY_ALGEBRA matches
      
      * Fix format
      
      * handle review comments from Umang
      
      -int to size_t for trace
      -move env arg to top of simplify_algebra.cpp
      -handle overload better for find_matches
      
      * Rename trace_mod param to trace_pass
      
      More representative naming for what this trace flag does
    • Update CK commit hash and add gfx940 to supported archs (#1842) · b8898d7e
      turneram authored
      * Add initial ck_gemm code
      
      * Format
      
      * Add additional src files
      
      * Format
      
      * Add include
      
      * Simplify fuse_ck
      
      * Format
      
      * Rename var
      
      * Enable pass
      
      * Update ck version
      
      * Fix include
      
      * Add group stride
      
      * Disable warnings for ck headers
      
      * Format
      
      * Add unpack array
      
      * Add interface to enable tuning
      
      * Format
      
      * Update compile_ops to handle tuning config
      
      * Format
      
      * Add some comments
      
      * Move time_op to migraphx_gpu
      
      * Add benchmarking
      
      * Refactor
      
      * Format
      
      * Add lift class macro
      
      * Use device name
      
      * Format
      
      * Generate configs
      
      * Format
      
      * Pass tuning parameter
      
      * Move data type to is_ck_gemm matcher
      
      * Format
      
      * Add problem_cache to avoid retuning same configs
      
      * Format
      
      * Format
      
      * Mark the problems
      
      * Format
      
      * Use is_null
      
      * Format
      
      * Resize vector
      
      * Only tune with exhaustive tuning
      
      * Format
      
      * Use assert
      
      * Format
      
      * Tidy fixes
      
      * More tidy fixes
      
      * Format
      
      * Add license to missing files
      
      * Format
      
      * Use transform
      
      * Format
      
      * Fix tidy
      
      * Format
      
      * Fix cppcheck issues
      
      * Format
      
      * Add static_assert
      
      * Add ops header
      
      * Add assertion in batcher
      
      * Format
      
      * Improve the batch fold check
      
      * Format
      
      * Add where op workaround for CK
      
      * Skip if any input is not a supported ck type
      
      * Format
      
      * Check batch is standard
      
      * Format
      
      * Remove redundant static keyword
      
      * Update commit hash
      
      * Fix error when running without --exhaustive-tune
      
      * Formatting
      
      * Formatting
      
      * Remove fuse_ck_gemm_softmax_gemm
      
      * Update ck hash
      
      * Correct spelling mistake
      
      * Remove commented out logic from fuse_ck
      
      * Remove unused include and add comment
      
      * Formatting
      
      * Remove redundant get_shape and remove ck_gemm from names
      
      * Formatting
      
      * Allow for mixed types with int8 gemms
      
      * Formatting
      
      * Add back find_package from merge
      
      * Update CK commit hash and add gfx940 to fuse_ops supported archs
      
      * Formatting
      
      * Update CK hash
    • Fix convert operation for NaNs (#1840) · 2d635f91
      Umang Yadav authored
      * Fix convert for the NaNs
      
      * NaNs can't be compared, use std::isnan()
      
      * formatting
      
      * formatting
      
      * formatting
      
      * add extra tests
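The note in #1840 that NaNs cannot be compared reflects IEEE 754 behaviour: any `==` or ordered comparison involving a NaN evaluates to false, so a NaN has to be detected with `std::isnan()` instead. The following is a minimal sketch of that idea, not the code from the commit; `naive_equal`, `nan_aware_equal`, and `demo` are hypothetical names introduced only for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Hypothetical helpers for illustration only; these are not MIGraphX functions.

// Plain equality: always false when either operand is NaN.
inline bool naive_equal(double a, double b) { return a == b; }

// NaN-aware equality: two NaNs count as matching, a NaN never matches a number.
inline bool nan_aware_equal(double a, double b)
{
    if(std::isnan(a) || std::isnan(b))
        return std::isnan(a) && std::isnan(b);
    return a == b;
}

// Small self-check showing the difference between the two comparisons.
inline void demo()
{
    const double qnan = std::numeric_limits<double>::quiet_NaN();
    assert(!naive_equal(qnan, qnan));     // NaN == NaN is false
    assert(nan_aware_equal(qnan, qnan));  // detected through std::isnan
    assert(!nan_aware_equal(qnan, 1.0));  // NaN never matches a real value
    assert(nan_aware_equal(2.0, 2.0));    // ordinary values compare as usual
}
```

Calling `demo()` from a test exercises all four cases. Whether two NaNs should count as equal is a design choice that suits tests on converted values; the actual change in the commit may treat NaNs differently.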
  8. 16 Jun, 2023 1 commit
  9. 15 Jun, 2023 2 commits
    • use __hmax, __hmin (#1813) · d208adfc
      Umang Yadav authored
    • fix parse_instancenorm to create broadcast and multibroadcast instruc… (#1715) · 41ba30d5
      Brian Pickrell authored
      * fix parse_instancenorm to create broadcast and multibroadcast instructions with two dynamic shape arguments instead of one. Their make_op() functions don't support dynamic shapes when called with one input. This caused an error when parsing an ONNX 3duunet model.
      
      * Use add_common_op() to create multibroadcast op.
      
      * add verification and parsing test for instance_norm with dynamic input.  Parse test doesn't pass.
      
      * fix for test; still doesn't pass
      
      * another fix for test; still doesn't pass
      
      * work in progress, instance_norm_dyn_batch_test works but instance_norm_test doesn't
      
      * fix onnx instancenorm tests to match parser changes.  Passes all check tests
      
      * Updated comments explaining usage of add_common_op()
      
      * hand-merged conflicts with develop
      
      * fix instance_norm_half_test after merge
      
      * add ONNX test instance_norm_dyn_batch_half_test
      
      * add shape test cases broadcast_1in_dyn_error and multibroadcast_1in_dyn_error_0
  10. 14 Jun, 2023 2 commits
  11. 12 Jun, 2023 1 commit
  12. 09 Jun, 2023 3 commits
  13. 08 Jun, 2023 2 commits
  14. 06 Jun, 2023 2 commits
  15. 05 Jun, 2023 1 commit
  16. 01 Jun, 2023 1 commit
  17. 31 May, 2023 1 commit
  18. 30 May, 2023 2 commits
  19. 28 May, 2023 1 commit
  20. 25 May, 2023 1 commit
  21. 24 May, 2023 2 commits
  22. 23 May, 2023 1 commit
  23. 20 May, 2023 1 commit
  24. 19 May, 2023 1 commit
  25. 17 May, 2023 2 commits
  26. 08 May, 2023 1 commit