"git@developer.sourcefind.cn:OpenDAS/torchaudio.git" did not exist on "71214b48548b1dcb6ebd581dd36a9d0e60af6837"
  1. 27 Jun, 2023 1 commit
    • Ted Themistokleous's avatar
      Fix Nonzero to track data value with sentinel value based on elements · 28727db2
      Ted Themistokleous authored
      We can't change the behaviour of the nonzero op and we currently pad the output
      with zeros. This unfortunately obfuscates the following cases:
      
      1. When the only nonzero element is the first index - the whole tensor is padded
      with zeros its not obvious if the first value is valid index or padded
      
      2. When the nonzero elements vector is used for indicies. The resulting vector
         with the padded value of 0 is still a valid index thus gather/gatherND and other ops
         will assume the 0 index is valid and operate accordingly.
      
      In this case, by adding a sentinel value of the number of static elements used
      by the desired shape, the resulting nonzero output can now track how many elements
      are valid by determining the value in the correct range.
      
      Originally I intended to use -1 but not all datatypes use this if say, we're dealing with
      unsigned values in our vectors or booleans.
      28727db2
  2. 26 Jun, 2023 1 commit
  3. 23 Jun, 2023 1 commit
  4. 22 Jun, 2023 2 commits
  5. 21 Jun, 2023 2 commits
  6. 20 Jun, 2023 1 commit
  7. 17 Jun, 2023 3 commits
    • Ted Themistokleous's avatar
      Add trace for SIMPLIFY_ALGEBRA matches (#1838) · a0fa3742
      Ted Themistokleous authored
      * Add trace for SIMPLIFY_ALGEBRA matches
      
      * Fix format
      
      * handle review comments from Umang
      
      -int to size_t for trace
      -move env arg to top of simplify_algebra.cpp
      -handle overload beter for find_matches
      
      * Rename trace_mod param to trace_pass
      
      More representative naming for what this trace flag does
      a0fa3742
    • turneram's avatar
      Update CK commit hash and add gfx940 to supported archs (#1842) · b8898d7e
      turneram authored
      * Add initial ck_gemm code
      
      * Format
      
      * Add additional src files
      
      * Format
      
      * Add include
      
      * Simplify fuse_ck
      
      * Format
      
      * Rename var
      
      * Enable pass
      
      * Update ck version
      
      * Fix include
      
      * Add group stride
      
      * Disable warnings for ck headers
      
      * Format
      
      * Add unpack array
      
      * Add interface to enable tuning
      
      * Format
      
      * Update compile_ops to handle tuning config
      
      * Format
      
      * Add some comments
      
      * Move time_op to migraphx_gpu
      
      * Add banchmarking
      
      * Refactor
      
      * Format
      
      * Add lift class macro
      
      * Use device name
      
      * Format
      
      * Generate configs
      
      * Format
      
      * Pass tuning parameter
      
      * Move data type to is_ck_gemm matcher
      
      * Format
      
      * Add problem_cache to avoid retuning same configs
      
      * Format
      
      * Format
      
      * Mark the problems
      
      * Format
      
      * Use is_null
      
      * Format
      
      * Resize vector
      
      * Only tune with exaustive tuning
      
      * Format
      
      * Use assert
      
      * FOrmat
      
      * Tidy fixes
      
      * More tidy fixes
      
      * Format
      
      * Add license to missing files
      
      * Format
      
      * Use transform
      
      * Format
      
      * Fix tidy
      
      * Format
      
      * Fix cppcheck issues
      
      * Format
      
      * Add static_assert
      
      * Add ops header
      
      * Add assertion in batcher
      
      * Format
      
      * Improve the batch fold check
      
      * Format
      
      * Add where op workaround for CK
      
      * Skip if any input is not a supported ck type
      
      * Format
      
      * Check batch is standard
      
      * Format
      
      * Remove redundant static keyword
      
      * Update commit hash
      
      * Fix error when running without --exhaustive-tune
      
      * Formatting
      
      * Formatting
      
      * Remove fuse_ck_gemm_softmax_gemm
      
      * Update ck hash
      
      * Correct spelling mistake
      
      * Remove commented out logic from fuse_ck
      
      * Remove unused include and add comment
      
      * Formatting
      
      * Remove redundant get_shape and remove ck_gemm from names
      
      * Formatting
      
      * Allow for mixed types with int8 gemms
      
      * Formatting
      
      * Add back find_package from merge
      
      * Update CK commit hash and add gfx940 to fuse_ops supported archs
      
      * Formatting
      
      * Update CK hash
      b8898d7e
    • Umang Yadav's avatar
      Fix convert operation for NaNs (#1840) · 2d635f91
      Umang Yadav authored
      * Fix convert for the NaNs
      
      * NaNs can't be compared, use std::isnan()
      
      * formatting
      
      * formatting
      
      * formatting
      
      * add extra tests
      2d635f91
  8. 16 Jun, 2023 2 commits
  9. 15 Jun, 2023 2 commits
    • Umang Yadav's avatar
      use __hmax, __hmin (#1813) · d208adfc
      Umang Yadav authored
      d208adfc
    • Brian Pickrell's avatar
      fix parse_instancenorm to create broadcast and multibroadcast instruc… (#1715) · 41ba30d5
      Brian Pickrell authored
      * fix parse_instancenorm to create broadcast and multibroadcast instructions with two dynamic shape arguments instead of 1.  Their make_op() functions don't support dynamic shapes when called with one input.  This caused an error when parsing an ONNX 3duunet model
      
      * Use add_common_op() to create multibroadcast op.
      
      * add verification and parsing test for instance_norm with dynamic input.  Parse test doesn't pass.
      
      * fix for test; still doesn't pass
      
      * another fix for test; still doesn't pass
      
      * work in progress, instance_norm_dyn_batch_test works but instance_norm_test doesn't
      
      * fix onnx instancenorm tests to match parser changes.  Passes all check tests
      
      * Updated comments explaining usage of add_common_op()
      
      * hand-merged conflicts with develop
      
      * fix instance_norm_half_test after merge
      
      * add Onnx test instance_norm_dyn_batch_half_test
      
      * add shape test cases broadcast_1in_dyn_error and multibroadcast_1in_dyn_error_0
      41ba30d5
  10. 14 Jun, 2023 2 commits
  11. 13 Jun, 2023 1 commit
  12. 12 Jun, 2023 1 commit
  13. 09 Jun, 2023 3 commits
  14. 08 Jun, 2023 2 commits
  15. 06 Jun, 2023 2 commits
  16. 05 Jun, 2023 1 commit
  17. 04 Jun, 2023 1 commit
  18. 02 Jun, 2023 1 commit
  19. 01 Jun, 2023 1 commit
  20. 31 May, 2023 2 commits
  21. 30 May, 2023 2 commits
  22. 29 May, 2023 2 commits
  23. 28 May, 2023 1 commit
  24. 25 May, 2023 1 commit
  25. 24 May, 2023 2 commits