1. 17 Sep, 2021 3 commits
  2. 16 Sep, 2021 1 commit
    • Shucai Xiao's avatar
      Loop operator (#853) · a275f590
      Shucai Xiao authored
      
      
      Add Loop operator for opset version 13.
      Notes: 1) Default max iteration number is 10 if no max iteration number is provided
      2) To change the max iter number, a user can set the max_loop_iterations in the onnx_option struct when parsing a model.
      3) The returned shape of the scan output is from the max_loop_iterations even the actual loop num is less than that. This issue also applies to other operators like NonZero and NonMaxSuppression. A issue #948 is created to track this and to be resolved later.
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      a275f590
  3. 10 Sep, 2021 2 commits
  4. 07 Sep, 2021 3 commits
  5. 02 Sep, 2021 2 commits
  6. 01 Sep, 2021 2 commits
    • Paul Fultz II's avatar
      Add a command to the driver to list supported onnx operators (#938) · 1f741f73
      Paul Fultz II authored
      * Add a command to list supported onnx operators
      1f741f73
    • Chris Austen's avatar
      Adjust HIP_COMPILER_FLAGS to support <$:$<>:> and SHELL: tags (#933) · 33a17257
      Chris Austen authored
      
      
      In ROCm 4.5.0 hip compile flags are coming in differently.  This has
      caused some parsing issues for the HIP_COMPILER_FLAGS variable.  As an example
      
          ROCm 4.3.0: --offload-arch=gfx900
          ROCm 4.5.0: <$<COMPILE_LANGUAGE:CXX>:SHELL:--offload-arch=gfx900>
      
      Using existing code...
          $<$<COMPILE_LANGUAGE:CXX>:SHELL:--offload-arch=gfx900>
      Becomes...
          $<$<COMPILE_LANGUAGE:CXX>:SHELL:
      
      There are two problems with that.
        1) The "<" is not balanced with a "> due to the regex consuming the ">"
        2) There is still a `SHELL:`  label.
      
      This commit repairs both.  I took the regex parsing code from ROCmSoftwarePlatform/MIOpen/blame/develop/CMakeLists.txt
      but improved it to support handling of target features like
      <$<COMPILE_LANGUAGE:CXX>:SHELL:--offload-arch=gfx900:xxx+>
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      33a17257
  7. 31 Aug, 2021 3 commits
    • Paul Fultz II's avatar
      Enable constructing argument with tuple and buffer (#919) · b90d69ae
      Paul Fultz II authored
      
      
      * Improve handling of constructing a tuple from a buffer
      * Add unit test
      * Remove unused function
      Co-authored-by: default avatarShucai Xiao <shucai@gmail.com>
      b90d69ae
    • kahmed10's avatar
      Changes to support both OneDNN and ZenDNN builds (#929) · 0859fe90
      kahmed10 authored
      
      
      * Add preallocate method
      
      * Add preallocate_param pass
      
      * Preallocate buffers on the cpu
      
      * Formatting
      
      * Preallocate on the gpu
      
      * Add missing cpp file
      
      * Formatting
      
      * Add lifetime function
      
      * Formatting
      
      * Improve handling of exceptions in test driver
      
      * Formatting
      
      * Auto print exception
      
      * Formatting
      
      * Fork each test case
      
      * Formatting
      
      * Exclude gcc 5 debug build
      
      * Fix tidy issues
      
      * Add color
      
      * Formatting
      
      * Create driver class
      
      * Formatting
      
      * Customize test_case names
      
      * Formatting
      
      * Report status from forked processes
      
      * Formatting
      
      * Update the verify driver
      
      * Formatting
      
      * Print out failed tests
      
      * Formatting
      
      * Fix tidy issues
      
      * Formatting
      
      * Expect passing
      
      * Improve failure reporting on non-linux systems
      
      * Fix ifdef
      
      * Always allocate
      
      * Fix tidy warning
      
      * Flush code code cov
      
      * Formatting
      
      * Fix tidy
      
      * Add const
      
      * Check if weak symbols is linked
      
      * Formatting
      
      * initial progress
      
      * formatting
      
      * Add continue flag
      
      * Formatting
      
      * Set exe name
      
      * Use stringstream and use quotes
      
      * rename vars
      
      * formatting
      
      * more testing
      
      * formatting
      
      * Fix bug when using --continue in the tests
      
      * Formatting
      
      * revert gemm
      
      * revert dot file
      
      * rename var
      
      * update cmakelists and deconv compute
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      0859fe90
    • Shucai Xiao's avatar
      Fix debug assert (#930) · bd85a76c
      Shucai Xiao authored
      * fix two asserts for debug build
      
      * add unit test for copy parameters
      
      * clang format
      
      * add a unit test for reorder_dims
      
      * change tranpose to always require perm not be empty
      
      * clang format
      
      * remove an unnecessary line
      
      * fix tidy error
      
      * fix review comments
      bd85a76c
  8. 25 Aug, 2021 1 commit
    • Shucai Xiao's avatar
      Exclude param from deadcode elimiation (#910) · 4b86a0aa
      Shucai Xiao authored
      
      
      * always keep parameters
      
      * clang format
      
      * fix tidy error
      
      * clang format
      
      * add more unit tests to have more code coverage
      
      * fixed a bug to ensure get_parameter_names to return ordered parameter names
      
      * clang format
      
      * remove unnecessary print out
      
      * refine a code change
      
      * clang format
      
      * add a unit test to check parameter is not removed by dead code elimination
      
      * clang format
      
      * rename a function name
      Co-authored-by: default avatarChris Austen <causten@users.noreply.github.com>
      4b86a0aa
  9. 24 Aug, 2021 1 commit
    • Umang Yadav's avatar
      Change attributes names to be more consistent and reflect better meaning (#916) · 0d2606bb
      Umang Yadav authored
      * rename broadcast and multibroadcast output_lens attribute to out_lens attribute, and change tests and source code to reflect the same
      
      * change the reshape attribute from dims to out_lens
      
      * change transpose attribute's name from dims to perm to reflect better meaning
      
      * use permutation instead of perm for transpose
      
      clang formaating
      
      * use dims instead of out_lens for reshape
      
      clang formatting
      0d2606bb
  10. 20 Aug, 2021 1 commit
  11. 19 Aug, 2021 1 commit
  12. 18 Aug, 2021 2 commits
    • turneram's avatar
      Optimize Q/DQ Format Pass (#889) · 0b5f33b6
      turneram authored
      * Add operators, refactor parsers, add rewrite passes, add tests
      
      * Add ref implementations
      
      * Move broadcasting of scales and zero points to onnx parser
      
      * Allow for x and zero_point to have different types in quantizelinear; fix zero_point default type
      
      * Switch certain variables to int64_t
      
      * Fix overflow in implicit constant conversion
      
      * Remove operators.hpp from includes in tf_test.cpp
      
      * Add conversion for int32 input to quantizelinear and add test case; remove operators.hpp from onnx_test.cpp includes
      
      * Switch dequantizelinear math from int32 to float
      
      * Remove changes to operators.hpp
      
      * Simplify apply_quantizelinear
      
      * Add verify test for int32 data
      
      * Add rewrite_quantization back to CMakeLists
      
      * Add passes to insert qdq after add_bias is applied, replace quant_ops, and remove remaining qdq pairs
      
      * Renaming, refactoring, cleaning up code, adding formal test, and adding passes to targets
      
      * Renaming, review comments, begin adding more specific tests
      
      * Add more specific unit tests
      
      * Fix failing test on CI
      
      * Correct matcher and update qop rewriting, update tests and add more tests
      
      * Update matcher, clean up simplify_qdq, tweak tests
      
      * Add tests, remove pass from CPU target, update dot parameters, clean up simplify_qdq
      
      * Fix correctness bug in ref q/dq implementations; edit gemm parser to make beta always 0.0
      
      * Remove unused variables in onnx gemm tests
      0b5f33b6
    • turneram's avatar
  13. 10 Aug, 2021 1 commit
    • Paul Fultz II's avatar
      Add option to compile with hiprtc (#892) · 91c9ebbc
      Paul Fultz II authored
      * Add hiprtc compile option
      * Add cross compile test
      * Update error reporting
      * Add tests for errors and warnings
      * Fix tidy warning
      * Add comment to ifdefs
      * Skip null character at end of log
      * Assert there is null at the end
      91c9ebbc
  14. 09 Aug, 2021 1 commit
  15. 05 Aug, 2021 1 commit
    • Paul Fultz II's avatar
      Add gpu driver and improvements to pointwise codegen (#851) · 29fa2666
      Paul Fultz II authored
      
      
      * Add method to compile pointwise
      
      * Formatting
      
      * Add lambda
      
      * Add semicolon
      
      * Rename variable
      
      * Add driver to run jit kernels
      
      * Formatting
      
      * Add context
      
      * Formatting
      
      * Make seperate driver folder
      
      * Add more general gpu driver
      
      * Formatting
      
      * Print out wll time
      
      * Formatting
      
      * Run multiple times and skip first run
      
      * Formatting
      
      * Seperate time_op
      
      * Run an op for comparison
      
      * Formatting
      
      * Add debug asserts
      
      * Formatting
      
      * Change parameer name
      
      * Formatting
      
      * Fix argument order
      
      * Formatting
      
      * Add preloading
      
      * Formatting
      
      * Allow a different data type
      
      * Formatting
      
      * Pipeline transformations
      
      * Formatting
      
      * Add vectorization
      
      * Formatting
      
      * Reduce dims
      
      * Formatting
      
      * Compile with launch params as constant
      
      * Formatting
      
      * Make sure buffer can be vecotrized
      
      * Formatting
      
      * Enable vectorization and preloading
      
      * Formatting
      
      * Add print header
      
      * Formatting
      
      * Avoid allocating to large of LDS
      
      * Formatting
      
      * Add some vec functions to a seperate header
      
      * Formatting
      
      * Add stride loops
      
      * Formatting
      
      * Improve the transform pipeline
      
      * Formatting
      
      * Add const
      
      * Fix shape check
      
      * Formatting
      
      * Just check stride axis is zero
      
      * Remove extra finc_vector_axis overload
      
      * Simplify some mroe functions
      
      * Formatting
      
      * Remove some more extra functions
      
      * Formatting
      
      * Simplify more decltypes
      
      * Add another const
      
      * Fix test
      
      * Get buffer pointer different for older compilers
      Co-authored-by: default avatarShucai Xiao <shucai@gmail.com>
      Co-authored-by: default avatarChris Austen <causten@users.noreply.github.com>
      29fa2666
  16. 03 Aug, 2021 1 commit
  17. 28 Jul, 2021 1 commit
  18. 21 Jul, 2021 1 commit
  19. 17 Jul, 2021 1 commit
    • Umang Yadav's avatar
      Remove Alpha Beta from onnx gemm parsing (#874) · eacf042e
      Umang Yadav authored
      * gemm_test_workign
      
      clang_formatting
      
      tests passing
      
      clang formatting
      
      look for beta not equal to one
      
      * make_use of broadcastable_binary_op
      
      clang formatting
      
      * make use of common_op
      
      clang formatting
      
      * move transposes after multiplication
      
      clang formatting
      
      fix transpose
      
      formatting
      
      fix cpp check
      
      foramtting
      
      * fix parsing conditions and ci fails
      eacf042e
  20. 15 Jul, 2021 1 commit
    • turneram's avatar
      Quantize linear ops (#843) · 3282e01a
      turneram authored
      * Add operators, refactor parsers, add rewrite passes, add tests
      
      * Formatting
      
      * Fix cppcheck
      
      * Review comments
      
      * Formatting
      
      * Combine rewrite passes
      
      * Formatting
      
      * Add ref implementations
      
      * Formatting
      
      * Review comments
      
      * Formatting
      
      * Tidy warnings
      
      * Apply review comments
      
      * Formatting
      
      * Fix CI error
      
      * Formatting
      
      * Increase code coverage
      
      * Formatting
      
      * Move broadcasting of scales and zero points to onnx parser
      
      * Formatting
      
      * Allow for x and zero_point to have different types in quantizelinear; fix zero_point default type
      
      * Formatting
      
      * Increase code coverage
      
      * Formatting
      
      * Switch certain variables to int64_t
      
      * Formatting
      
      * Fix overflow in implicit constant conversion
      
      * Formatting
      
      * Increase code coverage
      
      * Formatting
      
      * Remove operators.hpp from includes in tf_test.cpp
      
      * Formatting
      
      * Add conversion for int32 input to quantizelinear and add test case; remove operators.hpp from onnx_test.cpp includes
      
      * Formatting
      
      * Switch dequantizelinear math from int32 to float
      
      * Formatting
      
      * Remove changes to operators.hpp
      
      * Simplify apply_quantizelinear
      
      * Formatting
      
      * Add verify test for int32 data
      
      * Add rewrite_quantization back to CMakeLists
      3282e01a
  21. 14 Jul, 2021 1 commit
  22. 13 Jul, 2021 1 commit
  23. 12 Jul, 2021 1 commit
  24. 09 Jul, 2021 2 commits
  25. 08 Jul, 2021 4 commits
  26. 06 Jul, 2021 1 commit
    • Paul Fultz II's avatar
      Update test driver to continue executing after exceptions and other failures (#868) · f60c3815
      Paul Fultz II authored
      
      
      * Improve handling of exceptions in test driver
      
      * Formatting
      
      * Auto print exception
      
      * Formatting
      
      * Fork each test case
      
      * Formatting
      
      * Exclude gcc 5 debug build
      
      * Fix tidy issues
      
      * Add color
      
      * Formatting
      
      * Create driver class
      
      * Formatting
      
      * Customize test_case names
      
      * Formatting
      
      * Report status from forked processes
      
      * Formatting
      
      * Update the verify driver
      
      * Formatting
      
      * Print out failed tests
      
      * Formatting
      
      * Fix tidy issues
      
      * Formatting
      
      * Expect passing
      
      * Improve failure reporting on non-linux systems
      
      * Fix ifdef
      
      * Flush code code cov
      
      * Formatting
      
      * Fix tidy
      
      * Check if weak symbols is linked
      
      * Formatting
      
      * Add continue flag
      
      * Formatting
      
      * Set exe name
      
      * Use stringstream and use quotes
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      f60c3815