1. 08 Dec, 2020 1 commit
    • Paul Fultz II's avatar
      Refactor to use make_op almost everywhere (#696) · 8d21fdc9
      Paul Fultz II authored
      * Load op when serializing
      
      * Formatting
      
      * Add missing clip field
      
      * Use make_op almost everywhere
      
      * Formatting
      
      * More make ops for rnns
      
      * Get rid of spaces
      
      * Formatting
      
      * Remove operators headers
      
      * Formatting
      
      * Remove unused op headers
      
      * Increase line threshold
      8d21fdc9
  2. 03 Dec, 2020 2 commits
  3. 02 Dec, 2020 1 commit
  4. 26 Nov, 2020 1 commit
  5. 25 Nov, 2020 1 commit
    • Paul Fultz II's avatar
      Fix issues with python 'ImportError' (#690) · 1c417ae7
      Paul Fultz II authored
      * Fix issue with module wrapper
      
      * Formatting
      
      * Remove custom build of python 3.6
      
      * Fix print function in tests
      
      * Formatting
      
      * Install onnx
      
      * Download the models
      
      * Add gcc 5
      
      * Boost time limit for now
      
      * Try to use oauth instead of app
      
      * Disable all warnings
      1c417ae7
  6. 20 Nov, 2020 1 commit
    • Paul Fultz II's avatar
      Fuse skip layernorm (#683) · 1bfb147d
      Paul Fultz II authored
      
      
      * Unify the vectorized and non-vectorized path
      
      * Formatting
      
      * Make fusion easily extendable
      
      * Add skip layernorm fusion
      
      * Formatting
      
      * Call correct layernorm function
      
      * Fix compile errors
      
      * Add DCE
      
      * Add test for skip layernorm
      
      * Formatting
      
      * Remove unused typedef
      
      * Formatting
      
      * Fix tidy issues
      
      * Formatting
      Co-authored-by: default avatarShucai Xiao <shucai.xiao@amd.com>
      1bfb147d
  7. 19 Nov, 2020 1 commit
  8. 18 Nov, 2020 1 commit
  9. 16 Nov, 2020 2 commits
    • Shucai Xiao's avatar
      Normalize ops (#667) · 8443ecd1
      Shucai Xiao authored
      
      
      * add a pass to normalize ops
      
      * clang format
      
      * add unit tests
      
      * clang format
      
      * code backup
      
      * clang format
      
      * code backup
      
      * clang format
      
      * add support for slice in the normalize_op function
      
      * clang format
      
      * add operation method api for whether we need to call normalize_op
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * rename a function namejJ
      
      * clang format
      
      * change compute_shape to normalize_compute_shape for corresponding operators
      
      * clang format
      
      * remove unnecessary code
      
      * fix various issues
      
      * clang format
      
      * add attributes to operators having axis attributes
      
      * clang format
      
      * fixed jenkins build error
      
      * clang format
      
      * fix a bug related to slice
      
      * clang format
      
      * code backup
      
      * clang format
      
      * code backup
      
      * clang format
      
      * rename a file
      
      * fix cppcheck error
      
      * some code refinement
      
      * clang format
      
      * change attributes to enum
      
      * clang format
      
      * refine the enum
      
      * clang format
      
      * remove unnecessary code
      
      * add unit tests for more code coverage and fixed a bug
      
      * clang format
      
      * remove unnecessary changes
      
      * change normalize_axes to normalize
      
      * clang format
      
      * revert back the changes in broadcast.hpp
      
      * rename normalize_axes to normalize
      
      * fix review comments
      
      * clang format
      
      * Add flag to enable cpu backend
      
      * Make buffers shared
      
      * Enable optimizations
      
      * Formatting
      
      * Try to avoid ambiguous assign in value class
      
      * fixed a build error
      
      * clang format
      
      * add the normalize_ops pass to the ref target
      
      * refactor program to module to normalize_ops pass
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      8443ecd1
    • kahmed10's avatar
      Extra tf ops (#679) · f8b56a66
      kahmed10 authored
      * initial testing
      
      * add new ops
      
      * remove comment
      
      * remove extra changes
      
      * remove extra changes
      
      * add tests
      
      * formatting
      
      * add tests
      f8b56a66
  10. 11 Nov, 2020 2 commits
  11. 10 Nov, 2020 1 commit
  12. 09 Nov, 2020 1 commit
    • Paul Fultz II's avatar
      Add hip compilation (#664) · f71af72a
      Paul Fultz II authored
      
      
      * Add compiler flags
      
      * Add missing include
      
      * Add filesystem header
      
      * Formatting
      
      * Add tmp_dir to run
      
      * Formatting
      
      * Kernel compilation and launching
      
      * Formatting
      
      * Seperate pack_args
      
      * Formatting
      
      * Add alignment tests
      
      * Formatting
      
      * Add compile test
      
      * Formatting
      
      * Complete compile test
      
      * Formatting
      
      * Use is_regular_file free function
      
      * Fix is_regular_file call
      
      * Fix tidy issues
      
      * Fix tidy
      
      * Fix tidy issue
      
      * Print size in read_buffer to debug issue on jenkins
      
      * Add hip flags before src file
      
      * Fix reading output files
      
      * Fix unsued variable warning
      
      * Formatting
      
      * Formatting
      
      * Disable tidy check
      Co-authored-by: default avatarShucai Xiao <shucai.xiao@amd.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      f71af72a
  13. 05 Nov, 2020 1 commit
    • Shucai Xiao's avatar
      add get_main_module api (#665) · 6554639b
      Shucai Xiao authored
      
      
      * add an api get_main_module
      
      * clang format
      
      * modify onnx unit test for module
      
      * clang format
      
      * refactor ops unit test with the get_main_module
      
      * clang format
      
      * code backup
      
      * clang format
      
      * refine module c api
      
      * add python api for module
      
      * clang format
      
      * fix a python api issue
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * refine unit tests changes
      
      * clang format
      
      * defer some changes to later PRs
      
      * change return of get_main_module from ref to pointer
      
      * clang format
      
      * add unit tests for the get_main_module_api
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * add more unit tests for more code change coverage
      
      * clang format
      
      * fix unit test
      
      * clang format
      
      * code change for more code coverage
      
      * Add option to no generate a destroy method
      
      * Formatting
      
      * fix some review comments
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * clang format
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      6554639b
  14. 04 Nov, 2020 1 commit
    • Paul Fultz II's avatar
      Split cpu and reference implementation (#671) · 500d9441
      Paul Fultz II authored
      
      
      * Add all_targets cmake target
      
      * Rename target
      
      * Add ref target
      
      * Rename tests
      
      * Refactor compiler target
      
      * Formatting
      
      * Verify for every target
      
      * Formatting
      
      * Add verify test suite
      
      * Formatting
      
      * Add initial test programs
      
      * Formatting
      
      * Add rnn tests
      
      * Formatting
      
      * Validate gpu
      
      * Formatting
      
      * Remove old gpu tests
      
      * Fix gpu tests
      
      * Fix ref error
      
      * Fix tidy issues
      
      * Formatting
      
      * Tidy fixes
      
      * Fix header in python api
      
      * Rename to ref
      
      * Use ref in verify_onnx
      
      * Fix tidy issue
      
      * Build with verbose on
      
      * Fix typo
      
      * Remove verbose
      
      * rename some cpu prefix to ref
      Co-authored-by: default avatarShucai Xiao <Shucai.Xiao@amd.com>
      500d9441
  15. 02 Nov, 2020 1 commit
  16. 29 Oct, 2020 1 commit
    • Shucai Xiao's avatar
      Resize op (#656) · 234cf5a4
      Shucai Xiao authored
      
      
      * code backup for upsample op
      
      * clang format
      
      * fixed a bug
      
      * fix a bug
      
      * clang format
      
      * add unit tests for upsample
      
      * clang format
      
      * code backup for resize
      
      * clang format
      
      * code backup
      
      * clang format
      
      * add unit test for resize operator
      
      * clang format
      
      * fix cppcheck error
      
      * fix cppcheck error
      
      * clang format
      
      * fix a typo
      
      * refine unit tests for better code coverage
      
      * remove unnecessary python binary file
      
      * fix review comments
      
      * clang format
      
      * change two functions to static
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      234cf5a4
  17. 28 Oct, 2020 1 commit
  18. 22 Oct, 2020 1 commit
  19. 19 Oct, 2020 1 commit
    • Shucai Xiao's avatar
      Op constructor c/python api (#636) · e939ddcd
      Shucai Xiao authored
      
      
      * code backup
      
      * clang format
      
      * add the python api for op construction
      
      * clang format
      
      * add converting a json string to a value and then pass to make_op
      
      * fix a build error
      
      * support kwargs decoder
      
      * clang format
      
      * code backup
      
      * clang format
      
      * temp code backup
      
      * clang format
      
      * temp code backup
      
      * convert input string to the json format
      
      * clang format
      
      * remove unnecessary code
      
      * fix review comments
      
      * clang format
      
      * add a comment
      
      * add unit tests
      
      * clang format
      
      * fix cppcheck error
      
      * fix review comments
      
      * clang format
      
      * rename two files
      
      * refine an error report
      
      * clang format
      
      * remove unnecessary code
      
      * fix a review comments
      
      * add convert_to_json unit tests
      
      * clang format
      
      * fix cppcheck error
      
      * fix cppcheck error
      
      * add more unit tests
      
      * clang format
      
      * fix clang tidy error
      
      * clang format
      
      * fixed an test failure
      
      * fix cppcheck error
      
      * fix a bug
      
      * fix cppcheck error
      
      * fix jenkins build error
      
      * fix a jenkin build error by wrap up string
      
      * clang format
      
      * another try
      
      * clang format
      
      * fix jenkins build error
      
      * clang format
      
      * remove unnecessary code
      
      * fix review comments
      
      * fix cppcheck error
      
      * fix review comments
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      e939ddcd
  20. 15 Oct, 2020 1 commit
    • turneram's avatar
      Added greater and less operators (#660) · 48ffbfa5
      turneram authored
      
      
      * Added greater and less operators
      
      * Fixed ops_test.cpp
      
      * Set commutative to false for less, greater
      
      * Refactored parse_equal/less/greater into parse_compare_op
      
      * Removed unnecessary function attributes() from greater.hpp/less.hpp
      
      * Added op_name arguments
      
      * Removed local settings
      
      * Formatting
      
      * Missing comma
      
      * Formatting
      
      * Formatting
      
      * Formatting
      
      * Formatting
      
      * Formatting
      
      * Missing space
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      48ffbfa5
  21. 09 Oct, 2020 1 commit
    • Paul Fultz II's avatar
      Add parallel stream analysis (#629) · 1d98fbb4
      Paul Fultz II authored
      * Add intial multi stream analysis
      
      * Formatting
      
      * Add more tests
      
      * Formatting
      
      * Remove comment
      
      * Analyze streams on the gpu
      
      * Formatting
      
      * Fix nstream
      
      * Formatting
      
      * Add test for return
      
      * Formatting
      
      * Make sure return has a stream assignment
      
      * Formatting
      
      * Fix asserts and checks
      
      * Improve error message for out-of-order sequence
      
      * Formatting
      1d98fbb4
  22. 08 Oct, 2020 2 commits
    • kahmed10's avatar
      Add build flag for fast math (#639) · a5065265
      kahmed10 authored
      
      
      * add flag
      
      * formatting
      
      * remove env variable
      
      * fix api expression
      
      * add api test
      
      * add api test
      
      * add op test
      
      * formatting
      
      * fix function name
      
      * fix syntax
      
      * formatting
      
      * modify test
      
      * remove test and update doc
      
      * move test to new file
      
      * formatting
      
      * revert test files
      
      * rewrite check
      
      * New
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      a5065265
    • Shucai Xiao's avatar
      Upsample op (#646) · 4789b387
      Shucai Xiao authored
      
      
      * code backup for upsample op
      
      * clang format
      
      * fixed a bug
      
      * fix a bug
      
      * clang format
      
      * add unit tests for upsample
      
      * clang format
      
      * clang format
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      4789b387
  23. 07 Oct, 2020 1 commit
  24. 30 Sep, 2020 1 commit
    • Paul Fultz II's avatar
      Add hip clang builds to jenkins (#651) · f28a62ea
      Paul Fultz II authored
      * Make global variables const
      
      * Tidy fixes
      
      * Disable some lints
      
      * Formatting
      
      * Fix tidy const
      
      * Formatting
      
      * Add missing const keywords
      
      * Formatting
      
      * More fixes
      
      * Fix remaining tidy issues
      
      * Formatting
      
      * Fix rocblas function call
      
      * Formatting
      
      * Fix nodiscard warnings
      
      * Formatting
      
      * Use named parameters
      
      * Remove overload
      
      * Add overload
      
      * Remove noncps
      
      * Use named param for node
      
      * Add auto register header
      
      * Use named parameters
      
      * Refactor jenkinsfile
      
      * Fix shadow
      
      * Add missing body variable
      
      * Add more const methods
      
      * Add hip-clang docker builds
      
      * Remove comments
      
      * Add clang-format
      
      * Add more const
      
      * Formatting
      
      * Rename stage
      
      * Disable check
      
      * Add another const
      
      * Add python 2 dev packages
      
      * Add sphinx to dockerfile
      f28a62ea
  25. 25 Sep, 2020 1 commit
  26. 21 Sep, 2020 3 commits
  27. 16 Sep, 2020 1 commit
  28. 14 Sep, 2020 2 commits
    • Shucai Xiao's avatar
      Where op (#630) · 4fdc4dfe
      Shucai Xiao authored
      
      
      * add the where operator
      
      * clang format
      
      * add where unit tests
      
      * add where op unit test
      
      * clang format
      
      * add more unit tests for the where op
      
      * clang format
      
      * Add support for constructing value from enum
      
      * Formatting
      
      * add an comment about the algorithm
      
      * call make_op to create the convert instruction
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      4fdc4dfe
    • Paul Fultz II's avatar
      Some perf improvements to bert (#627) · 9f283810
      Paul Fultz II authored
      
      
      * Fuse gemm in fuse ops
      
      * Formatting
      
      * Add const ref
      
      * Remove assert
      
      * Skip already fused gemms
      
      * Skip already fused gemm
      
      * Formatting
      
      * Use float_equal
      
      * Avoid non-standard shapes for inputs
      
      * Formatting
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      9f283810
  29. 11 Sep, 2020 1 commit
  30. 10 Sep, 2020 1 commit
    • Paul Fultz II's avatar
      Add load/save function for program (#623) · 63c5582a
      Paul Fultz II authored
      
      
      * Add save/load functions
      
      * Formatting
      
      * Add loading and saving to the driver
      
      * Formatting
      
      * Add return
      
      * Serialize the context with the program
      
      * Formatting
      
      * Add python API
      
      * Formatting
      
      * Add c/c++ apis
      
      * Formatting
      
      * Add tests
      
      * Formatting
      
      * Fix tidy error
      
      * Fix python doc
      
      * Restore python code
      
      * Add function name to errors
      
      * Formatting
      
      * Use lvalue for writing
      
      * Serialize context
      
      * Fix convolution and pooling operator for miopen
      
      * Formatting
      
      * Add const ref
      
      * Set target name to gpu
      
      * Add target tests
      
      * Formatting
      
      * Move register target to cpp file
      
      * Fix target test
      
      * Use make_target in driver
      
      * Formatting
      
      * Use make_target for the API
      
      * Formatting
      
      * Add cpu include
      
      * Increase timeout
      
      * Add more tests
      
      * Formatting
      Co-authored-by: default avatarShucai Xiao <shucai.xiao@amd.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      63c5582a
  31. 09 Sep, 2020 1 commit
    • Shucai Xiao's avatar
      Dropout change for two outputs (#626) · e67aa78c
      Shucai Xiao authored
      
      
      * add support for latest dropout version
      
      * clang format
      
      * fix a build error
      
      * fix a cppcheck error
      
      * add bool type
      
      * code backup
      
      * code backup
      
      * clang format
      
      * fix build warnings
      
      * clang format
      
      * add the equal operator
      
      * add the equal operator
      
      * clang format
      
      * remove unnecessary code
      
      * refine unit tests
      
      * clang format
      
      * fix review comments and a bug
      
      * clang format
      
      * additional changes
      
      * clang format
      
      * remove unnecessary changes
      
      * remove unnecessary changes
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      e67aa78c
  32. 01 Sep, 2020 1 commit
  33. 31 Aug, 2020 1 commit