1. 05 Nov, 2020 1 commit
    • Shucai Xiao's avatar
      add get_main_module api (#665) · 6554639b
      Shucai Xiao authored
      
      
      * add an api get_main_module
      
      * clang format
      
      * modify onnx unit test for module
      
      * clang format
      
      * refactor ops unit test with the get_main_module
      
      * clang format
      
      * code backup
      
      * clang format
      
      * refine module c api
      
      * add python api for module
      
      * clang format
      
      * fix a python api issue
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * refine unit tests changes
      
      * clang format
      
      * defer some changes to later PRs
      
      * change return of get_main_module from ref to pointer
      
      * clang format
      
      * add unit tests for the get_main_module_api
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * add more unit tests for more code change coverage
      
      * clang format
      
      * fix unit test
      
      * clang format
      
      * code change for more code coverage
      
      * Add option to no generate a destroy method
      
      * Formatting
      
      * fix some review comments
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * clang format
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      6554639b
  2. 04 Nov, 2020 1 commit
    • Paul Fultz II's avatar
      Split cpu and reference implementation (#671) · 500d9441
      Paul Fultz II authored
      
      
      * Add all_targets cmake target
      
      * Rename target
      
      * Add ref target
      
      * Rename tests
      
      * Refactor compiler target
      
      * Formatting
      
      * Verify for every target
      
      * Formatting
      
      * Add verify test suite
      
      * Formatting
      
      * Add initial test programs
      
      * Formatting
      
      * Add rnn tests
      
      * Formatting
      
      * Validate gpu
      
      * Formatting
      
      * Remove old gpu tests
      
      * Fix gpu tests
      
      * Fix ref error
      
      * Fix tidy issues
      
      * Formatting
      
      * Tidy fixes
      
      * Fix header in python api
      
      * Rename to ref
      
      * Use ref in verify_onnx
      
      * Fix tidy issue
      
      * Build with verbose on
      
      * Fix typo
      
      * Remove verbose
      
      * rename some cpu prefix to ref
      Co-authored-by: default avatarShucai Xiao <Shucai.Xiao@amd.com>
      500d9441
  3. 02 Nov, 2020 1 commit
  4. 29 Oct, 2020 1 commit
    • Shucai Xiao's avatar
      Resize op (#656) · 234cf5a4
      Shucai Xiao authored
      
      
      * code backup for upsample op
      
      * clang format
      
      * fixed a bug
      
      * fix a bug
      
      * clang format
      
      * add unit tests for upsample
      
      * clang format
      
      * code backup for resize
      
      * clang format
      
      * code backup
      
      * clang format
      
      * add unit test for resize operator
      
      * clang format
      
      * fix cppcheck error
      
      * fix cppcheck error
      
      * clang format
      
      * fix a typo
      
      * refine unit tests for better code coverage
      
      * remove unnecessary python binary file
      
      * fix review comments
      
      * clang format
      
      * change two functions to static
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      234cf5a4
  5. 28 Oct, 2020 1 commit
  6. 22 Oct, 2020 1 commit
  7. 19 Oct, 2020 1 commit
    • Shucai Xiao's avatar
      Op constructor c/python api (#636) · e939ddcd
      Shucai Xiao authored
      
      
      * code backup
      
      * clang format
      
      * add the python api for op construction
      
      * clang format
      
      * add converting a json string to a value and then pass to make_op
      
      * fix a build error
      
      * support kwargs decoder
      
      * clang format
      
      * code backup
      
      * clang format
      
      * temp code backup
      
      * clang format
      
      * temp code backup
      
      * convert input string to the json format
      
      * clang format
      
      * remove unnecessary code
      
      * fix review comments
      
      * clang format
      
      * add a comment
      
      * add unit tests
      
      * clang format
      
      * fix cppcheck error
      
      * fix review comments
      
      * clang format
      
      * rename two files
      
      * refine an error report
      
      * clang format
      
      * remove unnecessary code
      
      * fix a review comments
      
      * add convert_to_json unit tests
      
      * clang format
      
      * fix cppcheck error
      
      * fix cppcheck error
      
      * add more unit tests
      
      * clang format
      
      * fix clang tidy error
      
      * clang format
      
      * fixed an test failure
      
      * fix cppcheck error
      
      * fix a bug
      
      * fix cppcheck error
      
      * fix jenkins build error
      
      * fix a jenkin build error by wrap up string
      
      * clang format
      
      * another try
      
      * clang format
      
      * fix jenkins build error
      
      * clang format
      
      * remove unnecessary code
      
      * fix review comments
      
      * fix cppcheck error
      
      * fix review comments
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      e939ddcd
  8. 15 Oct, 2020 1 commit
    • turneram's avatar
      Added greater and less operators (#660) · 48ffbfa5
      turneram authored
      
      
      * Added greater and less operators
      
      * Fixed ops_test.cpp
      
      * Set commutative to false for less, greater
      
      * Refactored parse_equal/less/greater into parse_compare_op
      
      * Removed unnecessary function attributes() from greater.hpp/less.hpp
      
      * Added op_name arguments
      
      * Removed local settings
      
      * Formatting
      
      * Missing comma
      
      * Formatting
      
      * Formatting
      
      * Formatting
      
      * Formatting
      
      * Formatting
      
      * Missing space
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      48ffbfa5
  9. 09 Oct, 2020 1 commit
    • Paul Fultz II's avatar
      Add parallel stream analysis (#629) · 1d98fbb4
      Paul Fultz II authored
      * Add intial multi stream analysis
      
      * Formatting
      
      * Add more tests
      
      * Formatting
      
      * Remove comment
      
      * Analyze streams on the gpu
      
      * Formatting
      
      * Fix nstream
      
      * Formatting
      
      * Add test for return
      
      * Formatting
      
      * Make sure return has a stream assignment
      
      * Formatting
      
      * Fix asserts and checks
      
      * Improve error message for out-of-order sequence
      
      * Formatting
      1d98fbb4
  10. 08 Oct, 2020 2 commits
    • kahmed10's avatar
      Add build flag for fast math (#639) · a5065265
      kahmed10 authored
      
      
      * add flag
      
      * formatting
      
      * remove env variable
      
      * fix api expression
      
      * add api test
      
      * add api test
      
      * add op test
      
      * formatting
      
      * fix function name
      
      * fix syntax
      
      * formatting
      
      * modify test
      
      * remove test and update doc
      
      * move test to new file
      
      * formatting
      
      * revert test files
      
      * rewrite check
      
      * New
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      a5065265
    • Shucai Xiao's avatar
      Upsample op (#646) · 4789b387
      Shucai Xiao authored
      
      
      * code backup for upsample op
      
      * clang format
      
      * fixed a bug
      
      * fix a bug
      
      * clang format
      
      * add unit tests for upsample
      
      * clang format
      
      * clang format
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      4789b387
  11. 07 Oct, 2020 1 commit
  12. 30 Sep, 2020 1 commit
    • Paul Fultz II's avatar
      Add hip clang builds to jenkins (#651) · f28a62ea
      Paul Fultz II authored
      * Make global variables const
      
      * Tidy fixes
      
      * Disable some lints
      
      * Formatting
      
      * Fix tidy const
      
      * Formatting
      
      * Add missing const keywords
      
      * Formatting
      
      * More fixes
      
      * Fix remaining tidy issues
      
      * Formatting
      
      * Fix rocblas function call
      
      * Formatting
      
      * Fix nodiscard warnings
      
      * Formatting
      
      * Use named parameters
      
      * Remove overload
      
      * Add overload
      
      * Remove noncps
      
      * Use named param for node
      
      * Add auto register header
      
      * Use named parameters
      
      * Refactor jenkinsfile
      
      * Fix shadow
      
      * Add missing body variable
      
      * Add more const methods
      
      * Add hip-clang docker builds
      
      * Remove comments
      
      * Add clang-format
      
      * Add more const
      
      * Formatting
      
      * Rename stage
      
      * Disable check
      
      * Add another const
      
      * Add python 2 dev packages
      
      * Add sphinx to dockerfile
      f28a62ea
  13. 25 Sep, 2020 1 commit
  14. 21 Sep, 2020 3 commits
  15. 16 Sep, 2020 1 commit
  16. 14 Sep, 2020 2 commits
    • Shucai Xiao's avatar
      Where op (#630) · 4fdc4dfe
      Shucai Xiao authored
      
      
      * add the where operator
      
      * clang format
      
      * add where unit tests
      
      * add where op unit test
      
      * clang format
      
      * add more unit tests for the where op
      
      * clang format
      
      * Add support for constructing value from enum
      
      * Formatting
      
      * add an comment about the algorithm
      
      * call make_op to create the convert instruction
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      4fdc4dfe
    • Paul Fultz II's avatar
      Some perf improvements to bert (#627) · 9f283810
      Paul Fultz II authored
      
      
      * Fuse gemm in fuse ops
      
      * Formatting
      
      * Add const ref
      
      * Remove assert
      
      * Skip already fused gemms
      
      * Skip already fused gemm
      
      * Formatting
      
      * Use float_equal
      
      * Avoid non-standard shapes for inputs
      
      * Formatting
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      9f283810
  17. 11 Sep, 2020 1 commit
  18. 10 Sep, 2020 1 commit
    • Paul Fultz II's avatar
      Add load/save function for program (#623) · 63c5582a
      Paul Fultz II authored
      
      
      * Add save/load functions
      
      * Formatting
      
      * Add loading and saving to the driver
      
      * Formatting
      
      * Add return
      
      * Serialize the context with the program
      
      * Formatting
      
      * Add python API
      
      * Formatting
      
      * Add c/c++ apis
      
      * Formatting
      
      * Add tests
      
      * Formatting
      
      * Fix tidy error
      
      * Fix python doc
      
      * Restore python code
      
      * Add function name to errors
      
      * Formatting
      
      * Use lvalue for writing
      
      * Serialize context
      
      * Fix convolution and pooling operator for miopen
      
      * Formatting
      
      * Add const ref
      
      * Set target name to gpu
      
      * Add target tests
      
      * Formatting
      
      * Move register target to cpp file
      
      * Fix target test
      
      * Use make_target in driver
      
      * Formatting
      
      * Use make_target for the API
      
      * Formatting
      
      * Add cpu include
      
      * Increase timeout
      
      * Add more tests
      
      * Formatting
      Co-authored-by: default avatarShucai Xiao <shucai.xiao@amd.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      63c5582a
  19. 09 Sep, 2020 1 commit
    • Shucai Xiao's avatar
      Dropout change for two outputs (#626) · e67aa78c
      Shucai Xiao authored
      
      
      * add support for latest dropout version
      
      * clang format
      
      * fix a build error
      
      * fix a cppcheck error
      
      * add bool type
      
      * code backup
      
      * code backup
      
      * clang format
      
      * fix build warnings
      
      * clang format
      
      * add the equal operator
      
      * add the equal operator
      
      * clang format
      
      * remove unnecessary code
      
      * refine unit tests
      
      * clang format
      
      * fix review comments and a bug
      
      * clang format
      
      * additional changes
      
      * clang format
      
      * remove unnecessary changes
      
      * remove unnecessary changes
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      e67aa78c
  20. 01 Sep, 2020 1 commit
  21. 31 Aug, 2020 3 commits
  22. 28 Aug, 2020 1 commit
    • Shucai Xiao's avatar
      Onnx datatype parsing (#618) · 2433f9de
      Shucai Xiao authored
      * add bool type
      
      * code backup
      
      * code backup
      
      * clang format
      
      * fix build warnings
      
      * clang format
      
      * add the equal operator
      
      * add the equal operator
      
      * clang format
      
      * remove unnecessary code
      
      * refine unit tests
      
      * clang format
      
      * fix review comments and a bug
      
      * clang format
      
      * additional changes
      
      * clang format
      
      * fix cppcheck error
      
      * add bool type in c api
      
      * fix cppcheck error
      
      * fix review comments
      
      * fix cppcheck error
      
      * fix a build error related to gcc
      
      * fix cppcheck error
      
      * fix cppcheck error
      
      * added the equal operator to register list
      
      * add parsing boolean type
      
      * clang format
      
      * fix bool type issue for python output
      
      * clang format
      
      * add support for automatic multibroadcast of the equal operator
      
      * additional unit tests for more code coverage
      
      * clang format
      
      * missing an onnx file
      
      * code cleanup
      
      * clang format
      
      * fix issue of int64 data type in python api
      
      * clang format
      
      * include more onnx backend unit tests
      
      * add more unit tests to cover code changes
      
      * clang format
      
      * disable python tests for python2.7
      
      * code backup for python half type support
      
      * clang format
      
      * code change to support half data type in python
      
      * add a unit test for python half type
      
      * clang format
      
      * add more unit tests for more code coverage
      
      * clang format
      
      * add more unit tests for more code coverage
      
      * clang format
      
      * refine unit tests to have more code coverage
      
      * clang format
      
      * update python unit test
      
      * revert to a previous version of test_gpu.py
      2433f9de
  23. 27 Aug, 2020 2 commits
    • Shucai Xiao's avatar
      Context serialization (#607) · 6e1f9f20
      Shucai Xiao authored
      
      
      * Add initial serialization
      
      * Formatting
      
      * Add unit tests
      
      * Formatting
      
      * Add tests for serialization
      
      * Formatting
      
      * Use or not and
      
      * Add value test
      
      * Formatting
      
      * Add more tests
      
      * Add shape serialization
      
      * Formatting
      
      * Add serializtion for literal and argument
      
      * Formatting
      
      * Add from and to value to operatation
      
      * Formatting
      
      * Serialize empty types
      
      * Formatting
      
      * Tidy fixes
      
      * Formatting
      
      * Fix tidy issues
      
      * Formatting
      
      * Reformat value type macro
      
      * Formatting
      
      * Handle enum types
      
      * Formatting
      
      * Use const ref
      
      * Update
      
      * Add tests for to_value/from_value
      
      * Formatting
      
      * code backup
      
      * clang format
      
      * code backup
      
      * clang format
      
      * code backup
      
      * clang format
      
      * remove the from/to_value method for the generate context struct
      
      * clang format
      
      * code backup
      
      * Dont print literal data in hip_copy_literal
      
      * clang format
      
      * add unit test to have better coverage
      
      * remove unnecessary code
      
      * remove unnecessary code
      
      * fix review comments
      
      * clang format
      
      * fix review comments
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      6e1f9f20
    • Shucai Xiao's avatar
      Bool type and equal operator (#603) · 59b80d4e
      Shucai Xiao authored
      
      
      * add bool type
      
      * code backup
      
      * code backup
      
      * clang format
      
      * fix build warnings
      
      * clang format
      
      * add the equal operator
      
      * add the equal operator
      
      * clang format
      
      * remove unnecessary code
      
      * refine unit tests
      
      * clang format
      
      * fix review comments and a bug
      
      * clang format
      
      * additional changes
      
      * clang format
      
      * fix cppcheck error
      
      * add bool type in c api
      
      * fix cppcheck error
      
      * fix review comments
      
      * fix cppcheck error
      
      * fix a build error related to gcc
      
      * fix cppcheck error
      
      * fix cppcheck error
      
      * added the equal operator to register list
      
      * add parsing boolean type
      
      * clang format
      
      * fix bool type issue for python output
      
      * clang format
      
      * add support for automatic multibroadcast of the equal operator
      
      * additional unit tests for more code coverage
      
      * clang format
      
      * missing an onnx file
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      59b80d4e
  24. 26 Aug, 2020 1 commit
    • Paul Fultz II's avatar
      Add C++ ability to construct operators by name (#616) · 002eb4e2
      Paul Fultz II authored
      
      
      * Add make_op function
      
      * Formatting
      
      * Add more values
      
      * Formatting
      
      * Remove templates parse_conv functions
      
      * Formatting
      
      * Remove mat_mul template
      
      * Formatting
      
      * Reduce header includes
      
      * Fix compiling for gpu
      
      * Formatting
      
      * Use make_op in lowering
      
      * Formatting
      
      * Sort lines
      
      * Formatting
      
      * Add more tests
      
      * Formatting
      
      * Fix tidy error
      
      * Formatting
      
      * Add const refs
      
      * Add explicit this
      
      * Add more const refs
      
      * Sort the program
      
      * Remove commented out code
      
      * Formatting
      
      * Infer gpu prefix
      
      * Formatting
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      002eb4e2
  25. 25 Aug, 2020 1 commit
    • Paul Fultz II's avatar
      Improve layernorm performance (#613) · 56b3bf58
      Paul Fultz II authored
      * Use increment instead of division to compute register offset
      
      * Formatting
      
      * Limit layernorm to 1024 elements
      
      * Formatting
      
      * Add verification to driver
      
      * Formatting
      
      * Remove early return
      
      * Use block_size 256
      
      * Vectorize the kernel
      
      * Formatting
      
      * Convert to vector type
      
      * Add layernorm tests
      
      * Formatting
      
      * Formatting
      
      * Refactor layernorm to run both algos
      
      * Formatting
      
      * Fix compile error
      
      * Fix tidy warnings
      
      * Formatting
      
      * Add layernorm function
      
      * Formatting
      56b3bf58
  26. 21 Aug, 2020 1 commit
  27. 20 Aug, 2020 1 commit
  28. 19 Aug, 2020 2 commits
    • Paul Fultz II's avatar
      Add to_value/from_value to operation class (#605) · d1caaaa1
      Paul Fultz II authored
      * Add initial serialization
      
      * Formatting
      
      * Add unit tests
      
      * Formatting
      
      * Add tests for serialization
      
      * Formatting
      
      * Use or not and
      
      * Add value test
      
      * Formatting
      
      * Add more tests
      
      * Add shape serialization
      
      * Formatting
      
      * Add serializtion for literal and argument
      
      * Formatting
      
      * Add from and to value to operatation
      
      * Formatting
      
      * Serialize empty types
      
      * Formatting
      
      * Tidy fixes
      
      * Formatting
      
      * Fix tidy issues
      
      * Formatting
      
      * Reformat value type macro
      
      * Formatting
      
      * Handle enum types
      
      * Formatting
      
      * Use const ref
      
      * Update
      
      * Add tests for to_value/from_value
      
      * Formatting
      
      * Add more tests
      
      * Rewrite test to avoid redundant assignment
      d1caaaa1
    • Shucai Xiao's avatar
      move init miopen fusion operator to finalize method (#606) · 453517ad
      Shucai Xiao authored
      * move initialization of miopen fusion operators to finalize method
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * removed an unnecessary assert
      453517ad
  29. 18 Aug, 2020 1 commit
    • Paul Fultz II's avatar
      Register all operators in migraphx (#604) · e8be8548
      Paul Fultz II authored
      * Register ops for main migraphx
      
      * Formatting
      
      * Register cpu ops
      
      * Formatting
      
      * Show list of operators in the driver
      
      * Formatting
      
      * Simplify regiter
      
      * Try to register gpu ops
      
      * Fix compiler errors
      
      * Register rest of the gpu operators
      
      * Add some tests
      
      * Formatting
      
      * Fix gcc compiler warnings
      
      * Formatting
      
      * Fix tidy warnings
      
      * Fix compile error
      
      * Use correct op name
      
      * Register layer norm
      
      * Use const ref
      
      * Make run const
      e8be8548
  30. 14 Aug, 2020 1 commit
    • kahmed10's avatar
      Layernorm onnx support (#599) · 2c5d5fee
      kahmed10 authored
      
      
      * fix pad calc
      
      * bert tf passes correctness
      
      * formatting
      
      * add test
      
      * formatting
      
      * remove comment
      
      * add inline
      
      * formatting
      
      * fix order for literal
      
      * formatting
      
      * test no mul_add
      
      * formatting
      
      * debug layernorm
      
      * debug layernorm
      
      * manual merge
      
      * more progress
      
      * formatting
      
      * remove miopen batchnorm
      
      * remove headers
      
      * Fix compile error with no dpp reductions
      
      * fix indices
      
      * formatting
      
      * change matcher
      
      * formatting
      
      * remove binds
      
      * formatting
      
      * disable tf matcher
      
      * formatting
      
      * use fast div
      
      * formatting
      
      * fix matcher
      
      * formatting
      
      * remove comment
      
      * move find_matches
      
      * add assert
      
      * formatting
      
      * fix deepcode issue
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      Co-authored-by: default avatarShucai Xiao <shucai.xiao@amd.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      2c5d5fee
  31. 13 Aug, 2020 2 commits
    • Shucai Xiao's avatar
      integrate onnx backend test suit to migraphx (#574) · d612e976
      Shucai Xiao authored
      
      
      * initial progress
      
      * formatting
      
      * add pooling changes
      
      * formatting
      
      * change eliminate_pad
      
      * formatting
      
      * rename var
      
      * fomratting
      
      * update op shape test and compute
      
      * formatting
      
      * revert conv constructor
      
      * formatting
      
      * change initializer
      
      * formatting
      
      * fix tidy
      
      * change quant conv and shape check
      
      * add tests and fixes
      
      * formatting
      
      * fix type
      
      * fix conv test
      
      * formatting
      
      * add pooling and bn tests
      
      * formatting
      
      * add inconsistent attr tests
      
      * fix padding issue
      
      * formatting
      
      * progress on 1d to 2d
      
      * formatting
      
      * change compute and compile functions
      
      * formatting
      
      * fix duplicate
      
      * fix conflict
      
      * fix issue with 1d conv
      
      * formatting
      
      * add check for 3d limit
      
      * rename function
      
      * formatting
      
      * update to MIOPen 2.3
      
      * add support for nd pooling
      
      * formatting
      
      * test miopen 2.4
      
      * change function name
      
      * rename functions
      
      * formatting
      
      * add op_shape test
      
      * add gpu ops tests
      
      * formatting
      
      * initial progress
      
      * formatting
      
      * add pkg-config
      
      * add to support asymmetric padding of averagepool
      
      * clang format
      
      * fix bug for average pooling
      
      * clang format
      
      * fix a bug
      
      * add unit tests for the asymmetric padding of averagepool
      
      * clang format
      
      * change functions
      
      * formatting
      
      * additional code refinement
      
      * clang format
      
      * check existing tests
      
      * formatting
      
      * change to copy_backward
      
      * formatting
      
      * change for loop to transform
      
      * formatting
      
      * add tests
      
      * formatting
      
      * remove comment
      
      * add more tests
      
      * remove an optimization for pooling
      
      * clang format
      
      * add and fix unit tests
      
      * clang format
      
      * update gpu miopen calls
      
      * formatting
      
      * initial progress
      
      * add cpu impl and tests
      
      * formatting
      
      * add NOLINT
      
      * add 3d test
      
      * formatting
      
      * add more op_shape tests
      
      * test diff miopen version
      
      * add submodule onnx
      
      * add pooling shape tests
      
      * fix error msg
      
      * add onnx_test_backend
      
      * reorganize python code
      
      * temp disable test
      
      * fix cppcheck error
      
      * fix cppcheck error
      
      * code backup
      
      * add support device choice
      
      * refine onnx backend test
      
      * revert to miopen 2.4
      
      * fix review comments
      
      * fix review comments
      
      * clang format
      
      * fixed review comments
      
      * clang format
      
      * fix cppcheck error
      
      * copy onnx_backend_test to dest when building
      
      * add testdata folder
      
      * fix bounds
      
      * formatting
      
      * code backup
      
      * code backup
      
      * remove unnecessary file
      
      * fix various bugs
      
      * remove unnecessary changes
      
      * remove unnecessary submodule
      
      * remove unnecessary lines
      
      * fix algorithm
      
      * formatting
      
      * refine onnx backend unit tests
      
      * pin numpy version
      
      * fix build issue
      
      * fixed a filename to be copied
      
      * add the onnx dependency in docker image
      
      * ensure results are copied back correctly
      
      * specify onnx version
      
      * update excluded tests
      
      * remove unnecessary log info
      
      * turn on more unit tests
      
      * restrict onnx backend test to python 3.x
      
      * clang format
      
      * refine retrieving the input parameters
      
      * clang format
      
      * fix program input parameter names
      
      * clang format
      
      * avoid running onnx test in python 2.x
      
      * fix cppcheck error
      
      * fix python2.7 backend unit tests error
      
      * clang format
      
      * resolve the issue of ensure data copy to be completed
      
      * clang format
      
      * fix review comments
      
      * fix onnx backend unit test error
      
      * another change to make onnx backend test pass
      
      * clang format
      
      * fix onnx backend test error
      
      * clang format
      
      * disable onnx backend test to try
      
      * build try
      
      * update Dockerfile to try onnx backend test
      
      * remove unnecessary code
      
      * fix a bug in copying program
      
      * clang format
      
      * update dockerfile to include onnx
      
      * fix review comments
      
      * add the pytest module to the container
      
      * exclude real model to avoid to be downloaded
      
      * resolve the sync device for data copy from gpu to cpu
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * move sync_device after memory_coloring
      Co-authored-by: default avatarKhalique <15948690+kahmed10@users.noreply.github.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      d612e976
    • Shucai Xiao's avatar
      Move alloc copy to finalize (#602) · 0a347dff
      Shucai Xiao authored
      
      
      * code backup
      
      * code backup
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      0a347dff