1. 25 Aug, 2020 1 commit
    • Paul Fultz II's avatar
      Improve layernorm performance (#613) · 56b3bf58
      Paul Fultz II authored
      * Use increment instead of division to compute register offset
      
      * Formatting
      
      * Limit layernorm to 1024 elements
      
      * Formatting
      
      * Add verification to driver
      
      * Formatting
      
      * Remove early return
      
      * Use block_size 256
      
      * Vectorize the kernel
      
      * Formatting
      
      * Convert to vector type
      
      * Add layernorm tests
      
      * Formatting
      
      * Formatting
      
      * Refactor layernorm to run both algos
      
      * Formatting
      
      * Fix compile error
      
      * Fix tidy warnings
      
      * Formatting
      
      * Add layernorm function
      
      * Formatting
      56b3bf58
  2. 21 Aug, 2020 1 commit
  3. 20 Aug, 2020 1 commit
  4. 19 Aug, 2020 1 commit
  5. 18 Aug, 2020 1 commit
    • Paul Fultz II's avatar
      Register all operators in migraphx (#604) · e8be8548
      Paul Fultz II authored
      * Register ops for main migraphx
      
      * Formatting
      
      * Register cpu ops
      
      * Formatting
      
      * Show list of operators in the driver
      
      * Formatting
      
      * Simplify regiter
      
      * Try to register gpu ops
      
      * Fix compiler errors
      
      * Register rest of the gpu operators
      
      * Add some tests
      
      * Formatting
      
      * Fix gcc compiler warnings
      
      * Formatting
      
      * Fix tidy warnings
      
      * Fix compile error
      
      * Use correct op name
      
      * Register layer norm
      
      * Use const ref
      
      * Make run const
      e8be8548
  6. 14 Aug, 2020 1 commit
    • kahmed10's avatar
      Layernorm onnx support (#599) · 2c5d5fee
      kahmed10 authored
      
      
      * fix pad calc
      
      * bert tf passes correctness
      
      * formatting
      
      * add test
      
      * formatting
      
      * remove comment
      
      * add inline
      
      * formatting
      
      * fix order for literal
      
      * formatting
      
      * test no mul_add
      
      * formatting
      
      * debug layernorm
      
      * debug layernorm
      
      * manual merge
      
      * more progress
      
      * formatting
      
      * remove miopen batchnorm
      
      * remove headers
      
      * Fix compile error with no dpp reductions
      
      * fix indices
      
      * formatting
      
      * change matcher
      
      * formatting
      
      * remove binds
      
      * formatting
      
      * disable tf matcher
      
      * formatting
      
      * use fast div
      
      * formatting
      
      * fix matcher
      
      * formatting
      
      * remove comment
      
      * move find_matches
      
      * add assert
      
      * formatting
      
      * fix deepcode issue
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      Co-authored-by: default avatarShucai Xiao <shucai.xiao@amd.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      2c5d5fee
  7. 13 Aug, 2020 2 commits
    • Shucai Xiao's avatar
      integrate onnx backend test suit to migraphx (#574) · d612e976
      Shucai Xiao authored
      
      
      * initial progress
      
      * formatting
      
      * add pooling changes
      
      * formatting
      
      * change eliminate_pad
      
      * formatting
      
      * rename var
      
      * fomratting
      
      * update op shape test and compute
      
      * formatting
      
      * revert conv constructor
      
      * formatting
      
      * change initializer
      
      * formatting
      
      * fix tidy
      
      * change quant conv and shape check
      
      * add tests and fixes
      
      * formatting
      
      * fix type
      
      * fix conv test
      
      * formatting
      
      * add pooling and bn tests
      
      * formatting
      
      * add inconsistent attr tests
      
      * fix padding issue
      
      * formatting
      
      * progress on 1d to 2d
      
      * formatting
      
      * change compute and compile functions
      
      * formatting
      
      * fix duplicate
      
      * fix conflict
      
      * fix issue with 1d conv
      
      * formatting
      
      * add check for 3d limit
      
      * rename function
      
      * formatting
      
      * update to MIOPen 2.3
      
      * add support for nd pooling
      
      * formatting
      
      * test miopen 2.4
      
      * change function name
      
      * rename functions
      
      * formatting
      
      * add op_shape test
      
      * add gpu ops tests
      
      * formatting
      
      * initial progress
      
      * formatting
      
      * add pkg-config
      
      * add to support asymmetric padding of averagepool
      
      * clang format
      
      * fix bug for average pooling
      
      * clang format
      
      * fix a bug
      
      * add unit tests for the asymmetric padding of averagepool
      
      * clang format
      
      * change functions
      
      * formatting
      
      * additional code refinement
      
      * clang format
      
      * check existing tests
      
      * formatting
      
      * change to copy_backward
      
      * formatting
      
      * change for loop to transform
      
      * formatting
      
      * add tests
      
      * formatting
      
      * remove comment
      
      * add more tests
      
      * remove an optimization for pooling
      
      * clang format
      
      * add and fix unit tests
      
      * clang format
      
      * update gpu miopen calls
      
      * formatting
      
      * initial progress
      
      * add cpu impl and tests
      
      * formatting
      
      * add NOLINT
      
      * add 3d test
      
      * formatting
      
      * add more op_shape tests
      
      * test diff miopen version
      
      * add submodule onnx
      
      * add pooling shape tests
      
      * fix error msg
      
      * add onnx_test_backend
      
      * reorganize python code
      
      * temp disable test
      
      * fix cppcheck error
      
      * fix cppcheck error
      
      * code backup
      
      * add support device choice
      
      * refine onnx backend test
      
      * revert to miopen 2.4
      
      * fix review comments
      
      * fix review comments
      
      * clang format
      
      * fixed review comments
      
      * clang format
      
      * fix cppcheck error
      
      * copy onnx_backend_test to dest when building
      
      * add testdata folder
      
      * fix bounds
      
      * formatting
      
      * code backup
      
      * code backup
      
      * remove unnecessary file
      
      * fix various bugs
      
      * remove unnecessary changes
      
      * remove unnecessary submodule
      
      * remove unnecessary lines
      
      * fix algorithm
      
      * formatting
      
      * refine onnx backend unit tests
      
      * pin numpy version
      
      * fix build issue
      
      * fixed a filename to be copied
      
      * add the onnx dependency in docker image
      
      * ensure results are copied back correctly
      
      * specify onnx version
      
      * update excluded tests
      
      * remove unnecessary log info
      
      * turn on more unit tests
      
      * restrict onnx backend test to python 3.x
      
      * clang format
      
      * refine retrieving the input parameters
      
      * clang format
      
      * fix program input parameter names
      
      * clang format
      
      * avoid running onnx test in python 2.x
      
      * fix cppcheck error
      
      * fix python2.7 backend unit tests error
      
      * clang format
      
      * resolve the issue of ensure data copy to be completed
      
      * clang format
      
      * fix review comments
      
      * fix onnx backend unit test error
      
      * another change to make onnx backend test pass
      
      * clang format
      
      * fix onnx backend test error
      
      * clang format
      
      * disable onnx backend test to try
      
      * build try
      
      * update Dockerfile to try onnx backend test
      
      * remove unnecessary code
      
      * fix a bug in copying program
      
      * clang format
      
      * update dockerfile to include onnx
      
      * fix review comments
      
      * add the pytest module to the container
      
      * exclude real model to avoid to be downloaded
      
      * resolve the sync device for data copy from gpu to cpu
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * move sync_device after memory_coloring
      Co-authored-by: default avatarKhalique <15948690+kahmed10@users.noreply.github.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      Co-authored-by: default avatarPaul Fultz II <pfultz2@yahoo.com>
      d612e976
    • Shucai Xiao's avatar
      Move alloc copy to finalize (#602) · 0a347dff
      Shucai Xiao authored
      
      
      * code backup
      
      * code backup
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      0a347dff
  8. 12 Aug, 2020 1 commit
  9. 28 Jul, 2020 1 commit
  10. 21 Jul, 2020 2 commits
  11. 10 Jul, 2020 3 commits
    • Paul Fultz II's avatar
      Optimize multiply across slices (#568) · e66968a2
      Paul Fultz II authored
      
      
      * Add initial optimization when using a mul over a sliced convolution
      
      * Formatting
      
      * Add more tests
      
      * Formatting
      
      * Convert to an assert
      
      * Check if used once
      
      * Formatting
      
      * Add test with horiz fusion
      
      * Formatting
      
      * Optimize nested slice
      
      * Formatting
      
      * Fix test
      
      * Add const refs
      
      * Remove unnecessary assert
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      e66968a2
    • Shucai Xiao's avatar
      Gpu batchnorm (#564) · 70ba8213
      Shucai Xiao authored
      
      
      * Initial cpu conv-nd
      
      * Formatting
      
      * Make index signed
      
      * Formatting
      
      * Assert the indices are greater than 0
      
      * Use equal instead of lexicographical_compare
      
      * Formatting
      
      * change the batchnorm cpu implementation to support multiple input dimensions
      
      * clang format
      
      * add unit tests for cpu batch_norm nd implementation
      
      * clang format
      
      * support nd batchnormalization
      
      * clang format
      
      * add rewrite batch_norm unit tests
      
      * clang format
      
      * remove a unit test
      
      * Fix tidy errors
      
      * Formatting
      
      * Handle different types
      
      * Formatting
      
      * Fix nested visits
      
      * Formatting
      
      * Add 3d conv test
      
      * Formatting
      
      * revert unnecessary changes
      
      * remove a print line
      
      * Fix ICE
      
      * Formatting
      
      * fix the per_activation mode of 2d
      
      * clang format
      
      * code clean up
      
      * clang format
      
      * add 1d and 3d gpu unit test
      
      * clang format
      
      * add unit test for rewrite_batchnorm
      
      * clang format
      
      * additional refinement
      
      * fix review comments
      
      * added a unit test to have more code coverage
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      70ba8213
    • Paul Fultz II's avatar
  12. 08 Jul, 2020 1 commit
    • kahmed10's avatar
      Nd pooling gpu (#551) · d1258e80
      kahmed10 authored
      
      
      * initial progress
      
      * formatting
      
      * add pooling changes
      
      * formatting
      
      * change eliminate_pad
      
      * formatting
      
      * rename var
      
      * fomratting
      
      * update op shape test and compute
      
      * formatting
      
      * revert conv constructor
      
      * formatting
      
      * change initializer
      
      * formatting
      
      * fix tidy
      
      * change quant conv and shape check
      
      * add tests and fixes
      
      * formatting
      
      * fix type
      
      * fix conv test
      
      * formatting
      
      * add pooling and bn tests
      
      * formatting
      
      * add inconsistent attr tests
      
      * fix padding issue
      
      * formatting
      
      * progress on 1d to 2d
      
      * formatting
      
      * change compute and compile functions
      
      * formatting
      
      * fix duplicate
      
      * fix conflict
      
      * fix issue with 1d conv
      
      * formatting
      
      * add check for 3d limit
      
      * rename function
      
      * formatting
      
      * update to MIOPen 2.3
      
      * add support for nd pooling
      
      * formatting
      
      * test miopen 2.4
      
      * change function name
      
      * rename functions
      
      * formatting
      
      * add op_shape test
      
      * add gpu ops tests
      
      * formatting
      
      * add pkg-config
      
      * change functions
      
      * formatting
      
      * change to copy_backward
      
      * formatting
      
      * test diff miopen version
      
      * add pooling shape tests
      
      * temp disable test
      
      * revert to miopen 2.4
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      d1258e80
  13. 07 Jul, 2020 1 commit
  14. 23 Jun, 2020 1 commit
    • Shucai Xiao's avatar
      Neg operator (#557) · 866cca5b
      Shucai Xiao authored
      * add the neg operator
      
      * clang format
      
      * add missing operator
      
      * fixed a cppcheck error
      
      * change to use the neg operator
      
      * clang format
      866cca5b
  15. 22 Jun, 2020 1 commit
    • kahmed10's avatar
      ND convolution GPU support (#550) · 1cc724ee
      kahmed10 authored
      * initial progress
      
      * formatting
      
      * add pooling changes
      
      * formatting
      
      * change eliminate_pad
      
      * formatting
      
      * rename var
      
      * fomratting
      
      * update op shape test and compute
      
      * formatting
      
      * revert conv constructor
      
      * formatting
      
      * change initializer
      
      * formatting
      
      * fix tidy
      
      * change quant conv and shape check
      
      * add tests and fixes
      
      * formatting
      
      * fix type
      
      * fix conv test
      
      * formatting
      
      * add pooling and bn tests
      
      * formatting
      
      * add inconsistent attr tests
      
      * fix padding issue
      
      * formatting
      
      * progress on 1d to 2d
      
      * formatting
      
      * change compute and compile functions
      
      * formatting
      
      * fix duplicate
      
      * fix conflict
      
      * fix issue with 1d conv
      
      * formatting
      
      * add check for 3d limit
      
      * rename function
      
      * formatting
      
      * rename functions
      
      * formatting
      
      * add op_shape test
      
      * change functions
      
      * formatting
      
      * change to copy_backward
      
      * formatting
      1cc724ee
  16. 08 Jun, 2020 1 commit
    • kahmed10's avatar
      Enable read support for n-dimensional ops (#537) · cb722cf9
      kahmed10 authored
      
      
      * initial progress
      
      * formatting
      
      * add pooling changes
      
      * formatting
      
      * change eliminate_pad
      
      * formatting
      
      * rename var
      
      * fomratting
      
      * update op shape test and compute
      
      * formatting
      
      * revert conv constructor
      
      * formatting
      
      * change initializer
      
      * formatting
      
      * fix tidy
      
      * change quant conv and shape check
      
      * add tests and fixes
      
      * formatting
      
      * fix type
      
      * fix conv test
      
      * formatting
      
      * add pooling and bn tests
      
      * formatting
      
      * add inconsistent attr tests
      
      * fix padding issue
      
      * formatting
      
      * fix review comments, remove duplicate test
      
      * formatting
      
      * fix variable
      
      * fix assert bug
      
      * fix attr check
      
      * remove std
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      cb722cf9
  17. 03 Jun, 2020 1 commit
    • Shucai Xiao's avatar
      Bert fuse slice reshape trans contiguous (#542) · 93be5e2b
      Shucai Xiao authored
      
      
      * fix pad calc
      
      * Add decompose pass
      
      * Add decompose test
      
      * Formatting
      
      * bert tf passes correctness
      
      * formatting
      
      * Add remap
      
      * Formatting
      
      * add test
      
      * formatting
      
      * remove comment
      
      * Add compute method for dot
      
      * Formatting
      
      * add inline
      
      * Add finder for horizontal fusion
      
      * Formatting
      
      * Formatting
      
      * Reuse predicate
      
      * formatting
      
      * fix order for literal
      
      * formatting
      
      * add test for gelu
      
      * formatting
      
      * added add_gelu fusion
      
      * Add gemm fusions
      
      * Formatting
      
      * add files
      
      * formatting
      
      * test no mul_add
      
      * formatting
      
      * progress on div
      
      * formatting
      
      * continue work on pass
      
      * remove layernorm opt
      
      * revert reduce file
      
      * Add some fixes for convolution
      
      * Formatting
      
      * Fix shape tests
      
      * Formatting
      
      * Reuse axis equal
      
      * Add initial split fusion
      
      * Formatting
      
      * Update offset
      
      * Workaround outputs that cant accept nonstandard shapes
      
      * Formatting
      
      * Add check for split concat
      
      * Formatting
      
      * Add missing headers
      
      * Formatting
      
      * Add tests
      
      * Formatting
      
      * add optimization for bert
      
      * code backup for bert optimization
      
      * continue testing
      
      * formatting
      
      * fix matcher
      
      * formatting
      
      * add gelu_fn and tests
      
      * formatting
      
      * fix matcher, remove extra tests
      
      * formatting
      
      * fix matcher
      
      * add missing files
      
      * add find_layernorm
      
      * add add_transpose to cmake file
      
      * code backup for the contigous fusion
      
      * refine ops fusion
      
      * clang format
      
      * fixed bug in previous optimization
      
      * clang format
      
      * add more optimization
      
      * remove unnecessary code
      
      * refinement of the fustion code
      
      * clang format
      
      * fixed a bug
      
      * add used_once
      
      * formatting
      
      * start on new gelu
      
      * formatting
      
      * add matchers in fuse_ops
      
      * formatting
      
      * add dce to fix add_gelu
      
      * add simplify_rsqrt and test
      
      * formatting
      
      * debugging value for matcher
      
      * formatting
      
      * add more to matchers
      
      * formatting
      
      * fix errors
      
      * remove onnx gen
      
      * add any_arg, change matchers to use either_arg
      
      * formatting
      
      * clang format
      
      * formatting
      
      * add used_once
      
      * formatting
      
      * code cleanup
      
      * clang format
      
      * fixed a bug
      
      * remove unnecessary code
      
      * refine comments
      
      * optimize bert to remove more contiguous
      
      * clang format
      
      * remove unnecessary code
      
      * add unit tests for bert optimization
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * refine a fusion of reshape and slice
      
      * clang format
      
      * fix cppcheck error
      
      * fix review comments
      
      * add the fusion of slice and transpose
      
      * clang format
      
      * add another optimization to fuse slice and transpose
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments
      Co-authored-by: default avatarKhalique <15948690+kahmed10@users.noreply.github.com>
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      Co-authored-by: default avatarShucai Xiao <scxiao@prj47-rack-99.local.lan>
      93be5e2b
  18. 02 Jun, 2020 1 commit
  19. 29 May, 2020 1 commit
  20. 22 May, 2020 1 commit
  21. 20 May, 2020 1 commit
    • Shucai Xiao's avatar
      Rnn variable seq lengths (#517) · 90200619
      Shucai Xiao authored
      
      
      * code backup
      
      * clang format
      
      * fix compiling errors
      
      * clang format
      
      * rename a few files
      
      * rename a few files
      
      * fix variable bugs
      
      * clang format
      
      * add an operator to shift input sequences
      
      * clang format
      
      * fixed a bug
      
      * clang format
      
      * fixed a bug
      
      * clang format
      
      * code backup
      
      * clang format
      
      * code backup
      
      * clang format
      
      * code backup
      
      * clang format
      
      * refine code related lstm operator optimization
      
      * clang format
      
      * fix various bugs
      
      * clang format
      
      * fixed a bug in rewrite_lstm
      
      * clang format
      
      * fixed another bug
      
      * refine two operator names
      
      * clang format
      
      * refine file names
      
      * fix cppcheck error
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * fix cppcheck error
      
      * fixed review comments
      
      * clang format
      
      * add unit tests
      
      * clang format
      
      * add unit tests
      
      * clang format
      
      * refine unit tests for better coverage
      
      * clang format
      
      * fixed a bug
      
      * fix cppcheck error
      
      * fix review comments
      
      * clang format
      
      * rename two operators according to review comments
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments
      
      * fix a cppcheck error
      
      * clang format
      
      * fix review comments
      
      * clang format
      Co-authored-by: default avatarShucai Xiao <scxiao@prj47-rack-99.local.lan>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      90200619
  22. 15 May, 2020 1 commit
    • kahmed10's avatar
      Add gelu optimization (#521) · 0079028a
      kahmed10 authored
      
      
      * fix pad calc
      
      * bert tf passes correctness
      
      * formatting
      
      * add test
      
      * formatting
      
      * remove comment
      
      * add inline
      
      * formatting
      
      * fix order for literal
      
      * formatting
      
      * add test for gelu
      
      * formatting
      
      * added add_gelu fusion
      
      * add files
      
      * formatting
      
      * remove layernorm opt
      
      * revert reduce file
      
      * add gelu_fn and tests
      
      * formatting
      
      * fix matcher, remove extra tests
      
      * formatting
      
      * fix matcher
      
      * add used_once
      
      * formatting
      
      * start on new gelu
      
      * formatting
      
      * add matchers in fuse_ops
      
      * formatting
      
      * add dce to fix add_gelu
      
      * add simplify_rsqrt and test
      
      * formatting
      
      * debugging value for matcher
      
      * formatting
      
      * add more to matchers
      
      * formatting
      
      * fix errors
      
      * remove onnx gen
      
      * add any_arg, change matchers to use either_arg
      
      * formatting
      
      * formatting
      
      * add used_once
      
      * formatting
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      0079028a
  23. 11 May, 2020 1 commit
  24. 08 May, 2020 1 commit
    • Paul Fultz II's avatar
      Horizontal fusions of gemms and convolutions (#472) · 1a4ff504
      Paul Fultz II authored
      
      
      * Add decompose pass
      
      * Add decompose test
      
      * Formatting
      
      * Add remap
      
      * Formatting
      
      * Add compute method for dot
      
      * Formatting
      
      * Add finder for horizontal fusion
      
      * Formatting
      
      * Formatting
      
      * Reuse predicate
      
      * Add gemm fusions
      
      * Formatting
      
      * Add some fixes for convolution
      
      * Formatting
      
      * Fix shape tests
      
      * Formatting
      
      * Reuse axis equal
      
      * Add initial split fusion
      
      * Formatting
      
      * Update offset
      
      * Workaround outputs that cant accept nonstandard shapes
      
      * Formatting
      
      * Add check for split concat
      
      * Formatting
      
      * Add missing headers
      
      * Formatting
      
      * Add tests
      
      * Formatting
      
      * Add more testing
      
      * Formatting
      
      * Fix when there is duplicate splits in inputs
      
      * Formatting
      
      * Fix mismatch iterators
      
      * Add tests for dot fusions
      
      * Formatting
      
      * Add test for convolution
      
      * Formatting
      
      * Fix tidy issues
      
      * Add more tests
      
      * Formatting
      
      * Ignore build directory for codecov
      
      * Add test for groups
      
      * Formatting
      
      * Add more tests for groups
      
      * Formatting
      
      * Add test for missing end slice
      
      * Add newline
      
      * Remove unused function
      
      * Add support for when beta is not 1
      
      * Formatting
      
      * Add test for scalar
      
      * Add one more scalar test
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      1a4ff504
  25. 04 May, 2020 1 commit
  26. 21 Apr, 2020 1 commit
  27. 17 Apr, 2020 1 commit
  28. 14 Apr, 2020 1 commit
  29. 08 Apr, 2020 1 commit
  30. 06 Apr, 2020 1 commit
    • Paul Fultz II's avatar
      Upgrade to hcc 3.0 (#487) · e184395c
      Paul Fultz II authored
      * Upgrade compiler to hcc
      
      * Add missing perl package for hcc
      
      * Update hip and rocblas version
      
      * Add flag to speedup build
      
      * Remove symlink
      
      * Add no-gpu-rdc to linking
      
      * Add missing define
      
      * Disable no unused argument
      
      * Suppress warning
      
      * Add definition to all targets
      
      * Fix compiler flags
      
      * Refer to rocblas
      e184395c
  31. 29 Mar, 2020 1 commit
  32. 07 Mar, 2020 1 commit
  33. 06 Mar, 2020 1 commit
    • Shucai Xiao's avatar
      Support multi program outputs (#436) · 5592b921
      Shucai Xiao authored
      
      
      * Add initial api
      
      * Formatting
      
      * Add more api
      
      * Formatting
      
      * Add auto api generation
      
      * Formatting
      
      * Fix some compilation errors
      
      * Change handle struct
      
      * Formatting
      
      * Fix reamining compilation errors
      
      * Formatting
      
      * fixed a bug related to number of outputs
      
      * Simplify using ctype
      
      * Formatting
      
      * Initial c++ generation
      
      * Formatting
      
      * Add C++header
      
      * Formatting
      
      * Add test
      
      * Formatting
      
      * Add initial tests
      
      * Formatting
      
      * Try to fix formatting
      
      * Cleanup formatting
      
      * Formatting
      
      * Fix constructors on the same line
      
      * Fix tests
      
      * Formatting
      
      * Fix tidy issues
      
      * Fix tidy issues
      
      * Fix naming issue
      
      * Add onnx API to parse buffer
      
      * Formatting
      
      * Add arguments api
      
      * Formatting
      
      * Fix verify parameters
      
      * Fix cppcheck issues
      
      * Formatting
      
      * Add method to get output shapes and bytes
      
      * Formatting
      
      * Try formatting
      
      * Formatting
      
      * Improve the test coverage
      
      * Formatting
      
      * Add print method
      
      * Formatting
      
      * Fix cppcheck issue
      
      * Fix package dependency
      
      * code backup for support multiple outputs
      
      * clang format
      
      * change migraphx api to support multiple program outputs
      
      * clang format
      
      * change api implementation
      
      * clang format
      
      * clang format
      
      * fix a build error
      
      * additional changes
      
      * clang format
      
      * change api for correct automatic generation
      
      * clang format
      
      * fix unit test error
      
      * fix unit test error
      
      * fix unit tests error
      
      * support multiple program outputs
      
      * clang format
      
      * remove @ from the add_return name
      
      * Add nolint
      
      * Try fix formatting
      
      * Formatting
      
      * formatting
      
      * formatting
      
      * Fix formatting
      
      * code cleanup
      
      * clang format
      
      * fix cppcheck error
      
      * fix a cppcheck error
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * record graph output name
      
      * clang format
      
      * refine print the add_return instruction
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * refine the name of the add_return instruction
      
      * fixed a bug related to workspace
      
      * fixed two small bugs
      
      * clang format
      
      * add more unit tests for multiple program outputs
      
      * clang format
      
      * change an error info
      
      * clang format
      
      * fix cppcheck error
      
      * add unit test for better code coverage
      
      * change to reduce code change
      
      * clang format
      
      * remove storing program output
      
      * fix cppcheck error
      
      * fix review comments
      
      * clang format
      
      * clang format
      
      * remove unnecessary change
      
      * resolve an assert error
      
      * clang format
      
      * change the output name with prefix '#'
      
      * changes in quantization function to support the returns instructin
      
      * clang format
      
      * refine unit tests
      
      * clang format
      
      * refine profiling print out report
      Co-authored-by: default avatarPaul <pfultz2@yahoo.com>
      Co-authored-by: default avatarKhalique <15948690+kahmed10@users.noreply.github.com>
      Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
      5592b921
  34. 26 Feb, 2020 1 commit
  35. 24 Feb, 2020 1 commit
  36. 12 Feb, 2020 1 commit