  1. 21 Jul, 2020 2 commits
  2. 10 Jul, 2020 3 commits
    • Optimize multiply across slices (#568) · e66968a2
      Paul Fultz II authored
      
      
      * Add initial optimization when using a mul over a sliced convolution
      
      * Formatting
      
      * Add more tests
      
      * Formatting
      
      * Convert to an assert
      
      * Check if used once
      
      * Formatting
      
      * Add test with horiz fusion
      
      * Formatting
      
      * Optimize nested slice
      
      * Formatting
      
      * Fix test
      
      * Add const refs
      
      * Remove unnecessary assert
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
    • Gpu batchnorm (#564) · 70ba8213
      Shucai Xiao authored
      
      
      * Initial cpu conv-nd
      
      * Formatting
      
      * Make index signed
      
      * Formatting
      
      * Assert the indices are greater than 0
      
      * Use equal instead of lexicographical_compare
      
      * Formatting
      
      * change the batchnorm cpu implementation to support multiple input dimensions
      
      * clang format
      
      * add unit tests for cpu batch_norm nd implementation
      
      * clang format
      
      * support nd batchnormalization
      
      * clang format
      
      * add rewrite batch_norm unit tests
      
      * clang format
      
      * remove a unit test
      
      * Fix tidy errors
      
      * Formatting
      
      * Handle different types
      
      * Formatting
      
      * Fix nested visits
      
      * Formatting
      
      * Add 3d conv test
      
      * Formatting
      
      * revert unnecessary changes
      
      * remove a print line
      
      * Fix ICE
      
      * Formatting
      
      * fix the per_activation mode of 2d
      
      * clang format
      
      * code clean up
      
      * clang format
      
      * add 1d and 3d gpu unit test
      
      * clang format
      
      * add unit test for rewrite_batchnorm
      
      * clang format
      
      * additional refinement
      
      * fix review comments
      
      * added a unit test to have more code coverage
      Co-authored-by: Paul <pfultz2@yahoo.com>
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
    • Paul Fultz II's avatar
  3. 08 Jul, 2020 1 commit
    • Nd pooling gpu (#551) · d1258e80
      kahmed10 authored
      
      
      * initial progress
      
      * formatting
      
      * add pooling changes
      
      * formatting
      
      * change eliminate_pad
      
      * formatting
      
      * rename var
      
      * formatting
      
      * update op shape test and compute
      
      * formatting
      
      * revert conv constructor
      
      * formatting
      
      * change initializer
      
      * formatting
      
      * fix tidy
      
      * change quant conv and shape check
      
      * add tests and fixes
      
      * formatting
      
      * fix type
      
      * fix conv test
      
      * formatting
      
      * add pooling and bn tests
      
      * formatting
      
      * add inconsistent attr tests
      
      * fix padding issue
      
      * formatting
      
      * progress on 1d to 2d
      
      * formatting
      
      * change compute and compile functions
      
      * formatting
      
      * fix duplicate
      
      * fix conflict
      
      * fix issue with 1d conv
      
      * formatting
      
      * add check for 3d limit
      
      * rename function
      
      * formatting
      
      * update to MIOpen 2.3
      
      * add support for nd pooling
      
      * formatting
      
      * test miopen 2.4
      
      * change function name
      
      * rename functions
      
      * formatting
      
      * add op_shape test
      
      * add gpu ops tests
      
      * formatting
      
      * add pkg-config
      
      * change functions
      
      * formatting
      
      * change to copy_backward
      
      * formatting
      
      * test diff miopen version
      
      * add pooling shape tests
      
      * temp disable test
      
      * revert to miopen 2.4
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  4. 07 Jul, 2020 1 commit
  5. 23 Jun, 2020 1 commit
    • Neg operator (#557) · 866cca5b
      Shucai Xiao authored
      * add the neg operator
      
      * clang format
      
      * add missing operator
      
      * fixed a cppcheck error
      
      * change to use the neg operator
      
      * clang format
  6. 22 Jun, 2020 1 commit
    • ND convolution GPU support (#550) · 1cc724ee
      kahmed10 authored
      * initial progress
      
      * formatting
      
      * add pooling changes
      
      * formatting
      
      * change eliminate_pad
      
      * formatting
      
      * rename var
      
      * formatting
      
      * update op shape test and compute
      
      * formatting
      
      * revert conv constructor
      
      * formatting
      
      * change initializer
      
      * formatting
      
      * fix tidy
      
      * change quant conv and shape check
      
      * add tests and fixes
      
      * formatting
      
      * fix type
      
      * fix conv test
      
      * formatting
      
      * add pooling and bn tests
      
      * formatting
      
      * add inconsistent attr tests
      
      * fix padding issue
      
      * formatting
      
      * progress on 1d to 2d
      
      * formatting
      
      * change compute and compile functions
      
      * formatting
      
      * fix duplicate
      
      * fix conflict
      
      * fix issue with 1d conv
      
      * formatting
      
      * add check for 3d limit
      
      * rename function
      
      * formatting
      
      * rename functions
      
      * formatting
      
      * add op_shape test
      
      * change functions
      
      * formatting
      
      * change to copy_backward
      
      * formatting
  7. 08 Jun, 2020 1 commit
    • Enable read support for n-dimensional ops (#537) · cb722cf9
      kahmed10 authored
      
      
      * initial progress
      
      * formatting
      
      * add pooling changes
      
      * formatting
      
      * change eliminate_pad
      
      * formatting
      
      * rename var
      
      * formatting
      
      * update op shape test and compute
      
      * formatting
      
      * revert conv constructor
      
      * formatting
      
      * change initializer
      
      * formatting
      
      * fix tidy
      
      * change quant conv and shape check
      
      * add tests and fixes
      
      * formatting
      
      * fix type
      
      * fix conv test
      
      * formatting
      
      * add pooling and bn tests
      
      * formatting
      
      * add inconsistent attr tests
      
      * fix padding issue
      
      * formatting
      
      * fix review comments, remove duplicate test
      
      * formatting
      
      * fix variable
      
      * fix assert bug
      
      * fix attr check
      
      * remove std
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  8. 03 Jun, 2020 1 commit
    • Bert fuse slice reshape trans contiguous (#542) · 93be5e2b
      Shucai Xiao authored
      
      
      * fix pad calc
      
      * Add decompose pass
      
      * Add decompose test
      
      * Formatting
      
      * bert tf passes correctness
      
      * formatting
      
      * Add remap
      
      * Formatting
      
      * add test
      
      * formatting
      
      * remove comment
      
      * Add compute method for dot
      
      * Formatting
      
      * add inline
      
      * Add finder for horizontal fusion
      
      * Formatting
      
      * Formatting
      
      * Reuse predicate
      
      * formatting
      
      * fix order for literal
      
      * formatting
      
      * add test for gelu
      
      * formatting
      
      * added add_gelu fusion
      
      * Add gemm fusions
      
      * Formatting
      
      * add files
      
      * formatting
      
      * test no mul_add
      
      * formatting
      
      * progress on div
      
      * formatting
      
      * continue work on pass
      
      * remove layernorm opt
      
      * revert reduce file
      
      * Add some fixes for convolution
      
      * Formatting
      
      * Fix shape tests
      
      * Formatting
      
      * Reuse axis equal
      
      * Add initial split fusion
      
      * Formatting
      
      * Update offset
      
      * Workaround outputs that can't accept nonstandard shapes
      
      * Formatting
      
      * Add check for split concat
      
      * Formatting
      
      * Add missing headers
      
      * Formatting
      
      * Add tests
      
      * Formatting
      
      * add optimization for bert
      
      * code backup for bert optimization
      
      * continue testing
      
      * formatting
      
      * fix matcher
      
      * formatting
      
      * add gelu_fn and tests
      
      * formatting
      
      * fix matcher, remove extra tests
      
      * formatting
      
      * fix matcher
      
      * add missing files
      
      * add find_layernorm
      
      * add add_transpose to cmake file
      
      * code backup for the contiguous fusion
      
      * refine ops fusion
      
      * clang format
      
      * fixed bug in previous optimization
      
      * clang format
      
      * add more optimization
      
      * remove unnecessary code
      
      * refinement of the fusion code
      
      * clang format
      
      * fixed a bug
      
      * add used_once
      
      * formatting
      
      * start on new gelu
      
      * formatting
      
      * add matchers in fuse_ops
      
      * formatting
      
      * add dce to fix add_gelu
      
      * add simplify_rsqrt and test
      
      * formatting
      
      * debugging value for matcher
      
      * formatting
      
      * add more to matchers
      
      * formatting
      
      * fix errors
      
      * remove onnx gen
      
      * add any_arg, change matchers to use either_arg
      
      * formatting
      
      * clang format
      
      * formatting
      
      * add used_once
      
      * formatting
      
      * code cleanup
      
      * clang format
      
      * fixed a bug
      
      * remove unnecessary code
      
      * refine comments
      
      * optimize bert to remove more contiguous
      
      * clang format
      
      * remove unnecessary code
      
      * add unit tests for bert optimization
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * refine a fusion of reshape and slice
      
      * clang format
      
      * fix cppcheck error
      
      * fix review comments
      
      * add the fusion of slice and transpose
      
      * clang format
      
      * add another optimization to fuse slice and transpose
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments
      Co-authored-by: Khalique <15948690+kahmed10@users.noreply.github.com>
      Co-authored-by: Paul <pfultz2@yahoo.com>
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
      Co-authored-by: Shucai Xiao <scxiao@prj47-rack-99.local.lan>
  9. 02 Jun, 2020 1 commit
  10. 29 May, 2020 1 commit
  11. 22 May, 2020 1 commit
  12. 20 May, 2020 1 commit
    • Rnn variable seq lengths (#517) · 90200619
      Shucai Xiao authored
      
      
      * code backup
      
      * clang format
      
      * fix compiling errors
      
      * clang format
      
      * rename a few files
      
      * rename a few files
      
      * fix variable bugs
      
      * clang format
      
      * add an operator to shift input sequences
      
      * clang format
      
      * fixed a bug
      
      * clang format
      
      * fixed a bug
      
      * clang format
      
      * code backup
      
      * clang format
      
      * code backup
      
      * clang format
      
      * code backup
      
      * clang format
      
      * refine code related lstm operator optimization
      
      * clang format
      
      * fix various bugs
      
      * clang format
      
      * fixed a bug in rewrite_lstm
      
      * clang format
      
      * fixed another bug
      
      * refine two operator names
      
      * clang format
      
      * refine file names
      
      * fix cppcheck error
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * fix cppcheck error
      
      * fixed review comments
      
      * clang format
      
      * add unit tests
      
      * clang format
      
      * add unit tests
      
      * clang format
      
      * refine unit tests for better coverage
      
      * clang format
      
      * fixed a bug
      
      * fix cppcheck error
      
      * fix review comments
      
      * clang format
      
      * rename two operators according to review comments
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix review comments
      
      * fix a cppcheck error
      
      * clang format
      
      * fix review comments
      
      * clang format
      Co-authored-by: Shucai Xiao <scxiao@prj47-rack-99.local.lan>
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  13. 15 May, 2020 1 commit
    • Add gelu optimization (#521) · 0079028a
      kahmed10 authored
      
      
      * fix pad calc
      
      * bert tf passes correctness
      
      * formatting
      
      * add test
      
      * formatting
      
      * remove comment
      
      * add inline
      
      * formatting
      
      * fix order for literal
      
      * formatting
      
      * add test for gelu
      
      * formatting
      
      * added add_gelu fusion
      
      * add files
      
      * formatting
      
      * remove layernorm opt
      
      * revert reduce file
      
      * add gelu_fn and tests
      
      * formatting
      
      * fix matcher, remove extra tests
      
      * formatting
      
      * fix matcher
      
      * add used_once
      
      * formatting
      
      * start on new gelu
      
      * formatting
      
      * add matchers in fuse_ops
      
      * formatting
      
      * add dce to fix add_gelu
      
      * add simplify_rsqrt and test
      
      * formatting
      
      * debugging value for matcher
      
      * formatting
      
      * add more to matchers
      
      * formatting
      
      * fix errors
      
      * remove onnx gen
      
      * add any_arg, change matchers to use either_arg
      
      * formatting
      
      * formatting
      
      * add used_once
      
      * formatting
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  14. 11 May, 2020 1 commit
  15. 08 May, 2020 1 commit
    • Horizontal fusions of gemms and convolutions (#472) · 1a4ff504
      Paul Fultz II authored
      
      
      * Add decompose pass
      
      * Add decompose test
      
      * Formatting
      
      * Add remap
      
      * Formatting
      
      * Add compute method for dot
      
      * Formatting
      
      * Add finder for horizontal fusion
      
      * Formatting
      
      * Formatting
      
      * Reuse predicate
      
      * Add gemm fusions
      
      * Formatting
      
      * Add some fixes for convolution
      
      * Formatting
      
      * Fix shape tests
      
      * Formatting
      
      * Reuse axis equal
      
      * Add initial split fusion
      
      * Formatting
      
      * Update offset
      
      * Workaround outputs that can't accept nonstandard shapes
      
      * Formatting
      
      * Add check for split concat
      
      * Formatting
      
      * Add missing headers
      
      * Formatting
      
      * Add tests
      
      * Formatting
      
      * Add more testing
      
      * Formatting
      
      * Fix when there is duplicate splits in inputs
      
      * Formatting
      
      * Fix mismatch iterators
      
      * Add tests for dot fusions
      
      * Formatting
      
      * Add test for convolution
      
      * Formatting
      
      * Fix tidy issues
      
      * Add more tests
      
      * Formatting
      
      * Ignore build directory for codecov
      
      * Add test for groups
      
      * Formatting
      
      * Add more tests for groups
      
      * Formatting
      
      * Add test for missing end slice
      
      * Add newline
      
      * Remove unused function
      
      * Add support for when beta is not 1
      
      * Formatting
      
      * Add test for scalar
      
      * Add one more scalar test
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  16. 04 May, 2020 1 commit
  17. 21 Apr, 2020 1 commit
  18. 17 Apr, 2020 1 commit
  19. 14 Apr, 2020 1 commit
  20. 08 Apr, 2020 1 commit
  21. 06 Apr, 2020 1 commit
    • Upgrade to hcc 3.0 (#487) · e184395c
      Paul Fultz II authored
      * Upgrade compiler to hcc
      
      * Add missing perl package for hcc
      
      * Update hip and rocblas version
      
      * Add flag to speedup build
      
      * Remove symlink
      
      * Add no-gpu-rdc to linking
      
      * Add missing define
      
      * Disable no unused argument
      
      * Suppress warning
      
      * Add definition to all targets
      
      * Fix compiler flags
      
      * Refer to rocblas
  22. 29 Mar, 2020 1 commit
  23. 07 Mar, 2020 1 commit
  24. 06 Mar, 2020 1 commit
    • Support multi program outputs (#436) · 5592b921
      Shucai Xiao authored
      
      
      * Add initial api
      
      * Formatting
      
      * Add more api
      
      * Formatting
      
      * Add auto api generation
      
      * Formatting
      
      * Fix some compilation errors
      
      * Change handle struct
      
      * Formatting
      
      * Fix remaining compilation errors
      
      * Formatting
      
      * fixed a bug related to number of outputs
      
      * Simplify using ctype
      
      * Formatting
      
      * Initial c++ generation
      
      * Formatting
      
      * Add C++header
      
      * Formatting
      
      * Add test
      
      * Formatting
      
      * Add initial tests
      
      * Formatting
      
      * Try to fix formatting
      
      * Cleanup formatting
      
      * Formatting
      
      * Fix constructors on the same line
      
      * Fix tests
      
      * Formatting
      
      * Fix tidy issues
      
      * Fix tidy issues
      
      * Fix naming issue
      
      * Add onnx API to parse buffer
      
      * Formatting
      
      * Add arguments api
      
      * Formatting
      
      * Fix verify parameters
      
      * Fix cppcheck issues
      
      * Formatting
      
      * Add method to get output shapes and bytes
      
      * Formatting
      
      * Try formatting
      
      * Formatting
      
      * Improve the test coverage
      
      * Formatting
      
      * Add print method
      
      * Formatting
      
      * Fix cppcheck issue
      
      * Fix package dependency
      
      * code backup for support multiple outputs
      
      * clang format
      
      * change migraphx api to support multiple program outputs
      
      * clang format
      
      * change api implementation
      
      * clang format
      
      * clang format
      
      * fix a build error
      
      * additional changes
      
      * clang format
      
      * change api for correct automatic generation
      
      * clang format
      
      * fix unit test error
      
      * fix unit test error
      
      * fix unit tests error
      
      * support multiple program outputs
      
      * clang format
      
      * remove @ from the add_return name
      
      * Add nolint
      
      * Try fix formatting
      
      * Formatting
      
      * formatting
      
      * formatting
      
      * Fix formatting
      
      * code cleanup
      
      * clang format
      
      * fix cppcheck error
      
      * fix a cppcheck error
      
      * clang format
      
      * fix review comments
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * record graph output name
      
      * clang format
      
      * refine print the add_return instruction
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * refine the name of the add_return instruction
      
      * fixed a bug related to workspace
      
      * fixed two small bugs
      
      * clang format
      
      * add more unit tests for multiple program outputs
      
      * clang format
      
      * change an error info
      
      * clang format
      
      * fix cppcheck error
      
      * add unit test for better code coverage
      
      * change to reduce code change
      
      * clang format
      
      * remove storing program output
      
      * fix cppcheck error
      
      * fix review comments
      
      * clang format
      
      * clang format
      
      * remove unnecessary change
      
      * resolve an assert error
      
      * clang format
      
      * change the output name with prefix '#'
      
      * changes in quantization function to support the returns instruction
      
      * clang format
      
      * refine unit tests
      
      * clang format
      
      * refine profiling print out report
      Co-authored-by: Paul <pfultz2@yahoo.com>
      Co-authored-by: Khalique <15948690+kahmed10@users.noreply.github.com>
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  25. 26 Feb, 2020 1 commit
  26. 24 Feb, 2020 1 commit
  27. 12 Feb, 2020 1 commit
  28. 10 Feb, 2020 1 commit
    • Add additional simple operators (MatMulInteger, ConvInteger, Asinh, Acosh, and Atanh) (#431) · a023ec19
      Shucai Xiao authored
      
      
      * Add initial api
      
      * Formatting
      
      * Add more api
      
      * Formatting
      
      * add more operators (asinh, acosh, atanh, MatMulInteger, ConvInteger)
      
      * clang format
      
      * add unit tests for new operators
      
      * clang format
      
      * Add auto api generation
      
      * Formatting
      
      * Fix some compilation errors
      
      * Change handle struct
      
      * Formatting
      
      * Fix remaining compilation errors
      
      * Formatting
      
      * Simplify using ctype
      
      * Formatting
      
      * Initial c++ generation
      
      * Formatting
      
      * Add C++header
      
      * Formatting
      
      * Add test
      
      * Formatting
      
      * Add initial tests
      
      * Formatting
      
      * Try to fix formatting
      
      * Cleanup formatting
      
      * Formatting
      
      * Fix constructors on the same line
      
      * Fix tests
      
      * Formatting
      
      * Fix tidy issues
      
      * Fix tidy issues
      
      * Fix naming issue
      
      * Add onnx API to parse buffer
      
      * Formatting
      
      * Add arguments api
      
      * Formatting
      
      * Fix verify parameters
      
      * Fix cppcheck issues
      
      * Formatting
      
      * Add method to get output shapes and bytes
      
      * Formatting
      
      * Try formatting
      
      * Formatting
      
      * Improve the test coverage
      
      * Formatting
      
      * Add print method
      
      * Formatting
      
      * Fix cppcheck issue
      
      * Fix package dependency
      
      * Add nolint
      
      * Try fix formatting
      
      * Formatting
      
      * formatting
      
      * formatting
      
      * Fix formatting
      Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
      Co-authored-by: kahmed10 <15948690+kahmed10@users.noreply.github.com>
  29. 24 Jan, 2020 1 commit
    • Conv transpose op (#429) · cf85b4c6
      kahmed10 authored
      
      
      * initial testing
      
      * add shape op
      
      * formatting
      
      * add env variable for batch sizes
      
      * formatting
      
      * progress on driver
      
      * progress on driver
      
      * cleanup
      
      * cleanup
      
      * add and modified prev tests
      
      * formatting
      
      * remove comment
      
      * add shape op test
      
      * formatting
      
      * manually insert shape op in test
      
      * formatting
      
      * create options struct for parsers
      
      * formatting
      
      * Add documentation for python
      
      * Fix c++ documentation
      
      * add documentation to parser
      
      * formatting
      
      * add argmin and tests
      
      * fix doc and definitions
      
      * formatting
      
      * revert test functions
      
      * formatting
      
      * cpu impl of conv_transpose
      
      * more work on conv_transpose
      
      * rename files, added extra tests
      
      * formatting
      
      * add more tests
      
      * formatting
      
      * changes
      
      * fix tests
      
      * fix tidy
      
      * formatting
      
      * fixed function parameter
      
      * fix function parameter
      
      * add cpu ops test
      
      * formatting
      Co-authored-by: Paul Fultz II <pfultz2@yahoo.com>
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  30. 17 Jan, 2020 1 commit
    • Reduce operators (#427) · e320f89f
      Shucai Xiao authored
      * add reduce operators as required by onnxruntime
      
      * clang format
      
      * remove a test since it can cause overflow
      
      * resolve cppcheck error
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
  31. 20 Dec, 2019 1 commit
    • Improve operators for onnxruntime (#405) · 992666e6
      Shucai Xiao authored
      
      
      * improve unsqueeze to support negative axis and parsing scalar
      
      * clang format
      
      * add a test example for the negative axis of unsqueeze
      
      * improve the squeeze operator to support negative axis
      
      * clang format
      
      * fixed a small bug in the lrn implementation
      
      * clang format
      
      * support negative axis in argmax and argmin
      
      * clang format
      
      * improve flatten to support negative axis
      
      * clang format
      
      * change softmax/logsoftmax to support negative axis
      
      * clang format
      
      * improve transpose by adding default perm
      
      * clang format
      
      * add one more dimens for tensor size
      
      * add one more dimens for tensor size
      
      * disable conv ops fusion for non-symmetric cases
      
      * clang format
      
      * fixed review comments
      
      * move computing axis from the device function to the compute function
      
      * clang format
      
      * move computing axis from device function to the operator computing function
      
      * clang format
      Co-authored-by: mvermeulen <5479696+mvermeulen@users.noreply.github.com>
  32. 27 Nov, 2019 1 commit
    • Add support for builtin models in the driver (#404) · b0aef8c8
      Paul Fultz II authored
      * Add experimental support for c++ output
      
      * Format
      
      * Fix syntax errors
      
      * Add resnet50 model
      
      * Formatting
      
      * Add inceptionv3 model
      
      * Formatting
      
      * Add alexnet
      
      * Formatting
      
      * Fix name of pooling mode
      
      * Formatting
      
      * Fix tidy issues
      
      * Ignore driver directory
      
      * Show acceptable values
  33. 18 Nov, 2019 1 commit
    • Improve concat gather (#402) · 0045d0b7
      Shucai Xiao authored
      * improve gather implementation to handle negative input indices
      
      * clang format
      
      * clang format
      
      * improve concat to support neg axis input
      
      * clang format
      
      * fix cppcheck error
      
      * clang format
      
      * code cleanup
      
      * clang format
      
      * fix review comments
      
      * clang format
  34. 17 Nov, 2019 1 commit
  35. 15 Nov, 2019 1 commit
    • Add option to do offload copying automatically (#403) · 81b0ff5d
      Paul Fultz II authored
      * Add compiler options
      
      * Add copy operators
      
      * Formatting
      
      * Use run_passes in tests
      
      * Formatting
      
      * Use run_pass in schedule test
      
      * Formatting
      
      * Add compile_options to get_passes in target
      
      * Formatting
      
      * Offload copy option
      
      * Formatting
      
      * Copy using pinned memory
      
      * Formatting
      
      * Improve performance of gpu copying
      
      * Formatting
      
      * Don't copy
      
      * Formatting
      
      * Always make an extra copy
      
      * Formatting
      
      * Remove unused write op
      
      * Add missing include
      
      * Remove copy_to_gpu function in python api
      
      * Make offload copy disabled by default on C++
      
      * Formatting
      
      * Fix tidy issues
      
      * Formatting
      
      * Fix namespace
      
      * Fix python tests
      
      * Turn clang format off since it's broken
      
      * Fix compile error on gcc 5
      
      * Remove commented code
  36. 04 Nov, 2019 2 commits
    • Use multi_index for reductions (#400) · 3e0496fb
      Paul Fultz II authored
      * Add functions to do multi-index for local strides as well
      
      * Formatting
      
      * Use same multi-index path for block_reduce
      
      * Formatting
      
      * Use multi-index calc in reduce
      
      * Formatting
      
      * Fix warning
      
      * Fix compiler warning
      
      * Disable some tidy checks
    • Fix accuracy issue in resnet50 (#395) · 78c83426
      Paul Fultz II authored
      * Fix bug in eliminate_concat
      
      * Formatting
      
      * Skip context_free operators
      
      * Formatting
      
      * Fix unit test
      
      * Formatting