1. 29 Oct, 2023 1 commit
  2. 24 Oct, 2023 1 commit
  3. 20 Sep, 2023 1 commit
  4. 19 Sep, 2023 1 commit
  5. 13 Sep, 2023 2 commits
  6. 07 Sep, 2023 1 commit
  7. 29 Aug, 2023 1 commit
  8. 25 Jan, 2023 1 commit
    • Batchnorm inference instances, external API, client examples and gtests (#531) · a1b2441f
      Qianfeng authored
      * File renaming and class renaming for device element-wise operation
      
      * Add batchnorm-infer instances, external API and client example
      
      * Add batchnorm-infer profiler module and gtests
      
      * Remove file device_elementwise_extension.hpp and move NormalizeInInfer operation to element_wise_operation.hpp
      
      * Remove the use of class aliasing for DeviceElementwiseForBatchNormInfer
      
      * Rename class and file due to conflict from device_elementwise_2d.hpp
      
      * Fix namespace in batchnorm_infer_nhwc client example
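For readers unfamiliar with what a batchnorm inference instance computes, the following is a minimal host-side sketch of the standard inference formula, y = gamma * (x - mean) / sqrt(variance + epsilon) + beta, applied per channel to an NHWC buffer. The function name `batchnorm_infer_nhwc_ref` and its signature are assumptions made for illustration. They are not the library's external API or the NormalizeInInfer implementation.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical host reference (assumed name and signature, not the library API).
// x is a packed NHWC buffer, so the channel index is the fastest-varying one.
std::vector<float> batchnorm_infer_nhwc_ref(const std::vector<float>& x,
                                            std::size_t channels,
                                            const std::vector<float>& mean,
                                            const std::vector<float>& variance,
                                            const std::vector<float>& gamma,
                                            const std::vector<float>& beta,
                                            float epsilon)
{
    std::vector<float> y(x.size());
    for(std::size_t i = 0; i < x.size(); ++i)
    {
        // Channel of element i in a packed NHWC layout.
        const std::size_t c = i % channels;
        // Inference uses the stored (running) statistics; nothing is reduced here.
        const float inv_std = 1.0f / std::sqrt(variance[c] + epsilon);
        y[i] = gamma[c] * (x[i] - mean[c]) * inv_std + beta[c];
    }
    return y;
}
```

Because the statistics are fixed at inference time, the computation is purely element-wise, which is consistent with the commit routing it through a device element-wise operation.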
  9. 14 Nov, 2022 1 commit
    • Rangify STL algorithms (#438) · dc663fae
      Po Yen Chen authored
      * Rangify STL algorithms
      
      This commit adopts rangified versions of std::copy(), std::fill() and std::transform()
      
      * Re-write more std::copy() calls
      
      * Re-write std::copy() calls in profiler
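As an illustration of what "rangifying" an algorithm call can look like, here is a minimal sketch of thin wrappers that accept a whole range instead of an iterator pair. The namespace `sketch` and the wrapper signatures are assumptions for this example. The commit's actual helpers may be named and structured differently, and in C++20 the `std::ranges` algorithms serve the same purpose.

```cpp
#include <algorithm>
#include <cassert>
#include <iterator>
#include <vector>

// Hypothetical wrappers (assumed names); each forwards to the iterator-pair
// STL algorithm so call sites no longer repeat begin()/end().
namespace sketch {

template <typename Range, typename OutputIt>
OutputIt copy(Range&& range, OutputIt out)
{
    return std::copy(std::begin(range), std::end(range), out);
}

template <typename Range, typename T>
void fill(Range&& range, const T& value)
{
    std::fill(std::begin(range), std::end(range), value);
}

template <typename Range, typename OutputIt, typename UnaryOp>
OutputIt transform(Range&& range, OutputIt out, UnaryOp op)
{
    return std::transform(std::begin(range), std::end(range), out, op);
}

} // namespace sketch
```

A call such as `std::copy(v.begin(), v.end(), out)` then becomes `sketch::copy(v, out)`, which removes the chance of pairing iterators from two different containers.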
  10. 19 Oct, 2022 1 commit
  11. 17 Oct, 2022 1 commit
    • adding tensor_permutation example folder (#389) · cee440fe
      arai713 authored
      * adding tensor_permutation example folder
      
      * fixed formatting
      
      * adding tensor_permutation example folder
      
      * fixed formatting
      
      * changed deviceelementwise parameters for outscalar
      
      * removed .swo file
      
      * updated folder/file name
      
      * changed function call in verification for better consistency with hostelementwise parameters
      
      * formatted again
      
      * fixed shape in verification function call
      
      * changed verification function call, added definition for nhwc
      
      * added elementwise permute example
      
      * updated CMakeLists file in folder
      
      * Delete CmakeLists.txt
      
      * Delete tensor_permute.cpp
      
      * first version of 2d gridwise_elementwise kernel
      
      * temporary fix for stride problem
      
      * formatting
      
      * format
      
      * changed directory name
      
      * Delete gridwise_elementwise_2d.hpp
      
      * Delete CMakeLists.txt
      
      * Delete extra file
      
      * delete extra file
      
      * got rid of extraneous code
      
      * added 2d device elementwise file
      
      * deleted accidentally added file
      
      * update
      
      * stride values generalized with equations
      
      * updated stride for output matrix
      
      * Update CMakeLists.txt
      
      * removed extraneous commented code
      
      * removed shape_nchw vector, replaced with GetLength for each dimension
      
      * changed vector load in kernel call
      
      * removed extra space in CMake
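The stride-related items in this commit ("stride values generalized with equations", "updated stride for output matrix") concern expressing a permuted layout through strides. The sketch below shows one way an NCHW to NHWC permutation can be written on the host, with output strides derived from the dimension lengths. The function name `permute_nchw_to_nhwc` and the packed-layout assumption are illustrative only and are not taken from the example's source.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical host reference (assumed name). `len` holds {N, C, H, W}.
// Both buffers are assumed packed (no padding between rows).
std::vector<float> permute_nchw_to_nhwc(const std::vector<float>& in,
                                        const std::array<std::size_t, 4>& len)
{
    const std::size_t N = len[0], C = len[1], H = len[2], W = len[3];

    // Strides of the NCHW input, indexed in (n, c, h, w) order.
    const std::array<std::size_t, 4> in_stride{C * H * W, H * W, W, 1};
    // Strides of the NHWC output, still indexed in (n, c, h, w) order.
    const std::array<std::size_t, 4> out_stride{H * W * C, 1, W * C, C};

    std::vector<float> out(in.size());
    for(std::size_t n = 0; n < N; ++n)
        for(std::size_t c = 0; c < C; ++c)
            for(std::size_t h = 0; h < H; ++h)
                for(std::size_t w = 0; w < W; ++w)
                {
                    const std::size_t src = n * in_stride[0] + c * in_stride[1] +
                                            h * in_stride[2] + w * in_stride[3];
                    const std::size_t dst = n * out_stride[0] + c * out_stride[1] +
                                            h * out_stride[2] + w * out_stride[3];
                    out[dst] = in[src];
                }
    return out;
}
```

Keeping both stride arrays in the same logical index order means the permutation is carried entirely by the stride values, so the loop body is just an element-wise copy.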