• Qianfeng's avatar
    Overhaul to Reducton and its dependants (#237) · 63eee2d9
    Qianfeng authored
    * Tiny fix in dynamic_buffer.hpp to support vectorized AtomicAdd for double type
    
    * Update to host layer and host reduction
    
    * Merge and remove reduction kernels
    
    * Merge and remove reduction device interfaces and update pooling device interface
    
    * Merge and remove useless reduction device instances
    
    * Update to reduction profiler and reduction ctests
    
    * Update to reduction and pooling examples and add one reduction example
    
    * Change to reduction examples to let them testable by ctest
    
    * Add explicit pass checking for reduction and pooling examples
    
    * Explicit assignment of tensor shapes in example reduce_blockwise_two_call
    
    * Use atomic_add to repace atomicAdd and add atomic_add for double type
    
    * Add reduce ctest support for double data type
    
    * Replace to_int_vector() by using c++ std::vector::assign()
    
    * Keep DeviceReduceThreadWise separated from DeviceReduceBlockWise
    
    * Merge DeviceReduceBlockWise and DeviceReduceMultiBlockAtomicAdd into DeviceReduceMultiBlock
    
    * Add GetAtomicOperationZeroValue() support for AtomicMax
    
    * Tiny change to reduce example README.md
    
    * Fix some tiny issues due to branch merging
    
    * Revoke previous change in dynamic_buffer.hpp and add atomic_add for double2_t
    
    * Add reduce multiblock_atomic_add instances for fp64 to verify vectorized atomic_add on fp64
    
    * Renaming
    
    * Clean the header includings in device_reduce instances header files
    63eee2d9
test_reduce_no_index.sh 2.49 KB