"library/src/host_tensor/device.cpp" did not exist on "5c7cec11159d22636dd4c1119e7e430d156a8df7"
Improve Reduction kernel api (#152)
* Add ThreadwiseReduction functor as per-thread reduction api * Using ThreadwiseReduce api and some change in using PartitionedBlockwiseReduction api to simply the kernels * Add comments and remove useless declarations in the kernels * Tiny updates
Showing
Please register or sign in to comment