"vscode:/vscode.git/clone" did not exist on "80d30d49107bd65524d9fbdeac854a601d6403d8"
Improve Reduction kernel api (#152)
* Add ThreadwiseReduction functor as per-thread reduction api * Using ThreadwiseReduce api and some change in using PartitionedBlockwiseReduction api to simply the kernels * Add comments and remove useless declarations in the kernels * Tiny updates
Showing
Please register or sign in to comment