Commit aa15cdd3 authored by Shucai Xiao

Merge branch 'reduce' of https://github.com/ROCmSoftwarePlatform/AMDMIGraphX into opt_log_softmax

parents e59b3101 f5a19acf
@@ -182,7 +182,7 @@ __device__ auto block_reduce(index idx, Op op, T init, std::size_t n, F f)
}
__syncthreads();
- type y = 0;
+ type y = init;
for(std::size_t i = 0; i < idx.nlocal() / 64; i++)
{
y = op(y, buffer[i]);
......
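
The change above seeds the final accumulation with `init` instead of `0`. The host-side sketch below illustrates why that matters. It is not the MIGraphX implementation. It assumes `buffer` holds one partial result per group of 64 threads (which the loop bound `idx.nlocal() / 64` suggests), and `combine_partials`, `max_seeded_with_zero`, and `max_seeded_with_init` are hypothetical names introduced only for this illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <functional>
#include <limits>
#include <vector>

// Hypothetical stand-in for the final loop of block_reduce: fold the
// partial results into a single value, starting from `seed`.
template <class T, class Op>
T combine_partials(Op op, T seed, const std::vector<T>& partials)
{
    T y = seed;
    for(std::size_t i = 0; i < partials.size(); i++)
    {
        y = op(y, partials[i]);
    }
    return y;
}

// Old behaviour: the accumulator starts at 0 regardless of the operator.
inline float max_seeded_with_zero(const std::vector<float>& partials)
{
    auto op = [](float a, float b) { return std::max(a, b); };
    return combine_partials(op, 0.0f, partials);
}

// New behaviour: the accumulator starts at the caller-supplied init value.
// For a max reduction the identity is the lowest representable float.
inline float max_seeded_with_init(const std::vector<float>& partials)
{
    auto op = [](float a, float b) { return std::max(a, b); };
    return combine_partials(op, std::numeric_limits<float>::lowest(), partials);
}
```

With all-negative partials such as `{-3, -1, -2}`, the zero-seeded version returns `0`, a value that never appeared in the input, while the `init`-seeded version returns `-1`. A sum reduction is unaffected because `0` is already its identity, but a product seeded with `0` always collapses to `0`.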