Commit 89dfc4dd authored by Shucai Xiao

Fixed a bug in the GPU implementation of logsoftmax: the block size is now derived from batch_item_num instead of n_dim.

parent 7804a412
@@ -31,7 +31,7 @@ void logsoftmax(hipStream_t stream, const argument& result, const argument& arg,
     // use one block for items in one batch.
     const size_t max_block_size = 1024;
     size_t block_size = 1;
-    while(block_size < max_block_size and block_size < n_dim)
+    while(block_size < max_block_size and block_size < batch_item_num)
     {
         block_size *= 2;
     }
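
The loop in this hunk rounds the per-batch item count up to a power of two, capped at `max_block_size`. A minimal standalone sketch of that computation follows. The helper name `compute_block_size` and its signature are hypothetical and are not part of this commit. The variable names suggest that `n_dim` counts tensor dimensions while `batch_item_num` counts the items in one batch, which would explain why sizing the block by `n_dim` could yield too few threads. That reading is an assumption, since the diff does not show where either variable is defined.

```cpp
#include <cstddef>

// Hypothetical helper mirroring the loop in the hunk above.
// Returns the smallest power of two that is >= batch_item_num,
// capped at max_block_size (assumed to be a power of two itself).
std::size_t compute_block_size(std::size_t batch_item_num,
                               std::size_t max_block_size = 1024)
{
    std::size_t block_size = 1;
    // Keep doubling until the block covers the batch or reaches the cap.
    while(block_size < max_block_size and block_size < batch_item_num)
    {
        block_size *= 2;
    }
    return block_size;
}
```

With the default cap, a batch of 3 items gets a block of 4 threads, a batch of 1000 gets 1024, and anything larger stays at 1024. If `max_block_size` is not a power of two, the result can exceed it, so this sketch assumes the cap is a power of two, as it is in the hunk.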