Commit 7af08783, authored by Xavier Hallade, committed by GitHub

fix divergence in barriers (#3621)

Without this fix, work-items in the same thread group can hit different numbers of barriers, which causes OpenCL GPU execution to hang.
parent b95f39da
@@ -249,6 +249,9 @@ __kernel void findBlocksWithInteractions(real4 periodicBoxSize, real4 invPeriodi
                 neighborsInBuffer -= TILE_SIZE*tilesToStore;
             }
         }
+        else {
+            SYNC_WARPS;
+        }
     }
 }
......
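The effect of the added `else` branch can be illustrated with a small host-side model in plain C. This is only a sketch, not the kernel itself: `GROUP_SIZE`, `barriers_hit`, and `group_is_uniform` are hypothetical names, and it assumes the taken branch executes exactly one `SYNC_WARPS` per pass, which the hunk above does not show. It counts how many barriers each simulated work-item reaches and checks whether the counts agree across the group.

```c
#include <stdbool.h>

#define GROUP_SIZE 8  /* hypothetical group size, for illustration only */

/* Number of barriers one simulated work-item reaches in a single pass.
 * Assumption: the taken branch contains exactly one barrier. */
static int barriers_hit(bool takes_branch, bool with_fix) {
    int count = 0;
    if (takes_branch) {
        count++;  /* barrier inside the existing branch */
    } else if (with_fix) {
        count++;  /* matching barrier added in the else branch */
    }
    return count;
}

/* True when every work-item in the group reaches the same number of
 * barriers, i.e. no work-item is left waiting at a barrier alone. */
static bool group_is_uniform(const bool takes_branch[GROUP_SIZE], bool with_fix) {
    int first = barriers_hit(takes_branch[0], with_fix);
    for (int i = 1; i < GROUP_SIZE; i++) {
        if (barriers_hit(takes_branch[i], with_fix) != first) {
            return false;
        }
    }
    return true;
}
```

In this model, a group whose work-items disagree on the branch condition is non-uniform without the `else` barrier and uniform with it. A group where every work-item takes the same path is uniform either way, which is consistent with the hang appearing only in some cases.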