Tune HIP neighbor-list launch heuristics
Apply heuristics for HIP neighbor-list construction: use fewer nonbonded force blocks for small neighbor-list systems, use two tiles per batch for larger atom-block counts, and increase the findBlocksWithInteractions thread block size for small atom-block counts. Standard concurrent validation shows no clear per-case regression and a small geomean throughput improvement over the current blocksPerCU baseline.
Showing
Please register or sign in to comment