• one's avatar
    Tune HIP neighbor-list launch heuristics · 4d20b76e
    one authored
    Apply heuristics for HIP neighbor-list construction:
    use fewer nonbonded force blocks for small neighbor-list systems, use two
    tiles per batch for larger atom-block counts, and increase the
    findBlocksWithInteractions thread block size for small atom-block counts.
    
    Standard concurrent validation shows no clear per-case regression and a
    small geomean throughput improvement over the current blocksPerCU baseline.
    4d20b76e
HipNonbondedUtilities.cpp 36.5 KB