Tune HIP LangevinMiddle launch block size
Use explicit 128-thread launches for the three LangevinMiddle integration kernels to improve HIP throughput while preserving the existing PME launch heuristics.
Showing
Please register or sign in to comment