Commits · 4ab645ea6911bf29e9ec6fd13b50ba1e196cbc40 · tsoc / openmm

10 Feb, 2026 1 commit

GPU implementation of L-BFGS (#5198) · 4ab645ea

Evan Pretti authored Feb 10, 2026

* Make reference/CPU minimizer into a kernel

* Add per-platform support for GPU minimization

* Initial implementation of GPU minimization

* Fixes

* Increase robustness when initial gradient is huge

* Handle overflow leading to non-finite values gracefully

* Handle large forces in single precision more robustly

* Optimize kernels

* Fix kernel launch size

* Update banner years

* Don't create MinimizeKernel until first minimization requested

* Make some compile-time constants into kernel arguments

* Consolidate scale calculation kernel

* Condense alpha/beta reduction kernels using atomics

* Condense line search dot kernels with reductions

* Remove a download, and download grad norm separately

* Asynchronously check lbfgs convergence condition

* Restructure line search to avoid download waiting

* Start line search preemptively in case CPU evaluation is not needed

* In rare cases, constraint error might not decrease after one optimization round

* Better handling of unsupported 64-bit atomics, use FLT_MAX

* Pick gradient mode based on GPU vs. CPU evaluation

* Rework getDiff/getScale reduction, remove reduceBuffer

* Older CUDA might not like float hex literals

* Fix error in a comment


11 Dec, 2025 1 commit

Add LCPO method (#5130) · adfd84c2

Evan Pretti authored Dec 11, 2025

* Basic LCPO support

* Add basic test for LCPO from a prmtop file

* API for LCPOForce

* Started LCPO reference implementation

* Finished reference forces & test cases

* Use other test for finite difference since grid might have discontinuous forces

* Reference platform formatting

* Initial implementation of CPU platform

* Bugfixes

* More vectorization and improve neighbor list query speed

* Parallelize part of neighbor search

* Check box size for LCPO with periodic boundary conditions

* Fixes for updating parameters in context

* GBSAOBCForce doesn't use first & last indices for updates, so no need for this optimization here

* Changes to neighbor checking and optimization

* Fixes and minor changes

* Add global surface tension parameter

* Only process half of the pairs in the neighbor list

* Remove unnecessary checks

* Initial version of common platform implementation

* Asynchronously download neighbor list size

* Debugging

* Do pair precomputation in copyPairsToNeighborList

* Recompute interactions instead of scanning neighbor list in inner loop

* Condense position array before computations

* Also make neighbor count download asynchronous on device

* Fixes for kernel launching

* Topology-based LCPO parameter assignment

* Fixes, and use test system for LCPO with nucleic acids

* Always raise instead of warn when LCPO parameters can't be assigned

* Use Amber convention for phosphates


23 Sep, 2025 1 commit

Update file headers (#5074) · 05472c9a

Evan Pretti authored Sep 23, 2025

* Replace SimTK-containing file headers

* Update file headers for new Tinker reader files added


12 Sep, 2025 1 commit

Add constant potential method (#4870) · f55abcaa

Evan Pretti authored Sep 12, 2025

* Initial implementation of C++ API

* Add kernel interface and information for API generation

* API updates for updating electrode parameters

* Add serialization proxy for ConstantPotentialForce

* Update file headers

* Add CG error tolerance and fix units on getCharges() return value

* Initial implementation of matrix solver

* Fixes and conjugate gradient solver

* Try to fix Linux and Windows builds

* Make sure charge constraint target is on total charge

* Restore handling of exceptions like NonbondedForce since they won't involve electrode atoms

* Ameliorate numerical instability in constrained conjugate gradient

* Fix uninitialized pointers, memory leak, and style

* Set CG tolerance units in Python API

* Test ConstantPotentialForce serialization

* Read/write ExceptionsUsePeriodicBoundaryConditions as bool

* Improve constrained conjugate gradient robustness to roundoff error accumulation

* Recompute matrix if electrode atoms move due to setPositions()

* Tolerance is now in gradient (potential) units again

* Add neutralizing background correction

* Add Python API tests

* Fixes for CG and nonbonded exceptions

* Add initial tests checking against existing NonbondedForce behavior

* Expand test suite and fix some implementation issues

* Add additional tests using larger reference system

* Add Gaussian test

* Finish test against reference computation

* CPU platform implementation

* Fixes for compilation on some platforms

* Fixes for constant potential with AVX/AVX2

* Test linking CPU PME library to constant potential test directly

* Older SWIG versions don't support Python set to C++ set conversion

* Add user guide entry

* Increase speed of reference test

* Conditional building constant potential CPU test is unreliable

* Debugging

* Miscellaneous fixes and improvements for CI

* Cache charges so solver will not run if system and coordinates have not changed

* Preconditioner flag, stability, and automatic detection improvements

* Add GPU platform-specific constant potential kernel classes

* PME and device-host I/O changes to support constant potential

* Initial common constant potential implementation

* Constant potential fixes:

* Fix preconditioner PME position/charge save/restore logic

* Fix reduction synchronization in constant potential solver kernels

* Add double-float accumulation for conjugate gradient solver when double unsupported by hardware

* Improve conditioning of a test system, and make sure particles are in or out of cutoff for consistency and ease of comparing between platforms

* Reorder guess charges for CG when atom reordering changes positions

* Remove PME queue for now

* Trying to debug optimized direct space derivative kernel

* Remove extraneous debugging lines

* Style updates; just make CPU preconditioner double precision

* Debugging updated optimized direct derivatives kernel for all but OpenCL CPU

* OpenCL CPU implementation of direct space derivatives, and cleanup

* Try to make test even shorter to not time out on CI

* Temporary - Debugging

* Debugging

* Debugging

* Debugging

* Debugging

* Remove debugging code and fix reduction synchronization

* Fix other reductions

* Debugging - are tests hanging or just slow on CI?

* Debugging

* Debugging

* Fix macro for case when double precision is available on hardware

* Remove changes for debugging again

* Try to improve matrix solver cache locality by uploading transpose

* Fixes for atom ordering and periodic images

* Can't rely on reorder listener for cell offset updates

* Test reducing number of contexts and timing for CI

* Debugging

* Remove timing code and revert debugging changes

* Matrix solver and plasma term optimizations

* Reduce CG solver kernel calls and downloads

* Don't read back convergence flag from global memory

* Update PME due to refactoring in master branch

* Faster matrix solver (1st step)

* Faster matrix solver for CUDA

* Faster matrix solver compatibility with non-CUDA platforms

* Matrix solver fixes

* Use warp shuffle reductions when possible

* Attempt to work around intermittent compiler crash in Intel CPU OpenCL

* Optimize CG solver kernel 1

* Rework CG solver so some kernels can use more than 1 block

* Don't run out of shared memory

* Asynchronously download convergence flag while clearing buffers

---------
Co-authored-by: Evan Pretti <pretti@sh03-17n15.int>


10 Sep, 2024 1 commit

Merged parallel code (#4649) · b28d2e66

Peter Eastman authored Sep 10, 2024

* Unified lots of parallel computation code between platforms

* Unified test code between platforms

* Eliminated duplicated timing code


16 Sep, 2023 1 commit

CustomCPPForceImpl for writing forces in C++ (#4231) · 9a0db725

Peter Eastman authored Sep 15, 2023

* Implemented CustomCPPForceImpl

* Documentation for CustomCPPForceImpl

* Attempt at fixing Windows compilation error

* Improved documentation


20 Aug, 2020 1 commit

Fixed range overflow with very large numbers of atoms (#2806) · cdc0789a

peastman authored Aug 20, 2020

* Fixed range overflow with very large numbers of atoms

* More fixes to overflow with large numbers of atoms

* Fix test failures


14 Feb, 2020 1 commit
- Renamed BAOABLangevinIntegrator to LangevinMiddleIntegrator · 65ee8fd7
  Peter Eastman authored Feb 14, 2020
  
21 Oct, 2019 1 commit
- CPU implementation of BAOABLangevinIntegrator · 8ee2c644
  peastman authored Oct 21, 2019
  
26 Jan, 2017 1 commit
- Fixes to dispersion PME · c274f08d
  Peter Eastman authored Jan 26, 2017
  
13 Jan, 2017 1 commit
- Eliminated RealOpenMM type · a783b996
  peastman authored Jan 13, 2017
  
02 May, 2016 1 commit
- Created CPU implementation of GayBerneForce · 6c1d5eb1
  peastman authored May 02, 2016
  
26 Feb, 2016 1 commit
- Fixed a bug in CPU neighbor list with triclinic boxes · a4f7077b
  peastman authored Feb 26, 2016
  
15 Jan, 2016 1 commit
- Loading a checkpoint marks that positions have been set · 2f5bb780
  Peter Eastman authored Jan 14, 2016
  
04 Nov, 2015 1 commit
- Finished implementing CompoundIntegrator · 5fa6fbc1
  Peter Eastman authored Nov 04, 2015
  
24 Sep, 2015 1 commit
- Optimizations to CPU platform · 02825c46
  peastman authored Sep 24, 2015
  
23 Sep, 2015 2 commits
- Fixed compilation error on Windows · 97f5e611
  Peter Eastman authored Sep 23, 2015
  
- Continuing to refactor tests · 2f553a66
  Peter Eastman authored Sep 23, 2015
  
22 Sep, 2015 2 commits
- Continuing to refactor tests · 4ed151e1
  Peter Eastman authored Sep 22, 2015
  
- Continuing to refactor tests · ccd811da
  Peter Eastman authored Sep 22, 2015
  
21 Sep, 2015 1 commit
- Began refactoring of tests to eliminate duplicated code · a5a52dd1
  Peter Eastman authored Sep 21, 2015
  
03 Sep, 2015 1 commit
- Added NonbondedForce::getPMEParametersInContext() · 4c6d48ba
  peastman authored Sep 03, 2015
  
27 Aug, 2015 1 commit
- Python 2/3 compatibility in single code base, plus python 3 testing on travis. · b7088b74
  peastman authored Aug 10, 2015
  
12 Aug, 2015 1 commit
- Show fewer warnings on Visual Studio · 7509c2bf
  Peter Eastman authored Aug 12, 2015
  
07 Jul, 2015 1 commit
- Created test case for force groups having different cutoffs · ab8d97b3
  Peter Eastman authored Jul 07, 2015
  
24 Feb, 2015 1 commit
- Fixed problems running in PNaCl · aa7bd1cf
  peastman authored Feb 24, 2015
  
08 Jan, 2015 1 commit
- Adjusted the tolerance on a test that was failing on some computers · 38acdd6e
  Peter Eastman authored Jan 08, 2015
  
19 Dec, 2014 1 commit
- CPU PME works with triclinic boxes · f6aae604
  peastman authored Dec 18, 2014
  
17 Dec, 2014 1 commit
- Continuing to implement triclinic boxes in CPU platform · 78e5f13a
  peastman authored Dec 16, 2014
  
16 Dec, 2014 1 commit
- CpuNeighborList correctly handles triclinic boxes · 4193a416
  peastman authored Dec 16, 2014
  
10 Dec, 2014 1 commit
- Began implementing triclinic boxes for CPU platform · 4b6f53e5
  peastman authored Dec 10, 2014
  
26 Nov, 2014 1 commit
- Prevent running tests on unsupported CPUs · 59484a6f
  peastman authored Nov 25, 2014
  
13 Oct, 2014 1 commit
- GBSAOBCForce allows the surface area energy to be changed · 34a604b8
  peastman authored Oct 13, 2014
  
06 Oct, 2014 1 commit
- Began creating CPU version of CustomGBForce · 468a8a5c
  peastman authored Oct 06, 2014
  
04 Sep, 2014 1 commit
- Made test cases more robust · c7f629ac
  peastman authored Sep 04, 2014
  
29 Aug, 2014 1 commit
- CustomManyParticleForce offers two different "permutation modes". Implemented it for Reference and CPU platforms. · f5acdd7a
  peastman authored Aug 28, 2014
  
22 Aug, 2014 1 commit
- Optimizations to CUDA version of CustomManyParticleForce · 150d943a
  peastman authored Aug 22, 2014
  
15 Aug, 2014 1 commit
- Finished CPU implementation of CustomManyParticleForce · 14d3c584
  peastman authored Aug 15, 2014
  
14 Aug, 2014 1 commit
- CPU version of CustomManyParticleForce is multithreaded · 0e2ffb4b
  peastman authored Aug 14, 2014
  
25 Jul, 2014 1 commit
- CPU platform works on PNaCl · 0bb293f8
  peastman authored Jul 25, 2014
  