- 06 May, 2026 2 commits
-
-
one authored
-
one authored
  - Use bitwise prefix accounting when storing sparse interactions as single pairs in the HIP pair-list kernel. This reduces the number of ballot operations needed to compute per-lane single-pair offsets.
  - For HIP CDNA single precision, raise MAX_BITS_FOR_PAIRS to 8 so more sparse interactions are emitted as single pairs instead of full tiles. Keep the existing double precision and RDNA thresholds unchanged.
  - Also simplify the HIP LJPME direct correction by computing alpha^2*r2
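The commit message does not show the kernel code, so the following is only a host-side sketch of one plausible reading of "bitwise prefix accounting". It assumes each lane holds a small pair count and that the exclusive prefix sum over lanes is built from one ballot per bit of that count. The names `popcount64`, `ballotBit`, and `exclusiveOffsets` are hypothetical, and the ballot is simulated with a loop rather than a real warp intrinsic.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Portable population count (number of set bits).
static int popcount64(std::uint64_t x) {
    int n = 0;
    while (x) { x &= x - 1; ++n; }
    return n;
}

// Host-side stand-in for a warp ballot (assumes at most 64 lanes):
// bit i of the result is set when lane i's count has bit `bit` set.
static std::uint64_t ballotBit(const std::vector<unsigned>& counts, int bit) {
    std::uint64_t mask = 0;
    for (std::size_t lane = 0; lane < counts.size(); ++lane)
        if ((counts[lane] >> bit) & 1u)
            mask |= std::uint64_t{1} << lane;
    return mask;
}

// Exclusive prefix sum of per-lane counts, using one ballot per bit of the
// count instead of one ballot per possible pair.
std::vector<unsigned> exclusiveOffsets(const std::vector<unsigned>& counts, int numBits) {
    std::vector<unsigned> offsets(counts.size(), 0);
    for (int bit = 0; bit < numBits; ++bit) {
        const std::uint64_t ballot = ballotBit(counts, bit);
        for (std::size_t lane = 0; lane < counts.size(); ++lane) {
            // Keep only the lanes strictly below this one.
            const std::uint64_t below = ballot & ((std::uint64_t{1} << lane) - 1);
            offsets[lane] += static_cast<unsigned>(popcount64(below)) << bit;
        }
    }
    return offsets;
}
```

With counts that fit in `numBits` bits, this needs `numBits` ballots regardless of how many pairs each lane stores, which is the kind of saving the commit message describes.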
-
- 29 Apr, 2026 1 commit
-
-
one authored
-
- 24 Apr, 2026 1 commit
-
-
one authored
-
- 17 Apr, 2026 2 commits
- 16 Apr, 2026 4 commits
- 10 Apr, 2026 1 commit
-
-
Peter Eastman authored
* Created ReplicaExchangeSampler
* Improvements to ReplicaExchangeSampler
-
- 07 Apr, 2026 1 commit
-
-
Peter Eastman authored
-
- 06 Apr, 2026 2 commits
-
-
Peter Eastman authored
* PythonForce can be restricted to a subset of particles
* Fix exception with CUDA
-
Peter Eastman authored
-
- 02 Apr, 2026 2 commits
-
-
Peter Eastman authored
-
Peter Eastman authored
* Infrastructure for multistate sampling
* Added computeRelativeEnergies()
-
- 31 Mar, 2026 1 commit
-
-
ramdoys authored
* optimization, switch to unordered map
* Remove emplace, reduce reserve allocation
-
- 30 Mar, 2026 1 commit
-
-
Peter Eastman authored
-
- 27 Mar, 2026 2 commits
-
-
Evan Pretti authored
-
Peter Eastman authored
* Cache coefficients for long range correction
* updateParametersInContext() clears cache
-
- 26 Mar, 2026 1 commit
-
-
Jeff Wagner authored
* fix 5234 and add test
* clean up docstring and standardize test name
-
- 12 Mar, 2026 1 commit
-
-
Peter Eastman authored
-
- 05 Mar, 2026 1 commit
-
-
Peter Eastman authored
-
- 26 Feb, 2026 1 commit
-
-
Peter Eastman authored
-
- 24 Feb, 2026 1 commit
-
-
Peter Eastman authored
-
- 19 Feb, 2026 1 commit
-
-
Peter Eastman authored
* Fixed issue that caused inefficient sorting when a block contained only one atom
* Add the fix to OpenCL and HIP
-
- 17 Feb, 2026 1 commit
-
-
Peter Eastman authored
* Added two more synonyms for HOH
* Change H20 to H2O
-
- 16 Feb, 2026 1 commit
-
-
Yulian Manchev authored
* Update patch documentation
  Clarified the definition and purpose of patches.
* Fix typo in RemoveExternalBond tag description
* Update wording in patches
* A few edits to the description of patches
---------
Co-authored-by: Peter Eastman <peter.eastman@gmail.com>
-
- 11 Feb, 2026 1 commit
-
-
Peter Eastman authored
-
- 10 Feb, 2026 3 commits
-
-
Peter Eastman authored
-
Peter Eastman authored
-
Evan Pretti authored
* Make reference/CPU minimizer into a kernel
* Add per-platform support for GPU minimization
* Initial implementation of GPU minimization
* Fixes
* Increase robustness when initial gradient is huge
* Handle overflow leading to non-finite values gracefully
* Handle large forces in single precision more robustly
* Optimize kernels
* Fix kernel launch size
* Update banner years
* Don't create MinimizeKernel until first minimization requested
* Make some compile-time constants into kernel arguments
* Consolidate scale calculation kernel
* Condense alpha/beta reduction kernels using atomics
* Condense line search dot kernels with reductions
* Remove a download, and download grad norm separately
* Asynchronously check lbfgs convergence condition
* Restructure line search to avoid download waiting
* Start line search preemptively in case CPU evaluation is not needed
* In rare cases, constraint error might not decrease after one optimization round
* Better handling of unsupported 64-bit atomics, use FLT_MAX
* Pick gradient mode based on GPU vs. CPU evaluation
* Rework getDiff/getScale reduction, remove reduceBuffer
* Older CUDA might not like float hex literals
* Fix error in a comment
-
- 09 Feb, 2026 2 commits
-
-
Peter Eastman authored
* Residue templates can specify constraints
* Patched template generation preserves constraints
-
Peter Eastman authored
* API for querying devices
* CUDA and HIP implementations of getDevices()
* Fix test failures
* Fix test failures
* CUDA returns correct devices even if no context has been created
* Return a single device for Reference and CPU
* Fix CI failure
-
- 30 Jan, 2026 1 commit
-
-
Peter Eastman authored
* Templates can match whole molecules
* addExtraParticles() supports molecule templates
* Documentation on molecule templates
* Bug fix
-
- 14 Jan, 2026 1 commit
-
-
Peter Eastman authored
-
- 08 Jan, 2026 1 commit
-
-
Peter Eastman authored
-
- 30 Dec, 2025 1 commit
-
-
Christian Clauss authored
-
- 14 Dec, 2025 1 commit
-
-
Anton Gorenko authored
* Remove std::enable_if, warpRotateLeft is always used with TILE_SIZE
* Do not use built-in warpSize in constexpr contexts
  Starting from ROCm 7 warpSize is no longer constexpr. findInteractingBlocks.hip uses it for sizes of __shared__ arrays.
* Check if hipHostMallocNumaUser is allowed before using it
-
- 13 Dec, 2025 1 commit
-
-
Anton Gorenko authored
adaptFrictionPart1 is executed first and reads the randomForce and segmentVelocity buffers before any other kernel writes them, so they can contain garbage after allocation (cuMemAlloc/hipMalloc/clCreateBuffer do not clear allocated memory).
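The commit's actual fix is not shown here. As a host-side analogy only, the sketch below shows the usual remedy for this class of bug: clear a buffer explicitly right after allocating it, because the allocator does not. The helper `allocateZeroed` is hypothetical, and `std::malloc` stands in for the GPU allocators named above; on a device the clearing step would typically be a call such as `cuMemsetD8`, `hipMemset`, or `clEnqueueFillBuffer`.

```cpp
#include <cstddef>
#include <cstdlib>
#include <cstring>

// Allocate `count` floats and clear them before first use.
// std::malloc, like the GPU allocators, returns memory with unspecified contents.
float* allocateZeroed(std::size_t count) {
    float* buffer = static_cast<float*>(std::malloc(count * sizeof(float)));
    if (buffer != nullptr) {
        // All-zero bytes represent 0.0f for IEEE-754 floats.
        std::memset(buffer, 0, count * sizeof(float));
    }
    return buffer;  // Caller releases the buffer with std::free.
}
```

Clearing once at allocation time means the first kernel to read the buffer sees defined values, whichever kernel happens to run first.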
-