- 05 Sep, 2024 1 commit
-
-
Anton Gorenko authored
* All AMD GPUs support shuffle, double precision and 64-bit int atomics; * Remove unused code: !ENABLE_SHUFFLE code paths in nonbonded.hip; * Use intrinsics in single-precision; * Use realToFixedPoint (faster float32-to-int64); * Remove shared atomIndices, use shuffles; * Check early if atoms are in the cutoff range, sometimes all lanes in a warp can skip computations, single pairs can also skip useless atomics with zero values; * Remove volatile skipTiles access, use shuffles; * Distribute work for warps in a strided order; * Skip warps that may be still busy in the first loop; * Unify conditions for excluded atoms with `includeInteraction`; * Move multiprocessors to HipContext; * Increase number of warps for computeNonbonded; * Disable packed math for >=MI200 (it affects performance of some kernels like computeGKForces of amoebagk); * Remove defaultOptimizationOptions and createModule's optimizationFlags as they are never used; * Support -save-temps.
-
- 01 Sep, 2024 4 commits
-
-
Anton Gorenko authored
* Compile kernels with max block size of 256 threads: The default hipcc behavior since ROCm 4.2 is to compile kernels with 1024 threads unless __launch_bounds__ is specified. This significantly increases register pressure especially in heavy kernels (double precision, for example), requiring register spilling; * Optimize computeRange by using multiple blocks for reduction; * Use blocks of 1024 threads for computeBucketPositions - it is executed as a single work group so larger block size is faster; * Sort up-to lenghtNextPow2 instead of blockDim.x (faster for short buckets); * Optimize sortShortList2; * Optimize sortBuckets with bit instructions; * Decrease bucket size for non-uniform sorting: too many buckets may have sizes too large to sort in shared memory; * Add more sizes in tests.
-
Anton Gorenko authored
* Remove setting of link libraries, include and link dirs and compile flags for each target, instead let Cmake deal with them by linking the main library to hip::host hiprtc::hiprtc hip::hipfft; * Fix: custom command without ADD_CUSTOM_TARGET and ADD_DEPENDENCIES is executed for both static and shared targets; * Remove IF(APPLE) parts.
-
Anton Gorenko authored
Fix SegFault in HipCalcHippoNonbondedForceKernel HipSort was created using a temporary ref. Adding `HipContext& cu` field to HipCalcHippoNonbondedForceKernel fixes the issue; -
Anton Gorenko authored
Port changes in CUDA backend to HIP Fix a warning about arithmetic operations on void* in HipArray::uploadSubArray Fix "Error Initializing context ROCm 5.3.0" https://github.com/StreamHPC/openmm-hip/issues/3 hipDeviceSetCacheConfig returns hipErrorNotSupported on 5.3 Co-authored-by:Nick Curtis <nicholas.curtis@amd.com>
-
- 23 Aug, 2024 2 commits
-
-
Peter Eastman authored
-
Peter Eastman authored
* Support numpy 2 * Debugging * Removed debugging code
-
- 19 Aug, 2024 1 commit
-
-
Peter Eastman authored
-
- 06 Aug, 2024 1 commit
-
-
Peter Eastman authored
* Don't require importlib_metadata * Handle older versions of importlib
-
- 25 Jul, 2024 1 commit
-
-
Timothy Palpant authored
* Fix import of netcdf_file for scipy 1.14 * Fix indentation --------- Co-authored-by:Timothy Palpant <tim@atommapper.com>
-
- 19 Jul, 2024 1 commit
-
-
Peter Eastman authored
* Run Mac tests on ARM and Intel * Added missing environment file * Removed obsolete code for M1 runner * Removed obsolete code for M1 runner
-
- 17 Jul, 2024 1 commit
-
-
Peter Eastman authored
* Debug CI failures * Debugging * Debugging * Debugging * Debugging * Debugging * Debugging * Debugging * Debugging * Removed build that was failing * Fixed URL that had changed
-
- 09 Jul, 2024 1 commit
-
-
Peter Eastman authored
* addHydrogens() allows specifying exactly what hydrogens to add * Prevent CI from using numpy 2.0
-
- 13 May, 2024 1 commit
-
-
Peter Eastman authored
-
- 03 May, 2024 1 commit
-
-
Matt Thompson authored
* Allow multiple registrations of the same atom type if definitions identical * Different short-circuiting logic
-
- 29 Apr, 2024 3 commits
-
-
Peter Eastman authored
-
Peter Eastman authored
* DrudeForce supports periodic boundary conditions * Fixed uninitialized memory
-
FloLangenfeld authored
Co-authored-by:FloLangenfeld <florent.langenfeld@peptinov.fr>
-
- 10 Apr, 2024 1 commit
-
-
Marc Schuh authored
* added type checking for Simulation.step() * changed how to check if step is an integer number
-
- 09 Apr, 2024 1 commit
-
-
Peter Eastman authored
-
- 06 Apr, 2024 2 commits
-
-
Peter Eastman authored
* ATMForce reorders inner contexts for better performance * Fixed obsolete comments
-
Peter Eastman authored
-
- 05 Apr, 2024 1 commit
-
-
Peter Eastman authored
-
- 28 Mar, 2024 2 commits
-
-
Peter Eastman authored
-
Raul authored
* Avoid overflow in large XTC files * Also cast box indices to size_t
-
- 21 Mar, 2024 1 commit
-
-
Peter Eastman authored
* Created DebuggingReporter class * Fixed description
-
- 18 Mar, 2024 1 commit
-
-
Matt Thompson authored
* Update references to "openmmforcefields" project * Org name, too
-
- 09 Mar, 2024 1 commit
-
-
Peter Eastman authored
* Revised code from installing CUDA on CI * Debugging * Debugging * Debugging * Debugging * Debugging * Debugging * Debugging
-
- 08 Mar, 2024 2 commits
-
-
ZangRuochen authored
LoongArch is a new architecture, already supported by linux-6.1, gcc-12. Signed-off-by:
Zang Ruochen <zangruochen@loongson.cn> Co-authored-by:
Zang Ruochen <zangruochen@loongson.cn>
-
Peter Eastman authored
-
- 05 Mar, 2024 2 commits
-
-
Stefan Doerr authored
-
Miguel Dias Costa authored
-
- 04 Mar, 2024 1 commit
-
-
Peter Eastman authored
-
- 24 Feb, 2024 1 commit
-
-
Peter Eastman authored
* Minor optimization to validating exclusions * Optimizations to findMoleculeGroups()
-
- 23 Feb, 2024 1 commit
-
-
Peter Eastman authored
* Improved performance of CustomHbondForce on large systems * Fixed CUDA compilation errors
-
- 17 Feb, 2024 1 commit
-
-
Peter Eastman authored
* Made LangevinIntegrator identical to LangevinMiddleIntegrator * Removed unused code * VariableLangevinIntegrator uses LFMiddle
-
- 13 Feb, 2024 1 commit
-
-
Peter Eastman authored
* Can use getPlatform() instead of getPlatformByName() * More concise arguments for getState()
-
- 02 Feb, 2024 1 commit
-
-
Peter Eastman authored
* Reference platform supports nested virtual sites * Common platform supports nested virtual sites * Fixed force distribution from nested virtual sites * Fixed test failures
-
- 26 Jan, 2024 1 commit
-
-
Emilio Gallicchio authored
* Improve numerical stability for edge cases * Undo checking of derivative factors * Undo checking for NaNs
-
- 24 Jan, 2024 1 commit
-
-
Daniel R. Roe authored
-