1. 31 Oct, 2024 1 commit
  2. 23 Oct, 2024 1 commit
  3. 16 Oct, 2024 3 commits
  4. 09 Oct, 2024 1 commit
  5. 04 Oct, 2024 1 commit
  6. 02 Oct, 2024 2 commits
    • Marc Schuh's avatar
      Increase describeNextReport readability and allow requesting more values (#4671) · 693756ba
      Marc Schuh authored
      * added type checking for Simulation.step()
      
      * changed how to check if step is an integer number
      
      * allow for dicts to be returned from Reporter.describeNextReport
      remove deprecated getState parameters ( #4437 )
      
      * convert old format into new format
      
      * update docstring
      
      * nested set comprehension to set.union
      
      * Allow 'periodic':None
      update describeNextReport in all occurrences in the code
      
      * debug
      
      * update documentation
      
      * add a reporter for energyParameterDerivative
      
      * Revert "add a reporter for energyParameterDerivative"
      
      This reverts commit 1d44dc3f60153defb6252ab56a3b85350fa24826.
      
      * Edit documentation
      693756ba
    • Peter Eastman's avatar
      e370c346
  7. 01 Oct, 2024 2 commits
  8. 27 Sep, 2024 1 commit
  9. 24 Sep, 2024 1 commit
  10. 23 Sep, 2024 1 commit
    • Anton Gorenko's avatar
      Optimize PME spread charge kernel (#4633) · 8ea42950
      Anton Gorenko authored
      * PME_ORDER threads process one atom;
      * PME_ORDER threads access consecutive addresses;
      * No need to permute z indices with zindexTable;
      * finishSpreadCharge is needed only with fixed point charge spreading;
      8ea42950
  11. 18 Sep, 2024 1 commit
  12. 13 Sep, 2024 1 commit
  13. 10 Sep, 2024 3 commits
  14. 06 Sep, 2024 1 commit
    • Peter Eastman's avatar
      Optimize updateParametersInContext() (#4610) · 78902bed
      Peter Eastman authored
      * Optimize CustomNonbondedForce.updateParametersInContext()
      
      * Optimized uploading changed values to GPU
      
      * Optimized updateParametersInContext() for lots of bonded forces
      
      * Optimized updateParametersInContext() for CustomExternalForce
      
      * Optimized updateParametersInContext() for NonbondedForce
      
      * Code changes for HIP platform
      78902bed
  15. 05 Sep, 2024 16 commits
  16. 04 Sep, 2024 2 commits
  17. 02 Sep, 2024 1 commit
  18. 01 Sep, 2024 1 commit
    • Anton Gorenko's avatar
      Optimize sorting kernels and tune block sizes · 7279c539
      Anton Gorenko authored
      * Compile kernels with max block size of 256 threads:
        The default hipcc behavior since ROCm 4.2 is to compile kernels
        with 1024 threads unless __launch_bounds__ is specified. This
        significantly increases register pressure especially in heavy kernels
        (double precision, for example), requiring register spilling;
      * Optimize computeRange by using multiple blocks for reduction;
      * Use blocks of 1024 threads for computeBucketPositions - it is executed
        as a single work group so larger block size is faster;
      * Sort up-to lenghtNextPow2 instead of blockDim.x (faster for short
        buckets);
      * Optimize sortShortList2;
      * Optimize sortBuckets with bit instructions;
      * Decrease bucket size for non-uniform sorting: too many buckets may
        have sizes too large to sort in shared memory;
      * Add more sizes in tests.
      7279c539