- 22 Jul, 2022 1 commit
-
-
Adel Johar authored
* Support kernel files with extensions of any length (like .hip)
* Do not allow replacing symbols in single-line comments
* Add OPENMM_BUILD_COMMON CMake option
  It allows building and installing the common platform files even if the CUDA or OpenCL platforms are not built. This is required for the HIP platform (openmm-hip) if ROCm OpenCL packages are not installed.
* Add an option for the Python wrapper to install into user packages
  OPENMM_PYTHON_USER_INSTALL is OFF by default.
* Support FFT backends in Amoeba plugin
  The HIP platform supports FFT backends; this commit moves findLegalFFTDimension to ComputeContext so that platforms can have their own implementations.
* Compatibility for common platform w/ new HIP platform
* Do not use volatile with private and local AtomData parameters on HIP
  The generated code is not optimal; for example, the compiler generates flat_load instructions instead of ds_read.
* Tune launch bounds for PME grid-related kernels and add a workaround for RDNA
  Force the compiler to use all registers for gridSpreadCharge and gridInterpolateForce by limiting max waves per EU to 1 on CDNA GPUs; RDNA GPUs work better without it.
* Optimize atom data structs in GBSA and Amoeba on HIP
  Manually rearrange fields, add padding, and force alignment for faster access to shared memory: ds_read and ds_write may be slower if addresses are not aligned to 16 bytes.

Co-authored-by: Anton Gorenko <anton@streamhpc.com>
Co-authored-by: Nick Curtis <nicholas.curtis@amd.com>
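The rule about not replacing symbols in single-line comments can be illustrated with a short Python sketch. This is not the code from the commit: the function name `replace_symbols` and the sample kernel line are hypothetical, and the sketch assumes `//` never appears inside a string literal and ignores block comments.

```python
import re

def replace_symbols(source, replacements):
    """Replace whole-word symbols in code, leaving '//' comments untouched.

    Hypothetical helper for illustration only; it does not handle '//'
    inside string literals or /* ... */ block comments.
    """
    result = []
    for line in source.split("\n"):
        # Everything after the first '//' is treated as a comment.
        code, sep, comment = line.partition("//")
        for name, value in replacements.items():
            pattern = r"\b" + re.escape(name) + r"\b"
            # A function replacement avoids backslash interpretation in value.
            code = re.sub(pattern, lambda m, v=value: v, code)
        result.append(code + sep + comment)
    return "\n".join(result)

kernel = "real x = PADDING; // PADDING is replaced at compile time"
print(replace_symbols(kernel, {"PADDING": "4"}))
```

Only the occurrence in the code part of the line is substituted; the comment text is passed through unchanged.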
-
- 07 Mar, 2022 1 commit
-
-
Anton Gorenko authored
It allows the HIP platform to use a faster float-to-int64 conversion.
-
- 22 May, 2021 1 commit
-
-
Peter Eastman authored
* Began converting AMOEBA to common platform
* Beginning of OpenCL platform for AMOEBA
* Converted AmoebaVdwForce to common platform
* Cleaned up reference AMOEBA tests
* Began converting AmoebaMultipoleForce to common platform
* Continue converting AmoebaMultipoleForce to common platform
* Bug fixes
* Bug fix
* Continue converting AmoebaMultipoleForce to common platform
* Converting AmoebaMultipoleForce and AmoebaGeneralizedKirkwoodForce to common platform
* Converting AmoebaMultipoleForce and AmoebaGeneralizedKirkwoodForce to common platform
* Creating OpenCL version of AmoebaMultipoleForce and AmoebaGeneralizedKirkwoodForce
* Creating OpenCL version of AmoebaMultipoleForce and AmoebaGeneralizedKirkwoodForce
* Creating OpenCL version of AmoebaMultipoleForce and AmoebaGeneralizedKirkwoodForce
* Converted arrays from real3 to real
* Bug fix to OpenCL AmoebaGeneralizedKirkwoodForce
* Fixes for AMD GPUs
* Began converting HippoNonbondedForce to common platform
* Continuing to convert HippoNonbondedForce to common platform
* Continuing to convert HippoNonbondedForce to common platform
* Working on unifying PME kernels
* Fixed error on devices without 64 bit atomics
* Unified PME kernels
* Converted HippoNonbondedForce to common platform
* Creating OpenCL implementation of HippoNonbondedForce
* Continuing OpenCL implementation of HippoNonbondedForce
* Mostly finished OpenCL implementation of HippoNonbondedForce
* Eliminated three component vector types in host code
* Fix errors on CPU OpenCL
* Skip double precision tests for AMOEBA on OpenCL
* Bug fixes
* Bug fixes
* Fixed compilation error
-
- 13 Jul, 2018 1 commit
-
-
Peter Eastman authored
-
- 29 May, 2018 1 commit
-
-
peastman authored
-
- 06 Feb, 2018 1 commit
-
-
peastman authored
-
- 02 Feb, 2018 1 commit
-
-
Peter Eastman authored
-
- 23 Feb, 2017 1 commit
-
-
peastman authored
-
- 08 Feb, 2017 1 commit
-
-
Peter Eastman authored
-
- 13 May, 2016 1 commit
-
-
Peter Eastman authored
-
- 11 May, 2016 1 commit
-
-
Peter Eastman authored
-
- 11 Dec, 2015 1 commit
-
-
peastman authored
-
- 05 Nov, 2015 1 commit
-
-
Peter Eastman authored
-
- 02 Oct, 2015 1 commit
-
-
Peter Eastman authored
-
- 15 Apr, 2015 1 commit
-
-
Peter Eastman authored
-
- 03 Apr, 2015 3 commits
-
-
Peter Eastman authored
-
peastman authored
-
Peter Eastman authored
-
- 14 Jan, 2015 1 commit
-
-
Peter Eastman authored
-
- 08 Jan, 2015 1 commit
-
-
Peter Eastman authored
-
- 27 Jun, 2013 1 commit
-
-
peastman authored
-
- 22 Mar, 2013 1 commit
-
-
Peter Eastman authored
-
- 14 Dec, 2012 1 commit
-
-
Peter Eastman authored
When converting to fixed point, multiply by 0x100000000 instead of 0xFFFFFFFF. This should be (very very slightly) more accurate, since its reciprocal can be exactly represented in floating point.
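The accuracy argument in this message can be checked directly. The following is a minimal Python sketch, not the kernel code touched by the commit; the names `reciprocal_is_exact`, `to_fixed`, and `from_fixed` are hypothetical. It compares the double nearest to 1/scale against the exact rational 1/scale for both multipliers.

```python
from fractions import Fraction

OLD_SCALE = 0xFFFFFFFF   # 2**32 - 1, the previous multiplier
NEW_SCALE = 0x100000000  # 2**32, the multiplier after this change

def reciprocal_is_exact(scale):
    """True if the double nearest to 1/scale equals 1/scale exactly."""
    # Fraction(float) yields the exact rational value stored in the double.
    return Fraction(1.0 / scale) == Fraction(1, scale)

def to_fixed(x, scale):
    """Convert a real value to a fixed-point integer (truncating)."""
    return int(x * scale)

def from_fixed(n, scale):
    """Convert back by multiplying with the floating-point reciprocal."""
    return n * (1.0 / scale)

print(reciprocal_is_exact(NEW_SCALE))  # True: 1/2**32 is a power of two
print(reciprocal_is_exact(OLD_SCALE))  # False: 1/(2**32 - 1) is not dyadic
print(from_fixed(to_fixed(0.375, NEW_SCALE), NEW_SCALE))  # 0.375
```

Because 2**32 is a power of two, scaling by it and by its reciprocal only changes the exponent of a floating-point value, so the conversion back from fixed point adds no rounding error of its own.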
-
- 17 Oct, 2012 1 commit
-
-
Peter Eastman authored
-
- 23 May, 2012 1 commit
-
-
Peter Eastman authored
-
- 13 Dec, 2011 1 commit
-
-
Peter Eastman authored
-
- 07 Oct, 2011 1 commit
-
-
Peter Eastman authored
-
- 09 Sep, 2011 1 commit
-
-
Peter Eastman authored
-
- 08 Sep, 2011 1 commit
-
-
Peter Eastman authored
-
- 05 Jul, 2011 1 commit
-
-
Peter Eastman authored
-
- 27 May, 2011 1 commit
-
-
Peter Eastman authored
-
- 03 Sep, 2010 1 commit
-
-
Peter Eastman authored
-
- 20 Jul, 2010 1 commit
-
-
Peter Eastman authored
-
- 10 Jun, 2010 1 commit
-
-
Peter Eastman authored
-
- 09 Jun, 2010 1 commit
-
-
Peter Eastman authored
-
- 07 May, 2010 1 commit
-
-
Peter Eastman authored
-
- 26 Apr, 2010 1 commit
-
-
Peter Eastman authored
-
- 07 Apr, 2010 1 commit
-
-
Peter Eastman authored
-
- 04 Mar, 2010 1 commit
-
-
Peter Eastman authored
-
- 26 Jan, 2010 1 commit
-
-
Peter Eastman authored
-