Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
tsoc
openmm
Commits
16f10eff
"csrc/gfx93/decode/sparse_fp8/splitkv_mla.cuh" did not exist on "620f8769a112358b4fb4be170901d55687d26388"
Commit
16f10eff
authored
Apr 10, 2026
by
one
Browse files
tune computeNonbonded launch params
parent
9c6732f0
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
4 additions
and
5 deletions
+4
-5
platforms/hip/src/HipNonbondedUtilities.cpp
platforms/hip/src/HipNonbondedUtilities.cpp
+4
-5
No files found.
platforms/hip/src/HipNonbondedUtilities.cpp
View file @
16f10eff
...
@@ -65,16 +65,15 @@ HipNonbondedUtilities::HipNonbondedUtilities(HipContext& context) : context(cont
...
@@ -65,16 +65,15 @@ HipNonbondedUtilities::HipNonbondedUtilities(HipContext& context) : context(cont
string
errorMessage
=
"Error initializing nonbonded utilities"
;
string
errorMessage
=
"Error initializing nonbonded utilities"
;
CHECK_RESULT
(
hipEventCreateWithFlags
(
&
downloadCountEvent
,
context
.
getEventFlags
()));
CHECK_RESULT
(
hipEventCreateWithFlags
(
&
downloadCountEvent
,
context
.
getEventFlags
()));
CHECK_RESULT
(
hipHostMalloc
((
void
**
)
&
pinnedCountBuffer
,
2
*
sizeof
(
unsigned
int
),
context
.
getHostMallocFlags
()));
CHECK_RESULT
(
hipHostMalloc
((
void
**
)
&
pinnedCountBuffer
,
2
*
sizeof
(
unsigned
int
),
context
.
getHostMallocFlags
()));
numForceThreadBlocks
=
5
*
4
*
context
.
getMultiprocessors
();
numForceThreadBlocks
=
16
*
4
*
context
.
getMultiprocessors
();
forceThreadBlockSize
=
6
4
;
forceThreadBlockSize
=
25
6
;
findInteractingBlocksThreadBlockSize
=
context
.
getSIMDWidth
()
;
findInteractingBlocksThreadBlockSize
=
128
;
// When building the neighbor list, we can optionally use large blocks (32 * warpSize atoms) to
// When building the neighbor list, we can optionally use large blocks (32 * warpSize atoms) to
// accelerate the process. This makes building the neighbor list faster, but it prevents
// accelerate the process. This makes building the neighbor list faster, but it prevents
// us from sorting atom blocks by size, which leads to a slightly less efficient neighbor
// us from sorting atom blocks by size, which leads to a slightly less efficient neighbor
// list. We guess based on system size which will be faster.
// list. We guess based on system size which will be faster.
useLargeBlocks
=
false
;
useLargeBlocks
=
(
context
.
getNumAtoms
()
>
90000
);
setKernelSource
(
HipKernelSources
::
nonbonded
);
setKernelSource
(
HipKernelSources
::
nonbonded
);
}
}
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment