tsoc / openmm
Commit bf7efca6, authored Jul 24, 2015 by peastman
Merge pull request #1051 from peastman/ffterror
Fixed potential compilation error in CUDA FFT
Parents: 5b074f80, 3fd5e2c5
Showing 1 changed file with 1 addition and 1 deletion
platforms/cuda/src/CudaFFT3D.cpp
@@ -169,7 +169,7 @@ static int getSmallestRadix(int size) {
 }
 
 CUfunction CudaFFT3D::createKernel(int xsize, int ysize, int zsize, int& threads, int axis, bool forward, bool inputIsReal) {
-    int maxThreads = 256;
+    int maxThreads = (context.getUseDoublePrecision() ? 128 : 256);
 //    while (maxThreads > 128 && maxThreads-64 >= zsize)
 //        maxThreads -= 64;
     int threadsPerBlock = zsize/getSmallestRadix(zsize);
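The one-line change can be read in isolation as a precision-dependent thread cap. The sketch below is illustrative only: `FakeContext` and `chooseMaxThreads` are hypothetical names that are not part of OpenMM, and only the `getUseDoublePrecision() ? 128 : 256` expression is taken from the diff. The commit message says only that this fixes a potential compilation error; any reason why double precision needs the lower cap is not stated in the source.

```cpp
#include <cassert>

// Hypothetical stand-in for the CUDA context; only the precision query
// used in the diff is modeled here.
struct FakeContext {
    bool useDoublePrecision;
    bool getUseDoublePrecision() const { return useDoublePrecision; }
};

// Mirrors the changed line: cap threads at 128 in double precision,
// and at 256 otherwise (the previous unconditional value).
int chooseMaxThreads(const FakeContext& context) {
    return context.getUseDoublePrecision() ? 128 : 256;
}

// Small self-check exercising both branches.
void checkThreadCaps() {
    assert(chooseMaxThreads(FakeContext{false}) == 256);
    assert(chooseMaxThreads(FakeContext{true}) == 128);
}
```

Calling `checkThreadCaps()` from a test harness exercises both branches. In the real code the resulting value is a local in `CudaFFT3D::createKernel`, not a separate helper.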