Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
Faiss
Commits
a49479f4
Commit
a49479f4
authored
Sep 07, 2022
by
Your Name
Browse files
增加warp +=情况下的显式sync
parent
847585df
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
2 additions
and
2 deletions
+2
-2
.test_gpu.sh.swp
.test_gpu.sh.swp
+0
-0
faiss/gpu/impl/PQScanMultiPassNoPrecomputed-inl.cuh
faiss/gpu/impl/PQScanMultiPassNoPrecomputed-inl.cuh
+2
-2
No files found.
.test_gpu.sh.swp
0 → 100644
View file @
a49479f4
File added
faiss/gpu/impl/PQScanMultiPassNoPrecomputed-inl.cuh
View file @
a49479f4
...
@@ -248,13 +248,13 @@ __global__ void pqScanNoPrecomputedMultiPass(
...
@@ -248,13 +248,13 @@ __global__ void pqScanNoPrecomputedMultiPass(
#pragma unroll
#pragma unroll
for
(
int
byte
=
0
;
byte
<
kBytesPerCode32
;
++
byte
)
{
for
(
int
byte
=
0
;
byte
<
kBytesPerCode32
;
++
byte
)
{
auto
code
=
getByte
(
code32
[
word
],
byte
*
8
,
8
);
auto
code
=
getByte
(
code32
[
word
],
byte
*
8
,
8
);
// HC Debug
__syncthreads
();
auto
offset
=
codesPerSubQuantizer
*
auto
offset
=
codesPerSubQuantizer
*
(
word
*
kBytesPerCode32
+
byte
);
(
word
*
kBytesPerCode32
+
byte
);
dist
+=
ConvertTo
<
float
>::
to
(
codeDist
[
offset
+
code
]);
dist
+=
ConvertTo
<
float
>::
to
(
codeDist
[
offset
+
code
]);
// HC Debug explicit sync
__syncthreads
();
}
}
}
}
}
}
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment