gaoqiong / composable_kernel
Commit b90cccf7, authored Jan 06, 2021 by Chao Liu

update threadwise copy (for static tensor descriptor) for updated buffer_load oob

parent f8a869a6
Showing 1 changed file with 0 additions and 5 deletions

composable_kernel/include/tensor_operation/threadwise_generic_tensor_slice_copy.hpp (+0 -5)
@@ -93,11 +93,6 @@ struct ThreadwiseGenericTensorSliceCopy_v4r2
             // buffer to hold a src long-vector
             SrcData p_src_long_vector[long_vector_size];
 
-#if 1
-            // zero out buffer
-            static_for<0, long_vector_size, 1>{}([&](auto i) { p_src_long_vector[i] = 0; });
-#endif
-
             // load data from src to the long-vector buffer
             static_for<0, long_vector_size / src_data_per_access, 1>{}([&](auto i) {
                 auto scalar_id = make_zero_multi_index<nDim>();
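The hunk above drops the explicit zeroing of the long-vector buffer. One plausible reading, not stated in the commit itself, is that the updated out-of-bounds handling makes the load return zero for invalid elements, so every buffer slot is written by the load and pre-zeroing becomes redundant. The sketch below illustrates that idea in plain host C++. `load_or_zero`, `copy_with_prezero`, and `copy_without_prezero` are hypothetical names for illustration and are not part of composable_kernel.

```cpp
#include <array>
#include <cstddef>

// Hypothetical stand-in for a bounds-checked load: returns 0 when the index
// is out of range. This behavior is an assumption, not taken from the commit.
template <typename T>
T load_or_zero(const T* src, std::size_t src_size, std::size_t idx)
{
    return idx < src_size ? src[idx] : T{0};
}

// Old pattern: zero the buffer first, then overwrite only the valid elements.
template <typename T, std::size_t N>
std::array<T, N> copy_with_prezero(const T* src, std::size_t src_size, std::size_t offset)
{
    std::array<T, N> buf;
    for(std::size_t i = 0; i < N; ++i)
        buf[i] = T{0};
    for(std::size_t i = 0; i < N; ++i)
        if(offset + i < src_size)
            buf[i] = src[offset + i];
    return buf;
}

// New pattern: the load itself yields 0 for invalid elements, so every slot
// is written exactly once and no separate zeroing pass is needed.
template <typename T, std::size_t N>
std::array<T, N> copy_without_prezero(const T* src, std::size_t src_size, std::size_t offset)
{
    std::array<T, N> buf;
    for(std::size_t i = 0; i < N; ++i)
        buf[i] = load_or_zero(src, src_size, offset + i);
    return buf;
}
```

Under that assumption both functions produce the same buffer, which is why the zeroing pass can be removed without changing the result.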