Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
composable_kernel
Commits
702c7445
Commit
702c7445
authored
Sep 07, 2022
by
Po-Yen, Chen
Browse files
Add debug code the verify result
parent
0aef936b
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
12 additions
and
0 deletions
+12
-0
include/ck/tensor_operation/gpu/device/device_permute.hpp
include/ck/tensor_operation/gpu/device/device_permute.hpp
+12
-0
No files found.
include/ck/tensor_operation/gpu/device/device_permute.hpp
View file @
702c7445
...
@@ -142,7 +142,19 @@ struct DevicePermute : detail::DevicePermuteBase<DevicePermute<InDataType,
...
@@ -142,7 +142,19 @@ struct DevicePermute : detail::DevicePermuteBase<DevicePermute<InDataType,
Sequence
<
NumDim
-
1
>
{}),
Sequence
<
NumDim
-
1
>
{}),
make_tuple
(
Sequence
<
0
>
{},
Sequence
<
1
>
{},
Sequence
<
2
>
{}));
make_tuple
(
Sequence
<
0
>
{},
Sequence
<
1
>
{},
Sequence
<
2
>
{}));
#if 0
const index_t N = std::accumulate(begin(lengths), std::prev(end(lengths), 2), index_t{1}, std::multiplies<index_t>{});
const auto desc_m = transform_tensor_descriptor(
desc_n_h_w,
make_tuple(make_merge_transform(make_tuple(N, H, W))),
make_tuple(Sequence<0, 1, 2>{}),
make_tuple(Sequence<0>{})
);
return PadDescriptor_M_1d(desc_m, gridSize, blockSize);
#else
return
PadDescriptor_M_1d
(
desc_n_h_w
,
gridSize
,
blockSize
);
return
PadDescriptor_M_1d
(
desc_n_h_w
,
gridSize
,
blockSize
);
#endif
}
}
using
InGrid1dDesc
=
decltype
(
MakeDescriptor_N_H_W
({
1
,
1
},
{
1
,
1
},
1
,
1
));
using
InGrid1dDesc
=
decltype
(
MakeDescriptor_N_H_W
({
1
,
1
},
{
1
,
1
},
1
,
1
));
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment