Commit cc8aee3b authored by Po-Yen, Chen

Use faster device op config for example '1xHxW_fp16'

parent c1f605b9
@@ -12,7 +12,7 @@ using DevicePermuteInstance = ck::tensor_operation::device::DevicePermute
 // ######| Type| Type| Operation| | Size| Block| Block| Block| LdsExtraW| ThreadClusterLengths| ThreadClusterArrangeOrder| VectorDim| VectorDim| ScalarPerVector| ScalarPerVector|
 // ######| | | | | | | | | | | | | | | |
 // ######| | | | | | | | | | | | | | | |
-< ADataType, BDataType, PassThrough, 3, 256, 16, 32, 32, 0, S<1, 16, 16>, S<0, 1, 2>, 2, 1, 1, 1>;
+< ADataType, BDataType, PassThrough, 3, 256, 1, 32, 32, 3, S<1, 32, 8>, S<0, 1, 2>, 2, 1, 2, 1>;
 // clang-format on
 #include "run_permute_example.inc"
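
Note on the changed parameters: in both the old and the new configuration, the entries of ThreadClusterLengths multiply to 256, the value in the "Size" column (1 * 16 * 16 and 1 * 32 * 8). The sketch below checks that arithmetic at compile time. It is only an illustration: `S` and `Product` here are hypothetical stand-ins, not the library's own types, and reading the "Size" column as the thread-block size is an assumption that the diff itself does not confirm.

```cpp
#include <cstddef>

// Hypothetical stand-in for the compile-time sequence `S<...>` used in the diff;
// this is NOT the library's definition, just enough to carry the numbers.
template <std::size_t... Is>
struct S
{
};

// Product of all entries of a sequence, computed recursively at compile time.
template <typename Seq>
struct Product;

template <>
struct Product<S<>>
{
    static constexpr std::size_t value = 1; // empty product
};

template <std::size_t I, std::size_t... Is>
struct Product<S<I, Is...>>
{
    static constexpr std::size_t value = I * Product<S<Is...>>::value;
};

// Value of the "Size" column in both versions (assumed to be the block size).
constexpr std::size_t kBlockSize = 256;

using OldThreadClusterLengths = S<1, 16, 16>; // before this commit
using NewThreadClusterLengths = S<1, 32, 8>;  // after this commit

static_assert(Product<OldThreadClusterLengths>::value == kBlockSize,
              "old thread cluster does not cover the block");
static_assert(Product<NewThreadClusterLengths>::value == kBlockSize,
              "new thread cluster does not cover the block");
```

If a candidate configuration changed ThreadClusterLengths without keeping this product equal to the "Size" value, the `static_assert` would fail at compile time, which makes it a cheap sanity check when trying other tunings.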