gaoqiong / composable_kernel
Commit 119cb7b1 authored Dec 21, 2022 by rocking
Add comment to explain in detail
parent 19c6cb14
Showing 1 changed file with 2 additions and 0 deletions
include/ck/tensor_operation/gpu/device/impl/device_gemm_multiple_d_layernorm_xdl_cshuffle.hpp
@@ -349,6 +349,8 @@ struct DeviceGemmMultipleDLayernorm_Xdl_CShuffle
     template <typename DoPads, index_t MPerTile, index_t NPerTile>
     static auto MakeCountDescriptor_M_N(index_t M, index_t N)
     {
+        // We will broadcast [N] to [M, N] in this descriptor
+        // Hence, 1st stride is 0
         const auto grid_desc_m_n =
             make_naive_tensor_descriptor(make_tuple(M, N), make_tuple(I0, I1));
         return PadTensorDescriptor(grid_desc_m_n, make_tuple(MPerTile, NPerTile), DoPads{});
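The added comment relies on a general property of strided descriptors: a dimension with stride 0 contributes nothing to the memory offset, so every index along it resolves to the same storage. The sketch below illustrates that idea in isolation. It does not use the composable_kernel API; `NaiveDescriptor2D` and `MakeBroadcastDescriptor` are hypothetical names introduced only for this example, and padding is left out.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-in for a naive 2-D tensor descriptor: it maps a logical
// (m, n) coordinate to a linear memory offset using one stride per dimension.
struct NaiveDescriptor2D
{
    std::size_t lengths[2]; // logical shape {M, N}
    std::size_t strides[2]; // elements skipped per step in each dimension

    std::size_t Offset(std::size_t m, std::size_t n) const
    {
        assert(m < lengths[0] && n < lengths[1]); // stay inside the logical shape
        return m * strides[0] + n * strides[1];
    }
};

// Builds an [M, N] view over a buffer that only holds N elements.
// The stride of the M dimension is 0, so the row index never moves the
// offset and every row aliases the same N underlying elements.
inline NaiveDescriptor2D MakeBroadcastDescriptor(std::size_t M, std::size_t N)
{
    return NaiveDescriptor2D{{M, N}, {0, 1}};
}
```

With M = 3 and N = 4, for instance, `Offset(0, 2)` and `Offset(2, 2)` both evaluate to 2, so each logical row reads the same element of the [N] buffer.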