gaoqiong / composable_kernel
Commit 83f75313, authored May 18, 2022 by rocking
Make the grid size a multiple of the number of CUs
Parent: b7a82d29
Showing 1 changed file with 1 addition and 1 deletion
include/ck/tensor_operation/gpu/device/device_binary_elementwise.hpp
@@ -86,7 +86,7 @@ struct DeviceBinaryElementwise : public BaseOperator
               p_c_(p_c),
               functor_(functor),
               threadPerBlock_(threadPerBlock),
-              gridSize_(128) // FIXME - Calculate the grid size by number of CU in the future
+              gridSize_(120) // FIXME - Calculate the grid size by number of CU in the future
         {
             a_grid_desc_m0_ = MakeDescriptor_M0(shape, stride_a, gridSize_, threadPerBlock_);
             b_grid_desc_m0_ = MakeDescriptor_M0(shape, stride_b, gridSize_, threadPerBlock_);
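The FIXME asks for the grid size to be derived from the CU count rather than hard-coded. The sketch below shows one way that calculation could look. It is not code from composable_kernel: `GridSizeMultipleOfCu` and `CuUtilization` are hypothetical helpers, the one-workgroup-per-CU scheduling model is a deliberate simplification, and the figure of 120 CUs used in the note is an assumption, since the commit does not name the target device. In a HIP program the CU count would typically be read from `hipDeviceProp_t::multiProcessorCount`; here it is passed in as a plain parameter so the example stays free of GPU dependencies.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not part of composable_kernel): returns the smallest
// multiple of num_cu that provides at least blocks_needed workgroups.
std::int64_t GridSizeMultipleOfCu(std::int64_t blocks_needed, std::int64_t num_cu)
{
    assert(blocks_needed > 0 && num_cu > 0);
    const std::int64_t rounds = (blocks_needed + num_cu - 1) / num_cu; // ceiling division
    return rounds * num_cu;
}

// Simplified model (assumption): each CU runs one workgroup at a time, so a
// grid executes in ceil(grid_size / num_cu) rounds. Returns the fraction of
// CU slots that do useful work across those rounds.
double CuUtilization(std::int64_t grid_size, std::int64_t num_cu)
{
    assert(grid_size > 0 && num_cu > 0);
    const std::int64_t rounds = (grid_size + num_cu - 1) / num_cu;
    return static_cast<double>(grid_size) / static_cast<double>(rounds * num_cu);
}
```

Under this model, on a device assumed to have 120 CUs, a grid of 128 needs two rounds and `CuUtilization(128, 120)` is about 0.53, while a grid of 120 fits in one round with a utilization of 1.0. Real hardware can keep several workgroups resident per CU, so treat these numbers as an illustration of the motivation, not as a measurement.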