Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
MIGraphX
Commits
94bba2c1
"src/targets/vscode:/vscode.git/clone" did not exist on "0f95c57dc3fc377b5c8901452ebc51e2c8ee68d7"
Commit
94bba2c1
authored
Oct 18, 2023
by
Paul
Browse files
Fix implicit conversion that is causing perf degration
parent
1fd73bac
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
3 deletions
+3
-3
src/targets/gpu/gemm_impl.cpp
src/targets/gpu/gemm_impl.cpp
+3
-3
No files found.
src/targets/gpu/gemm_impl.cpp
View file @
94bba2c1
...
...
@@ -123,13 +123,13 @@ auto rocblas_invoke(F f, Pack p, Ts... xs)
static
bool
is_transposed
(
const
shape
&
s
)
{
return
s
.
transposed
()
and
s
.
strides
().
back
()
!=
1
;
}
static
rocblas_int
get_batch_stride
(
const
argument
&
a
)
static
rocblas_int
get_batch_stride
(
const
shape
&
s
)
{
// This value is not needed for non-strided inputs
if
(
a
.
get_shape
()
.
strides
().
size
()
<
3
)
if
(
s
.
strides
().
size
()
<
3
)
return
0
;
else
return
a
.
get_shape
().
strides
()[
a
.
get_shape
()
.
strides
().
size
()
-
3
];
return
s
.
strides
()[
s
.
strides
().
size
()
-
3
];
}
/**
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment