Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
composable_kernel_ROCM
Commits
41b94703
Commit
41b94703
authored
Nov 20, 2024
by
Andriy Roshchenko
Browse files
Fix GPU verification reporting logic.
parent
25c6d97b
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
16 additions
and
12 deletions
+16
-12
example/01_gemm/run_gemm_example.inc
example/01_gemm/run_gemm_example.inc
+8
-6
example/35_splitK_gemm/run_splitK_gemm_example.inc
example/35_splitK_gemm/run_splitK_gemm_example.inc
+8
-6
No files found.
example/01_gemm/run_gemm_example.inc
View file @
41b94703
...
@@ -416,13 +416,15 @@ bool run_gemm(const ProblemType& problem_size, const ExecutionConfig& config)
...
@@ -416,13 +416,15 @@ bool run_gemm(const ProblemType& problem_size, const ExecutionConfig& config)
c_m_n_device_ref_buf
.
FromDevice
(
c_m_n_device_ref_result
.
mData
.
data
());
c_m_n_device_ref_buf
.
FromDevice
(
c_m_n_device_ref_result
.
mData
.
data
());
c_m_n_device_buf
.
FromDevice
(
c_m_n_device_result
.
mData
.
data
());
c_m_n_device_buf
.
FromDevice
(
c_m_n_device_result
.
mData
.
data
());
pass
=
ck
::
utils
::
check_err
(
c_m_n_device_result
,
auto
gpu_
pass
=
ck
::
utils
::
check_err
(
c_m_n_device_result
,
c_m_n_device_ref_result
,
c_m_n_device_ref_result
,
"Error: Incorrect results!"
,
"Error: Incorrect results!"
,
get_rtol
<
CDataType
>
(),
get_rtol
<
CDataType
>
(),
get_atol
<
CDataType
>
());
get_atol
<
CDataType
>
());
if
(
pass
)
if
(
gpu_
pass
)
std
::
cout
<<
"Verification on GPU: PASS"
<<
std
::
endl
;
std
::
cout
<<
"Verification on GPU: PASS"
<<
std
::
endl
;
pass
=
pass
&&
gpu_pass
;
}
}
return
pass
;
return
pass
;
...
...
example/35_splitK_gemm/run_splitK_gemm_example.inc
View file @
41b94703
...
@@ -308,13 +308,15 @@ bool run_splitK_gemm(const ProblemSize& problem_size, const ExecutionConfig& con
...
@@ -308,13 +308,15 @@ bool run_splitK_gemm(const ProblemSize& problem_size, const ExecutionConfig& con
c_m_n_device_ref_buf
.
FromDevice
(
c_m_n_device_ref_result
.
mData
.
data
());
c_m_n_device_ref_buf
.
FromDevice
(
c_m_n_device_ref_result
.
mData
.
data
());
c_m_n_device_buf
.
FromDevice
(
c_m_n_device_result
.
mData
.
data
());
c_m_n_device_buf
.
FromDevice
(
c_m_n_device_result
.
mData
.
data
());
pass
=
ck
::
utils
::
check_err
(
c_m_n_device_result
,
auto
gpu_
pass
=
ck
::
utils
::
check_err
(
c_m_n_device_result
,
c_m_n_device_ref_result
,
c_m_n_device_ref_result
,
"Error: Incorrect results!"
,
"Error: Incorrect results!"
,
get_rtol
<
CDataType
>
(),
get_rtol
<
CDataType
>
(),
get_atol
<
CDataType
>
());
get_atol
<
CDataType
>
());
if
(
pass
)
if
(
gpu_
pass
)
std
::
cout
<<
"Verification on GPU: PASS"
<<
std
::
endl
;
std
::
cout
<<
"Verification on GPU: PASS"
<<
std
::
endl
;
pass
&=
gpu_pass
;
}
}
if
(
config
.
time_kernel
)
if
(
config
.
time_kernel
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment