Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
composable_kernel_ROCM
Commits
e8c19535
Commit
e8c19535
authored
Oct 25, 2024
by
aska-0096
Browse files
update preprocess number for mi308; bring back printout in ckprofiler
parent
47294b4b
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
7 additions
and
4 deletions
+7
-4
include/ck/host_utility/flush_cache.hpp
include/ck/host_utility/flush_cache.hpp
+4
-1
profiler/include/profiler/profile_gemm_multiply_multiply_impl.hpp
.../include/profiler/profile_gemm_multiply_multiply_impl.hpp
+3
-3
No files found.
include/ck/host_utility/flush_cache.hpp
View file @
e8c19535
...
...
@@ -351,7 +351,10 @@ float launch_and_time_kernel_with_preprocess(const StreamConfig& stream_config,
}
#else
// return total_time / nrepeat;
return
(
total_time
-
0.01
*
nrepeat
)
/
nrepeat
;
hipDeviceProp_t
deviceProps
;
hip_check_error
(
hipGetDeviceProperties
(
&
deviceProps
,
0
));
float
preprocess_offset
=
deviceProps
.
multiProcessorCount
==
80
?
0.005
:
0.01
;
return
(
total_time
-
preprocess_offset
*
nrepeat
)
/
nrepeat
;
#endif
}
else
...
...
profiler/include/profiler/profile_gemm_multiply_multiply_impl.hpp
View file @
e8c19535
...
...
@@ -267,9 +267,9 @@ bool profile_gemm_multiply_multiply_impl(int do_verification,
float
gb_per_sec
=
num_btype
/
1.E6
/
ave_time
;
//
std::cout << "Perf: " << std::setw(10) << ave_time << " ms, " << tflops
//
<< " TFlops, " << gb_per_sec << " GB/s, " << op_name << ", KBatch "
//
<< kbatch_curr << std::endl;
std
::
cout
<<
"Perf: "
<<
std
::
setw
(
10
)
<<
ave_time
<<
" ms, "
<<
tflops
<<
" TFlops, "
<<
gb_per_sec
<<
" GB/s, "
<<
op_name
<<
", KBatch "
<<
kbatch_curr
<<
std
::
endl
;
#if defined CK_ENABLE_FP8 || defined CK_ENABLE_INT8
// set softer tolerances for fp8
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment