Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
composable_kernel
Commits
1d4f5453
Commit
1d4f5453
authored
Aug 31, 2021
by
ltqin
Browse files
program once issue
parent
0df62d59
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
4 additions
and
2 deletions
+4
-2
host/driver_offline/include/driver_gemm_xdlops_v2r4.hpp
host/driver_offline/include/driver_gemm_xdlops_v2r4.hpp
+4
-2
No files found.
host/driver_offline/include/driver_gemm_xdlops_v2r4.hpp
View file @
1d4f5453
#pragma once
#ifndef DRIVER_GEMM_XDLOPS_V2R4
#define DRIVER_GEMM_XDLOPS_V2R4
#include "common_header.hpp"
#include "tensor_descriptor.hpp"
...
...
@@ -139,7 +140,7 @@ __host__ float driver_gemm_xdlops_v2r4(const FloatAB* p_a_grid,
const
index_t
grid_size
=
GridwiseGemm
::
CalculateGridSize
(
c_m_n_grid_desc
);
const
auto
kernel
=
kernel_gemm_xdlops_v2r
3
<
GridwiseGemm
,
const
auto
kernel
=
kernel_gemm_xdlops_v2r
4
<
GridwiseGemm
,
FloatAB
,
FloatC
,
remove_reference_t
<
AK0MK1GridDesc
>
,
...
...
@@ -189,3 +190,4 @@ __host__ float driver_gemm_xdlops_v2r4(const FloatAB* p_a_grid,
#endif
return
ave_time
;
}
#endif
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment