"...composable_kernel_rocm.git" did not exist on "9e33fe70c34de4816928a0d8bdf2458fe411a589"
Commit 32850b93 authored by Wen-Heng (Jack) Chung's avatar Wen-Heng (Jack) Chung
Browse files

Ported xdlops kernels to debug bwdwrw fp32/fp16/bfp16 issue. Verified atleast fwd data fp32 works.

parent 583755a7
#ifndef CK_COMMON_HEADER_HPP #ifndef CK_COMMON_HEADER_HPP
#define CK_COMMON_HEADER_HPP #define CK_COMMON_HEADER_HPP
#define MIOPEN_USE_FP16 1 #define MIOPEN_USE_FP16 0
#define MIOPEN_USE_BFP16 0 #define MIOPEN_USE_BFP16 0
#define MIOPEN_USE_FP32 0 #define MIOPEN_USE_FP32 1
#define __HIP_PLATFORM_HCC__ 1 #define __HIP_PLATFORM_HCC__ 1
......
This diff is collapsed.
This diff is collapsed.
...@@ -32,6 +32,7 @@ constexpr auto get_convolution_output_default_4d_tensor_descriptor(InDesc, WeiDe ...@@ -32,6 +32,7 @@ constexpr auto get_convolution_output_default_4d_tensor_descriptor(InDesc, WeiDe
constexpr auto HO = HI + 1 - Y; constexpr auto HO = HI + 1 - Y;
constexpr auto WO = WI + 1 - X; constexpr auto WO = WI + 1 - X;
printf("H0=%d, W0=%d\n", HO, WO);
return make_ConstantTensorDescriptor_packed(Sequence<N, K, HO, WO>{}); return make_ConstantTensorDescriptor_packed(Sequence<N, K, HO, WO>{});
} }
......
This diff is collapsed.
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment