Unverified Commit 600fc000 authored by zjing14's avatar zjing14 Committed by GitHub
Browse files

add more instances for bfp16 gemm (#1036)



* add more instances for bfp16

* reduce the gemm input values to prevent round-off errors

---------
Co-authored-by: default avatarJing Zhang <jizha@amd.com>
Co-authored-by: default avatarillsilin <Illia.Silin@amd.com>
parent 49e52bb3
......@@ -76,7 +76,7 @@ int profile_gemm_impl(int do_verification,
break;
default:
a_m_k.GenerateTensorValue(GeneratorTensor_3<ADataType>{0.0, 0.1});
b_k_n.GenerateTensorValue(GeneratorTensor_3<BDataType>{-0.05, 0.05});
b_k_n.GenerateTensorValue(GeneratorTensor_3<BDataType>{-0.01, 0.01});
}
using AElementOp = ck::tensor_operation::element_wise::PassThrough;
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment