- 25 Jul, 2023 8 commits
-
-
ltqin authored
-
ltqin authored
-
ltqin authored
-
ltqin authored
-
danyao12 authored
-
danyao12 authored
-
danyao12 authored
-
ltqin authored
* first change bias load * add bias dim and scalervector parameter * make CDE0BlockTransferSrcVectorDim not work * changse toinstance * add limit for CDE0BlockTransferSrcScalarPerVector
-
- 24 Jul, 2023 1 commit
-
-
ltqin authored
-
- 21 Jul, 2023 3 commits
-
-
Illia Silin authored
-
Illia Silin authored
-
Bartłomiej Kocot authored
-
- 20 Jul, 2023 1 commit
-
-
danyao12 authored
-
- 18 Jul, 2023 4 commits
-
-
Bartłomiej Kocot authored
* Grouped 3d conv backward data support * Fix comments
-
Rostyslav Geyyer authored
-
danyao12 authored
-
Illia Silin authored
* allow building CK for specific data types * add CI build and test stage on Naiv3x without some int8 instances * add missing gemm fp16 instances * add the changes to the missed cmake file * add empty lines at end of source files * Do not build quantization client example on navi3 in CI * disable batched_gemm_multi_d_int8 instances with DTYPES * disable device_conv2d_bwd_data_instance with DTYPES * fix ckprofiler for conv_bwd_data for int8 * properly isolate the conv_bwd_data int8 instances * remove empty line
-
- 17 Jul, 2023 3 commits
-
-
Illia Silin authored
* check if gpu_targets are supported by compiler * set default list of targets and filter for them
-
danyao12 authored
-
danyao12 authored
-
- 15 Jul, 2023 8 commits
-
-
danyao12 authored
-
danyao12 authored
Merge branch 'attn-train-develop-qloop' of https://github.com/ROCmSoftwarePlatform/composable_kernel into attn-train-develop-qloop
-
danyao12 authored
-
danyao12 authored
-
danyao12 authored
-
Dan Yao authored
Skip dropout
-
danyao12 authored
-
arvindcheru authored
* Disable Werror to ignore xnack+ warnings
-
- 14 Jul, 2023 8 commits
- 13 Jul, 2023 2 commits
- 12 Jul, 2023 2 commits
-
-
Bartłomiej Kocot authored
* Support NHWGC conv2d_bwd_weight * Fix client example * Fix client example * Fix comments * Redesign grouped_conv_bwd_weight instances * Clang format fix --------- Co-authored-by:zjing14 <zhangjing14@gmail.com>
-
danyao12 authored
-