- 06 Mar, 2024 1 commit
-
-
Paul Fultz II authored
* Format * Format * Format * Remove const * Use the right template * Format * Format * add row/col instances * Add missing file * fixed * Format * Updates * Format * fixed rrr layout * Format * Update test and embed modules * Restore older version * Update year * Set -fPIC * Format * Use double for isnan * rename host folder to codegen + minor fix * add codegen CI test * add option to build components without building CK * fix the groovy syntax * fix typo * use the correct function for the codegen stage --------- Co-authored-by:
Jing Zhang <jizha@amd.com> Co-authored-by:
Illia Silin <98187287+illsilin@users.noreply.github.com> Co-authored-by:
illsilin <Illia.Silin@amd.com>
-
- 05 Mar, 2024 1 commit
-
-
Illia Silin authored
-
- 13 Feb, 2024 1 commit
-
-
Illia Silin authored
-
- 05 Feb, 2024 1 commit
-
-
Illia Silin authored
* delete dangling docker images * fix groovy syntax * fix groovy syntax again * try a different way to delete dangling images
-
- 30 Jan, 2024 2 commits
-
-
Illia Silin authored
-
Illia Silin authored
-
- 26 Jan, 2024 1 commit
-
-
Illia Silin authored
-
- 24 Jan, 2024 1 commit
-
-
Illia Silin authored
* fix cppcheck errors, first pass * fix format * fix returned value in examples * add macro definitions for cppcheck * fix the profile_gemm logic * update the gemm profiler logic * add more difinitions to cppcheck, fix couple more errors * replace runtime error with message in device function * fix a couple of int4 issues * no return for fill function * fix errors in data_types.hpp * fix format * fix few remaining errors * fix errors in data_types.hpp * fix last couple of errors in datat_types.hpp
-
- 15 Jan, 2024 1 commit
-
-
Illia Silin authored
* add cppcheck to the CK CI * fix the path to CK source for cppcheck * fix the path to CK source for cppcheck one more time * fix the path to CK source for cppcheck third time * change the path to ck_cppcheck.log * install latest cppcheck from source * fix bug in ck.hpp and use 20 threads for cppcheck * create a switch to turn cppckeck on and off in CI
-
- 05 Jan, 2024 1 commit
-
-
Illia Silin authored
* add docker for rocm6.0.1 rc1 * modify the path to clang for test compilers in CI * fix the hipcc/clang path for test compilers in CI * fix the dockerfile for older rocm versions
-
- 16 Dec, 2023 1 commit
-
-
Illia Silin authored
* upgrade to rocm6.0 compiler * move rocm6.0 from private to public repo * switch to testing hipTensor mainline in CI
-
- 07 Dec, 2023 1 commit
-
-
Illia Silin authored
* switch from ROCmSoftwarePlatform to ROCm org * replace ROCmSoftwarePlatform with ROCm in few more places
-
- 06 Dec, 2023 1 commit
-
-
Illia Silin authored
* turn on -O3 compiler flag explicitly * change cmake syntax for CI * modify cmake line breaks in jenkinsfile
-
- 05 Dec, 2023 1 commit
-
-
Illia Silin authored
* add daily build with mainline compiler * fix the compiler paths for ci * remove the -flto flag * build with clang by default
-
- 30 Nov, 2023 1 commit
-
-
Jun Liu authored
-
- 09 Nov, 2023 1 commit
-
-
Illia Silin authored
-
- 03 Nov, 2023 1 commit
-
-
Illia Silin authored
-
- 01 Nov, 2023 1 commit
-
-
Illia Silin authored
-
- 30 Oct, 2023 1 commit
-
-
Illia Silin authored
* replace ccache with sccache, pin package versions * put ccache back temporarily to avoid breaking other CI jobs * add sccashe_wrapper.sh script * fix the package version syntax * fix the pymysql package issue * run sccache_wrapper before build if ccache server found * set the paths before calling the sccache_wrapper * use /tmp instead of /usr/local for cache * try using sccache --start-server instead of wrapper * try using redis server with sccache * define SCCACHE_REDIS * add redis and ping packages, and redis port * use the new sccache redis server * do not use sccache with staging compiler * fix the condition syntax * add stunnel to redis * add tunnel verification * separate caches for different architectures * fix syntax for the cache tag * quse double brackets for conditions * add bash line to the script * add a switch for sccache and only use it in build stage * run check_host function when enabling sccache * fix the invocation tags for sccache * fix groovy syntax * set the invocation tag in groovy * disable sccache in clang-format stage * try another syntax for invocation tags * use local sccache server if can't connect to redis * fix script syntax * update README * refresh readme * readme updates * remove the timing and verification caveat from readme --------- Co-authored-by:Lisa Delaney <lisa.delaney@amd.com>
-
- 19 Oct, 2023 1 commit
-
-
Illia Silin authored
* apply the patch for dl kernels on gfx11 * build DL kernels on navi32 CI
-
- 16 Oct, 2023 1 commit
-
-
Illia Silin authored
* add a hipTensor test to CI * use jenkins git plugin * change hipTensor folder location in CI * change the git method for hipTensor * run tests usign ctest * check the hipTensor contents * only build hipTensor on MI100/200 * pull hipTensor as zip archive * fix jenkins syntax * add path to the CK installation * combine build commands into one shell * change jenkins syntax for CK installer path * try different syntax * allow unzip overwrite * fix jenkins file syntax * remove any old versions of hipTensor before building * add option to select hipTensor branch for testing
-
- 29 Sep, 2023 1 commit
-
-
Illia Silin authored
-
- 19 Sep, 2023 1 commit
-
-
Illia Silin authored
* update to rocm5.7 by default * fix jenkinsfile syntax
-
- 13 Sep, 2023 1 commit
-
-
Illia Silin authored
* enable building DL kernels with the daily staging compiler * move the DL_KERNELS flag to another function
-
- 12 Sep, 2023 1 commit
-
-
Illia Silin authored
-
- 23 Aug, 2023 1 commit
-
-
Jun Liu authored
* experiment with config file * experiment with version.h config * add more info to version.h * minor updates * minor updates * fix case where DTYPE is not used * large amount of files but minor changes * remove white space * minor changes to add more MACROs * fix cmakedefine01 * fix issue with CK internal conflict * fix define and define value * fix clang-format * fix formatting issue * experiment with cmake * clang format v12 to be consistent with miopen * avoid clang-format for config file
-
- 10 Aug, 2023 1 commit
-
-
Illia Silin authored
* add docker for rocm5.7 RC1 * fix rocm5.7 rc1 build * build QA with rocm5.7 rc1 compiler
-
- 09 Aug, 2023 1 commit
-
-
Illia Silin authored
-
- 26 Jul, 2023 1 commit
-
-
Illia Silin authored
-
- 18 Jul, 2023 1 commit
-
-
Illia Silin authored
* allow building CK for specific data types * add CI build and test stage on Naiv3x without some int8 instances * add missing gemm fp16 instances * add the changes to the missed cmake file * add empty lines at end of source files * Do not build quantization client example on navi3 in CI * disable batched_gemm_multi_d_int8 instances with DTYPES * disable device_conv2d_bwd_data_instance with DTYPES * fix ckprofiler for conv_bwd_data for int8 * properly isolate the conv_bwd_data int8 instances * remove empty line
-
- 07 Jul, 2023 1 commit
-
-
Illia Silin authored
-
- 30 Jun, 2023 1 commit
-
-
Illia Silin authored
* upgrade default compiler to rocm5.6 release * do daily runs with rocm5.6 instead of 5.5
-
- 28 Jun, 2023 1 commit
-
-
Illia Silin authored
* upgrade to rocm5.6 rc4 * add rocm5.7 docker
-
- 21 Jun, 2023 1 commit
-
-
Illia Silin authored
-
- 16 Jun, 2023 1 commit
-
-
Illia Silin authored
-
- 15 Jun, 2023 1 commit
-
-
Illia Silin authored
* enable gfx941/942 targets * fix clang format * fix the cmake logic for multiple targets * fix cmake syntax for looping over targets * add gfx941/942 support for gemm_xdl instances
-
- 23 May, 2023 1 commit
-
-
Illia Silin authored
* enable dl kernels on navi3 * do not build xdl tests and examples on Navi * run tests before building everything on jenkins * disable gemm_bilinear on gfx1030 * add gpu targets to installer on Navi * put tests in the same order as before * reduce the number of navi targets in CI * build CI installed for gfx940 as well * only build for MI300 during QA runs
-
- 03 May, 2023 1 commit
-
-
Illia Silin authored
-
- 21 Apr, 2023 1 commit
-
-
Illia Silin authored
* switch to the new rocm5.6 compiler and docker * fix syntax
-
- 19 Apr, 2023 1 commit
-
-
Haocong WANG authored
* [Navi3x] Fix Gridwise_multiple_d operation (#649) * Add CMake Option "USE_OPT_NAVI3X" * fix bug * standardize docs (#655) * Separate bibtex requirement from rocm-docs-core (#656) * separate bibtex requirement from rocm-docs-core * point requirements to source rocm-docs-core repo * Add CMake Option "USE_OPT_NAVI3X" (#647) * Add CMake Option "USE_OPT_NAVI3X" * remove navi3x opt compile option from cmake script * Conv + quantization + tanh (#645) * Rename file. Prepare to support another activation * Add comment for quantization * Extract out_elementop * Add tanh example * Add conv + bias + tanh quantization instance * Add missing parameter * Refine cmake * Add external api and client example * Extract variable in example * Fix the comment --------- Co-authored-by:
zjing14 <zhangjing14@gmail.com> * Add a denorm test fix (#603) * Add type_convert implementations for bf16 * Add the fix for conv_fwd * Add the fix for conv_bwd_data * Add the fix for conv_bwd_weight * Format * Format * Another format * Add a macro to use workaround on MI200 only * Format --------- Co-authored-by:
Rosty Geyyer <rosty.geyyer@amd.com> Co-authored-by:
zjing14 <zhangjing14@gmail.com> * simplify karg in device/grid of split-k op (#644) * simplify karg in device/grid split-k op * fix mk_kn_mn instances * add more instances * use name from tensor layout * fix 3rd dword of buffer source descriptor (#659) * add fp64 instances (#658) Co-authored-by:
root <root@ctr-ubbsmc15.amd.com> * Issue #666: Revert "simplify karg in device/grid of split-k op (#644)" (#665) This reverts commit bb5530af . * Groupnorm + swish external api (#668) * Rename to proper naming * Add example of groupnorm + swish * Extract duplicate code in example * Add groupnorm + swish instances * Ractor instance generation, split into multiple cpp file * Add external api and client example * Refine profiler message * Use ck math version of exp * Refine problem size in example * Add host version of exp * add a marco to turn on/off denorm fix (off by default) (#673) * add a marco to turn off denorm fix by default * expose the marco --------- Co-authored-by:
root <root@ctr-ubbsmc15.amd.com> * fixed quant example (#672) Co-authored-by:
root <root@ctr-ubbsmc15.amd.com> * Add dependabot config and pin rocm-docs-core (#663) * [gtest] suppress unsafe buffer warn (#670) ref: https://github.com/ROCmSoftwarePlatform/MIOpen/pull/1912 * Add memory index guard in wmma device ops (#667) * Add more macros to turn on/off denorm fix (#678) Co-authored-by:
Rosty Geyyer <rosty.geyyer@amd.com> * Fix a typo (#676) * Add (#677) * Allow using ROCm release candidate compilers. (#679) * enable use of rocm5.5 release candidate 4 * upgrade to ROCM5.5 RC5 * try fix the PUB_KEY error, remove the cmake-data package * upgrade to latest cmake version * use private dockerhub repo for rocm5.5 rc5 * add missing bracket * add vector load check * solve conflicts --------- Co-authored-by:
Sam Wu <sjwu@ualberta.ca> Co-authored-by:
Sam Wu <sam.wu2@amd.com> Co-authored-by:
rocking5566 <ChunYu.Lai@amd.com> Co-authored-by:
zjing14 <zhangjing14@gmail.com> Co-authored-by:
Rostyslav Geyyer <46627076+geyyer@users.noreply.github.com> Co-authored-by:
Rosty Geyyer <rosty.geyyer@amd.com> Co-authored-by:
carlushuang <carlus.huang@amd.com> Co-authored-by:
root <root@ctr-ubbsmc15.amd.com> Co-authored-by:
Jun Liu <Liu.Jun@amd.com> Co-authored-by:
Illia Silin <98187287+illsilin@users.noreply.github.com>
-