- 02 Aug, 2024 1 commit
-
-
Sam Wu authored
-
- 19 Jul, 2024 1 commit
-
-
Makarand Maydeo authored
fix ASAN target list
-
- 18 Jul, 2024 2 commits
-
-
pramenku authored
removed GFX12 target
-
Illia Silin authored
-
- 27 Jun, 2024 7 commits
-
-
carlushuang authored
* WA for rocm-6.2+ s constrait for buffer resource * add missing memory clobber
-
Bartłomiej Kocot authored
-
Bartłomiej Kocot authored
* Fix continous dim selection in contraction * Fixes
-
Bartłomiej Kocot authored
* Support large tensors in grouped conv fwd * Multi ABD fixes * Fix calculate element space size
-
zjing14 authored
-
Bartłomiej Kocot authored
* Integrate universal gemm with conv fwd * Fix conv fwd wmma test * Fix instances * Remove direct load check
-
zjing14 authored
-
- 05 Jun, 2024 1 commit
-
-
Jun Liu authored
-
- 22 May, 2024 3 commits
-
-
Jun Liu authored
-
Bartłomiej Kocot authored
* Optimize grouped conv bwd weight for small M and N * Fixes
-
Illia Silin authored
* set individual gpu targets for instances, examples, tests * fix path to hip compiler * fix path to hip compiler once more * aggregate device macros in ck_tile config header * fix the cmake logic for instances * fix clang format * add gfx900 and gfx906 to default set of targets
-
- 21 May, 2024 1 commit
-
-
Rostyslav Geyyer authored
* Move grouped conv fwd client examples * Update existing examples * Format
-
- 20 May, 2024 1 commit
-
-
Illia Silin authored
-
- 17 May, 2024 4 commits
-
-
Jun Liu authored
-
Illia Silin authored
-
dependabot[bot] authored
Bumps [rocm-docs-core](https://github.com/RadeonOpenCompute/rocm-docs-core) from 1.1.1 to 1.1.2. - [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md) - [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v1.1.1...v1.1.2 ) --- updated-dependencies: - dependency-name: rocm-docs-core dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
rocking authored
error: no viable conversion from returned value of type '__half' to function return type 'fp16_hip_t' (aka '_Float16') Co-authored-by:carlushuang <carlus.huang@amd.com>
-
- 15 May, 2024 4 commits
-
-
Illia Silin authored
-
carlushuang authored
-
jakpiase authored
* add unit tests for grouped gemm two stage * add reviewers suggestions --------- Co-authored-by:Adam Osewski <19374865+aosewski@users.noreply.github.com>
-
Jun Liu authored
-
- 11 May, 2024 1 commit
-
-
Illia Silin authored
-
- 10 May, 2024 3 commits
-
-
Illia Silin authored
* code clean-up * remove the profiling output samples
-
carlushuang authored
* add random norm * normalized default to 0/3 * change squant->auto
-
Bartłomiej Kocot authored
-
- 09 May, 2024 2 commits
-
-
Adam Osewski authored
-
Adam Osewski authored
-
- 08 May, 2024 2 commits
-
-
Illia Silin authored
-
Bartłomiej Kocot authored
-
- 07 May, 2024 3 commits
-
-
Illia Silin authored
* enable logging using environment variable * update ck.hpp header * fix typo * fix clang format * Update include/ck/utility/env.hpp Co-authored-by:
Bartłomiej Kocot <barkocot@amd.com> --------- Co-authored-by:
Bartłomiej Kocot <barkocot@amd.com>
-
Jun Liu authored
-
carlushuang authored
* add alibi support * fix code * update code based on comment * Support more hdim * fix fp8 bias * support seqlen_k=0 case * remove unused printf * fix format --------- Co-authored-by:rocking <ChunYu.Lai@amd.com>
-
- 06 May, 2024 1 commit
-
-
Sam Wu authored
Also add component owners as codeowners for header directory
-
- 02 May, 2024 1 commit
-
-
Illia Silin authored
-
- 01 May, 2024 2 commits
-
-
Illia Silin authored
-
Illia Silin authored
-