- 24 Apr, 2023 9 commits
-
-
root authored
Merge branch 'aosewski/ggemm_splitk' of https://github.com/ROCmSoftwarePlatform/composable_kernel into aosewski/ggemm_splitk
-
Jing Zhang authored
-
Adam Osewski authored
-
Adam Osewski authored
-
zjing14 authored
-
Jing Zhang authored
-
rocking authored
* [What] Remove pure conv int8 instance [Why] We will never use pure int8 conv in AI, use int8 quantization instead * Change layout * Share the kernel parameter * Support more type of NHWGC for group conv * Revise client example of conv 2d, use NHWGC layout * Add instance to cmake * Revise layout of group conv quantization instance * Revise layout of external api of group conv quantization * Revise layout of group conv quantization client example * Fix clang format * Add comment to describe meaning of each parameter
-
Jing Zhang authored
-
Jing Zhang authored
-
- 23 Apr, 2023 1 commit
-
-
Jing Zhang authored
-
- 22 Apr, 2023 3 commits
-
-
Jing Zhang authored
-
Jing Zhang authored
-
Illia Silin authored
* simplify karg in device/grid split-k op * fix mk_kn_mn instances * add more instances * use name from tensor layout --------- Co-authored-by:carlushuang <carlus.huang@amd.com>
-
- 21 Apr, 2023 2 commits
-
-
Illia Silin authored
* switch to the new rocm5.6 compiler and docker * fix syntax
-
Sam Wu authored
Co-authored-by:samjwu <samjwu@users.noreply.github.com>
-
- 20 Apr, 2023 1 commit
-
-
zjing14 authored
-
- 18 Apr, 2023 1 commit
-
-
Illia Silin authored
* enable use of rocm5.5 release candidate 4 * upgrade to ROCM5.5 RC5 * try fix the PUB_KEY error, remove the cmake-data package * upgrade to latest cmake version * use private dockerhub repo for rocm5.5 rc5 * add missing bracket
-
- 17 Apr, 2023 2 commits
-
-
zjing14 authored
-
rocking5566 authored
-
- 16 Apr, 2023 2 commits
-
-
Haocong WANG authored
-
Rostyslav Geyyer authored
Co-authored-by:Rosty Geyyer <rosty.geyyer@amd.com>
-
- 11 Apr, 2023 5 commits
-
-
Haocong WANG authored
-
-
Sam Wu authored
-
zjing14 authored
Co-authored-by:root <root@ctr-ubbsmc15.amd.com>
-
zjing14 authored
* add a marco to turn off denorm fix by default * expose the marco --------- Co-authored-by:root <root@ctr-ubbsmc15.amd.com>
-
- 10 Apr, 2023 2 commits
-
-
zjing14 authored
-
rocking5566 authored
* Rename to proper naming * Add example of groupnorm + swish * Extract duplicate code in example * Add groupnorm + swish instances * Ractor instance generation, split into multiple cpp file * Add external api and client example * Refine profiler message * Use ck math version of exp * Refine problem size in example * Add host version of exp
-
- 07 Apr, 2023 3 commits
-
-
Adam Osewski authored
-
Adam Osewski authored
-
- 05 Apr, 2023 6 commits
-
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
- 04 Apr, 2023 2 commits
-
-
Adam Osewski authored
-
Adam Osewski authored
-
- 03 Apr, 2023 1 commit
-
-
Adam Osewski authored
-