- 16 May, 2024 1 commit
-
-
letaoqin authored
-
- 25 Apr, 2024 1 commit
-
-
ltqin authored
* add flush cache to device op * add flush cache parameter to ckProfiler * change calculate size a and b method * chang evaluation time method foro AVERAGE to MEDIAN * format code * adjust some code * fix core dumped * remove loop call flush icache in kernel * remove loop(outer) call flush icache --------- Co-authored-by:letaoqin <letaoqin@amd.com>
-
- 19 Jan, 2024 1 commit
-
-
Haocong WANG authored
* Optimize GEMM on MI200/300: 1. Add new blockwise gemm pipeline 2. Add irregular splitk intances * clang format + typo fix * Fix a bug
-
- 28 Nov, 2023 1 commit
-
-
zjing14 authored
-
- 17 Nov, 2023 1 commit
-
-
zjing14 authored
* improve 4k gemm perf * add f8 instances * format --------- Co-authored-by:Jing Zhang <jizha@amd.com>
-
- 31 May, 2023 1 commit
-
-
Illia Silin authored
-
- 16 Sep, 2022 1 commit
-
-
Chao Liu authored
-
- 25 Jun, 2022 1 commit
-
-
Chao Liu authored
-
- 12 May, 2022 1 commit
-
-
JD authored
* Add host API * manually rebase on develop * clean * manually rebase on develop * exclude tests from all target * address review comments * update client app name * fix missing lib name * clang-format update * refactor * refactor * refactor * refactor * refactor * fix test issue * refactor * refactor * refactor * upate cmake and readme Co-authored-by:Chao Liu <chao.liu2@amd.com>
-