- 25 Apr, 2023 2 commits
- 24 Apr, 2023 2 commits
- 21 Apr, 2023 1 commit
-
-
ltqin authored
-
- 20 Apr, 2023 1 commit
-
-
ltqin authored
-
- 19 Apr, 2023 2 commits
- 18 Apr, 2023 2 commits
- 17 Apr, 2023 1 commit
-
-
ltqin authored
-
- 13 Apr, 2023 1 commit
-
-
ltqin authored
-
- 12 Apr, 2023 3 commits
-
-
ltqin authored
-
ltqin authored
Merge branch 'lib_gemm_softmax_gemm_type' of https://github.com/ROCmSoftwarePlatform/composable_kernel into lib_gemm_softmax_gemm_type
-
ltqin authored
-
- 10 Apr, 2023 2 commits
-
-
zjing14 authored
-
rocking5566 authored
* Rename to proper naming * Add example of groupnorm + swish * Extract duplicate code in example * Add groupnorm + swish instances * Ractor instance generation, split into multiple cpp file * Add external api and client example * Refine profiler message * Use ck math version of exp * Refine problem size in example * Add host version of exp
-
- 07 Apr, 2023 2 commits
-
-
ltqin authored
-
- 03 Apr, 2023 1 commit
-
-
ltqin authored
-
- 31 Mar, 2023 4 commits
-
-
ltqin authored
-
ltqin authored
Merge branch 'lib_gemm_softmax_gemm_type' of https://github.com/ROCmSoftwarePlatform/composable_kernel into lib_gemm_softmax_gemm_type
-
ltqin authored
-
zjing14 authored
-
- 30 Mar, 2023 3 commits
-
-
zjing14 authored
Co-authored-by:root <root@ctr-ubbsmc15.amd.com>
-
Haocong WANG authored
-
carlushuang authored
* simplify karg in device/grid split-k op * fix mk_kn_mn instances * add more instances * use name from tensor layout
-
- 29 Mar, 2023 3 commits
-
-
Rostyslav Geyyer authored
* Add type_convert implementations for bf16 * Add the fix for conv_fwd * Add the fix for conv_bwd_data * Add the fix for conv_bwd_weight * Format * Format * Another format * Add a macro to use workaround on MI200 only * Format --------- Co-authored-by:
Rosty Geyyer <rosty.geyyer@amd.com> Co-authored-by:
zjing14 <zhangjing14@gmail.com>
-
rocking5566 authored
* Rename file. Prepare to support another activation * Add comment for quantization * Extract out_elementop * Add tanh example * Add conv + bias + tanh quantization instance * Add missing parameter * Refine cmake * Add external api and client example * Extract variable in example * Fix the comment --------- Co-authored-by:zjing14 <zhangjing14@gmail.com>
-
Haocong WANG authored
* Add CMake Option "USE_OPT_NAVI3X" * remove navi3x opt compile option from cmake script
-
- 27 Mar, 2023 2 commits
- 24 Mar, 2023 3 commits
- 23 Mar, 2023 3 commits
-
-
Haocong WANG authored
* Add CMake Option "USE_OPT_NAVI3X" * fix bug
-
ltqin authored
-
ltqin authored
-
- 22 Mar, 2023 2 commits
-
-
Po Yen Chen authored
-
Illia Silin authored
* remove XDL parameters from WMMA kernel string * get rid f two more parameters
-