- 19 Jul, 2023 2 commits
- 18 Jul, 2023 2 commits
- 05 Jul, 2023 1 commit
-
-
Paul authored
-
- 23 Jun, 2023 3 commits
- 22 Jun, 2023 1 commit
-
-
Paul authored
-
- 16 Jun, 2023 2 commits
-
-
Alan Turner authored
-
Alan Turner authored
-
- 15 Jun, 2023 3 commits
-
-
Illia Silin authored
* enable gfx941/942 targets * fix clang format * fix the cmake logic for multiple targets * fix cmake syntax for looping over targets * add gfx941/942 support for gemm_xdl instances
-
zjing14 authored
* Changed wei layout * changed layout for examples * fixed client example --------- Co-authored-by:root <root@ctr-ubbsmc15.amd.com>
-
Qianfeng authored
* Add getAvailableComputeUnitCount() interface * Use available number of compute units to set kernel grid size
-
- 14 Jun, 2023 2 commits
-
-
Illia Silin authored
* fix CI builds with latest staging compiler * remove mount flags from dockerfile
-
Rostyslav Geyyer authored
* Add generic instance gemm_add_add_fastgelu * Add a client example for generic gemm_add_add_fastgelu * Update CMakeLists * Format * Format * Add generic instance gemm_add_fastgelu * Format * Add a gemm_add_fastgelu client example * Format * Add generic instance gemm_fastgelu * Format * Fix argument order * Add gemm_fastgelu client example * Add exceptions if argument is not supported
-
- 12 Jun, 2023 4 commits
-
-
Rostyslav Geyyer authored
-
Bartłomiej Kocot authored
* Add DeviceBatchedGemmMultipleD_Dl * Fix batched_gemm tests * Fix comments * test_batched_gemm_multi_d fixes * Fix args for isSupported batchedGemmMultipleDDl * Disable tests for gfx90a
-
Po Yen Chen authored
* Fix wrong pointer type * Rename type trait get_unsigned_int<> to get_carrier<> * Add 3-bytes carrier type * Add missing __device__ specifier * Rename template non-type parameter * Leave the rest byte uninitialized * Avoid invoking (host) STL algorithms * Remove unnecessary 'inline' specifier * Extract common logic out as helper method * Hide dummy member function * Add missing __device__ specifier
-
ltqin authored
* add check input parameter * add instance for vector load = 1 * move gerneral instance to first pos * fix read bias code * regular code for bias load --------- Co-authored-by:zjing14 <zhangjing14@gmail.com>
-
- 09 Jun, 2023 1 commit
-
-
Alan Turner authored
-
- 08 Jun, 2023 1 commit
-
-
carlushuang authored
-
- 07 Jun, 2023 10 commits
-
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Illia Silin authored
* update dockerfile to build rocm5.6 rc3 * fix couple of docker issues
-
- 06 Jun, 2023 5 commits
-
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
Merge branch 'migx-jit-lib' of https://github.com/ROCmSoftwarePlatform/composable_kernel into migx-jit-lib
-
Alan Turner authored
-
- 02 Jun, 2023 3 commits
-
-
Illia Silin authored
-
Paul authored
-
Paul authored
-