- 16 Jun, 2023 4 commits
- 15 Jun, 2023 12 commits
-
-
rocking authored
-
rocking authored
-
rocking authored
-
Illia Silin authored
* enable gfx941/942 targets * fix clang format * fix the cmake logic for multiple targets * fix cmake syntax for looping over targets * add gfx941/942 support for gemm_xdl instances
-
zjing14 authored
* Changed wei layout * changed layout for examples * fixed client example --------- Co-authored-by:root <root@ctr-ubbsmc15.amd.com>
-
Qianfeng authored
* Add getAvailableComputeUnitCount() interface * Use available number of compute units to set kernel grid size
-
rocking authored
-
rocking authored
-
rocking authored
-
rocking authored
-
rocking authored
-
rocking authored
-
- 14 Jun, 2023 2 commits
-
-
Illia Silin authored
* fix CI builds with latest staging compiler * remove mount flags from dockerfile
-
Rostyslav Geyyer authored
* Add generic instance gemm_add_add_fastgelu * Add a client example for generic gemm_add_add_fastgelu * Update CMakeLists * Format * Format * Add generic instance gemm_add_fastgelu * Format * Add a gemm_add_fastgelu client example * Format * Add generic instance gemm_fastgelu * Format * Fix argument order * Add gemm_fastgelu client example * Add exceptions if argument is not supported
-
- 12 Jun, 2023 4 commits
-
-
Rostyslav Geyyer authored
-
Bartłomiej Kocot authored
* Add DeviceBatchedGemmMultipleD_Dl * Fix batched_gemm tests * Fix comments * test_batched_gemm_multi_d fixes * Fix args for isSupported batchedGemmMultipleDDl * Disable tests for gfx90a
-
Po Yen Chen authored
* Fix wrong pointer type * Rename type trait get_unsigned_int<> to get_carrier<> * Add 3-bytes carrier type * Add missing __device__ specifier * Rename template non-type parameter * Leave the rest byte uninitialized * Avoid invoking (host) STL algorithms * Remove unnecessary 'inline' specifier * Extract common logic out as helper method * Hide dummy member function * Add missing __device__ specifier
-
ltqin authored
* add check input parameter * add instance for vector load = 1 * move gerneral instance to first pos * fix read bias code * regular code for bias load --------- Co-authored-by:zjing14 <zhangjing14@gmail.com>
-
- 09 Jun, 2023 3 commits
- 08 Jun, 2023 6 commits
- 07 Jun, 2023 2 commits
-
-
Illia Silin authored
* update dockerfile to build rocm5.6 rc3 * fix couple of docker issues
-
rocking authored
-
- 06 Jun, 2023 2 commits
- 02 Jun, 2023 4 commits
-
-
Illia Silin authored
-
rocking authored
-
rocking authored
-
rocking authored
-
- 01 Jun, 2023 1 commit
-
-
who who who authored
-