"tests/models/unets/test_models_unet_1d.py" did not exist on "7c5fef81e0aecff65c041a9dfb23aff22bf64f4b"
- 23 Jun, 2023 3 commits
- 22 Jun, 2023 1 commit
-
-
Paul authored
-
- 16 Jun, 2023 2 commits
-
-
Alan Turner authored
-
Alan Turner authored
-
- 15 Jun, 2023 3 commits
-
-
Illia Silin authored
* enable gfx941/942 targets * fix clang format * fix the cmake logic for multiple targets * fix cmake syntax for looping over targets * add gfx941/942 support for gemm_xdl instances
-
zjing14 authored
* Changed wei layout * changed layout for examples * fixed client example --------- Co-authored-by: root <root@ctr-ubbsmc15.amd.com>
-
Qianfeng authored
* Add getAvailableComputeUnitCount() interface * Use available number of compute units to set kernel grid size
-
- 14 Jun, 2023 2 commits
-
-
Illia Silin authored
* fix CI builds with latest staging compiler * remove mount flags from dockerfile
-
Rostyslav Geyyer authored
* Add generic instance gemm_add_add_fastgelu * Add a client example for generic gemm_add_add_fastgelu * Update CMakeLists * Format * Format * Add generic instance gemm_add_fastgelu * Format * Add a gemm_add_fastgelu client example * Format * Add generic instance gemm_fastgelu * Format * Fix argument order * Add gemm_fastgelu client example * Add exceptions if argument is not supported
-
- 12 Jun, 2023 4 commits
-
-
Rostyslav Geyyer authored
-
Bartłomiej Kocot authored
* Add DeviceBatchedGemmMultipleD_Dl * Fix batched_gemm tests * Fix comments * test_batched_gemm_multi_d fixes * Fix args for isSupported batchedGemmMultipleDDl * Disable tests for gfx90a
-
Po Yen Chen authored
* Fix wrong pointer type * Rename type trait get_unsigned_int<> to get_carrier<> * Add 3-bytes carrier type * Add missing __device__ specifier * Rename template non-type parameter * Leave the rest byte uninitialized * Avoid invoking (host) STL algorithms * Remove unnecessary 'inline' specifier * Extract common logic out as helper method * Hide dummy member function * Add missing __device__ specifier
-
ltqin authored
* add check input parameter * add instance for vector load = 1 * move general instance to first pos * fix read bias code * regular code for bias load --------- Co-authored-by: zjing14 <zhangjing14@gmail.com>
-
- 09 Jun, 2023 1 commit
-
-
Alan Turner authored
-
- 08 Jun, 2023 1 commit
-
-
carlushuang authored
-
- 07 Jun, 2023 10 commits
-
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Illia Silin authored
* update dockerfile to build rocm5.6 rc3 * fix couple of docker issues
-
- 06 Jun, 2023 5 commits
-
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
-
Alan Turner authored
Merge branch 'migx-jit-lib' of https://github.com/ROCmSoftwarePlatform/composable_kernel into migx-jit-lib
-
Alan Turner authored
-
- 02 Jun, 2023 3 commits
-
-
Illia Silin authored
-
Paul authored
-
Paul authored
-
- 01 Jun, 2023 5 commits
-
-
Paul Fultz II authored
* Move functions to cpp file * Move another function to cpp file * Fix semicolon * Move solution to common.hpp * Fix compile errors * Use enum for data types * Remove -Werror * Fix header install * Fix relative path * Fix header path * Install all headers
-
who who who authored
-
Po Yen Chen authored
* Remove M/N/KPad local variables * Use M/N/KPad to name padded lengths * Replace duplicated local variable by parameters * Rename variables M/N/KRaw to M/N/K * Move AK0/BK0 compute logic into GridwiseGemm * Use macro to shorten code * Move CalculateGridSize() logic into GridwiseGemm * Add comment to credit the implementation source * Reuse the existing implementation * Remove no-longer used data members * Remove elementwise-op objects from interfaces * Reserve kernel arg as whole object in interfaces * Remove redundant data member * Make 3rd type parameter optional * Remove unnecessary type parameters * Remove no-longer used descriptor-creation methods * Move kernel arg type definition into GridwiseGemm * Add macro to switch between code sections * Move argument field computing logic into device op side * Make utility method 'static' * Declare special methods * Unify MakeArgument() usage * Adapt the new GridwiseGemm interface * Push-down class 'GridwiseGemm::Argument' fields * Remove no-longer used methods * Add unused parameters * Force copying parameters in 'Embed' ctor * Remove no-longer used descriptors * Fallback change on BaseArgument * Remove macro 'INTEGER_DIVIDE_CEIL' * Make variable naming more consistent * Make sure methods are only invoked on right place * Remove trailing underscore in public attribute name * Remove necessary methods * Hide computing logic of derived attributes * Make new 'Embed' ctor only available for device code * Make sure 'Embed' type args are not references * Move check for karg.K into CheckValidity() * Remove more integer division logic from device code * Undo changes on Embed * Separate 'Problem' concept out from 'Argument' * Add overloaded version of __builtin_amdgcn_readfirstlane() * Remove 'static' specifiers * Remove more 'static' specifier * Replace unsigned char by std::byte * Add 'const' specifier to never changing variable * Add 'inline' specifier to function definition * Share same name for kernel interfaces * Fix wrong boundary calculation logic * Leave the third template arg for compatibility * Remove unnecessary parameters * Fix wrong error message (for type name) * Create descriptor on device side * Fix wrong debug message * Remove no-longer used data members * Rename type trait * Remove std:: qualifier from standard types * Replace 'size_t' by 'unsigned' * Use type alias to hint usage * Replace static_for<> by ordinary 'for' loop * Reject unsupported argument * Rename readfirstlane() to amd_wave_read_first_lane() * Rename file readfirstlance.hpp as amd_wave_read_first_lane.hpp * Update function calls * Reorder statements * Re-format files --------- Co-authored-by: zjing14 <zhangjing14@gmail.com>
-
Alan Turner authored
-
Alan Turner authored
No commit message
-