"vscode:/vscode.git/clone" did not exist on "191d99a7f9fc58a7aeb0ef2c2c0a7bf8df3c39fc"
- 22 May, 2023 3 commits
-
-
Adam Osewski authored
Merge branch 'aosewski/test_ggemm_splitk' of https://github.com/ROCmSoftwarePlatform/composable_kernel into aosewski/test_ggemm_splitk
-
Adam Osewski authored
-
Adam Osewski authored
-
- 18 May, 2023 1 commit
-
-
Sam Wu authored
* update documentation dependencies add version number to docs rename doc config directories enable more doc formats on rtd add license section in docs
-
- 17 May, 2023 1 commit
-
-
Adam Osewski authored
-
- 16 May, 2023 2 commits
-
-
Adam Osewski authored
Merge branch 'aosewski/test_ggemm_splitk' of https://github.com/ROCmSoftwarePlatform/composable_kernel into aosewski/test_ggemm_splitk
-
Adam Osewski authored
-
- 15 May, 2023 5 commits
-
-
zjing14 authored
-
Bartłomiej Kocot authored
* Add contraction profiler and tests * Build and style fixes * Allow to use any elementwise operator for ref_contraction * Introduce profile_contraction_scale and profile_contraction_bilinear * Make ref_contraction generic and extend interface tests * Stylistic minor fixes * Extend test_contraction_interface
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
- 12 May, 2023 3 commits
-
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
- 11 May, 2023 8 commits
-
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
rocking authored
-
- 10 May, 2023 6 commits
-
-
Adam Osewski authored
-
Adam Osewski authored
* Disable logging * extract out of if statement KBatch update.
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
- 04 May, 2023 11 commits
-
-
Rostyslav Geyyer authored
* Add TypeConvert class and start refactoring * Refactor TypeConvert as a struct * Get back to template functions type_convert * Add a type_convert_bf16_rtn, set rtz as default * Clean up * Add UnaryConvertPrecision struct for high-precision workloads * Format * Update type_convert to UnaryConvert on threadwise level * Update UnaryConvertPrecision * Format * Fix chmod * Add a flag to pick converion method * Format * Remove the added flag * Merge elementwise op with type conversion * Move type_convert to elemwise op, update the op * Update type_convert_precision -> bf16_convert_rtn * Clean up * Update comments * Update the CK_WORKAROUND_DENORM_FIX flag handling * Update the unneeded op to work but warn user * Remove the message * Use a PassThrough instead of ConvertBF16RTN to calcaulate reference * Format * Add missing include
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-
Adam Osewski authored
-