- 17 May, 2023 1 commit
-
-
Po-Yen, Chen authored
-
- 16 May, 2023 2 commits
-
-
Po-Yen, Chen authored
Merge branch 'feature/integrage-karg-simplification-pr' of github.com:ROCmSoftwarePlatform/composable_kernel into feature/integrage-karg-simplification-pr
-
Po-Yen, Chen authored
-
- 15 May, 2023 2 commits
-
-
zjing14 authored
-
Bartłomiej Kocot authored
* Add contraction profiler and tests * Build and style fixes * Allow to use any elementwise operator for ref_contraction * Introduce profile_contraction_scale and profile_contraction_bilinear * Make ref_contraction generic and extend interface tests * Stylistic minor fixes * Extend test_contraction_interface
-
- 11 May, 2023 1 commit
-
-
rocking authored
-
- 10 May, 2023 1 commit
-
-
Po-Yen, Chen authored
-
- 09 May, 2023 5 commits
-
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
- 06 May, 2023 10 commits
-
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
- 04 May, 2023 18 commits
-
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Rostyslav Geyyer authored
* Add TypeConvert class and start refactoring * Refactor TypeConvert as a struct * Get back to template functions type_convert * Add a type_convert_bf16_rtn, set rtz as default * Clean up * Add UnaryConvertPrecision struct for high-precision workloads * Format * Update type_convert to UnaryConvert on threadwise level * Update UnaryConvertPrecision * Format * Fix chmod * Add a flag to pick converion method * Format * Remove the added flag * Merge elementwise op with type conversion * Move type_convert to elemwise op, update the op * Update type_convert_precision -> bf16_convert_rtn * Clean up * Update comments * Update the CK_WORKAROUND_DENORM_FIX flag handling * Update the unneeded op to work but warn user * Remove the message * Use a PassThrough instead of ConvertBF16RTN to calcaulate reference * Format * Add missing include
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-
Po-Yen, Chen authored
-