    [SWDEV-281541][MSRCHA-100] Implementation of Dynamic Generic Reduction (#1108) · 9e80cdce
    Qianfeng authored
    
    
    * add solver ConvIgemmFwdV6r1DlopsNchwKcyxNkhw; rename static ck source files
    
    * make inner product compatible on gfx900
    
    * Update src/include/miopen/solver/ck_utility_common.hpp
    
    * compiler parameter use stream
    
    * use int instead of index_t in kernel wrapper
    
    * DynamicBuffer, StaticBuffer, amd_buffer_load: support a custom value for invalid elements
    
    * Add dynamic generic reduction kernel layer (kernel wrappers, kernel implementations and utilities)
    
    * Update the dynamic composable kernel facility as needed for dynamic generic reduction
    
    * Update the generic reduction C++ host interface layer to support dynamic generic reduction
    
    * Remove tidy complaints in the host interface layer
    
    * Change the unary operator form from void op(T &x) to T op(T x)
    
    * Pass a single workspace pointer to all kernels (fix for OpenCL backend)
    
    * Use cppcheck-suppress to prevent some strange warnings
    
    * Re-use operator [] and () for DynamicBuffer and update dependent code
    
    * Remove unused code in first-call threadwise/warpwise/blockwise kernel wrappers
    
    * [performance] Remove unneeded local buffer initialization

    Co-authored-by: Chao Liu <chao.liu2@amd.com>
    Co-authored-by: JD <Jehandad.Khan@amd.com>
reduction_operator.hpp 11.8 KB
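
The commit message mentions changing the unary operator form from `void op(T &x)` to `T op(T x)`. The following is a minimal sketch of what that difference looks like, not the code from `reduction_operator.hpp`: the struct names `unary_abs_inplace`, `unary_abs_value`, `unary_square_value` and the helper `abs_then_square` are hypothetical and chosen only for illustration.

```cpp
#include <cmath>
#include <cstdlib>

// Hypothetical illustration; none of these names come from reduction_operator.hpp.

// Old form: void op(T& x). The operator mutates its argument in place,
// so the caller must hold the value in a named, modifiable variable.
template <typename T>
struct unary_abs_inplace
{
    void operator()(T& x) const { x = std::abs(x); }
};

// New form: T op(T x). The operator takes a value and returns the result,
// so it also works on temporaries and on elements read through a proxy.
template <typename T>
struct unary_abs_value
{
    T operator()(T x) const { return std::abs(x); }
};

// A second value-returning operator, used to show composition.
template <typename T>
struct unary_square_value
{
    T operator()(T x) const { return x * x; }
};

// With the value-returning form, operators chain in a single expression.
template <typename T>
T abs_then_square(T x)
{
    return unary_square_value<T>{}(unary_abs_value<T>{}(x));
}
```

One plausible motivation for the value-returning form is that it composes without intermediate variables and does not require an lvalue argument; the commit message itself does not state the reason.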