"composable_kernel/include/utility/integral_constant.hpp" did not exist on "88b77181aab1198b41b612f6d03b6dfb2d32bd40"
  • ltqin's avatar
    Implement MI200 FP16 Denorm fix inside threadwise copy (#191) · b39f07f1
    ltqin authored
    
    
    * start convert
    
    * using buffer load
    
    * add kernel transfer fun
    
    * using asm for transfer
    
    * add transpose_half_to_bhalf_2x2
    
    * add TypeMap struct
    
    * add LDSDataType to v2r3 and v2r4r2
    
    * change convert fun name
    
    * remove asm in half transfer to bhalf
    
    * fix bug for type_convert
    
    * cshuffle_v1 add LDSDataType
    
    * add ldstype for gridegemm v2r4
    
    * add lds datat ype to v3r1 2 3
    
    * init complete
    
    * fix function name
    
    * remove comments
    
    * format
    
    * fix for merge develop
    Co-authored-by: default avatarltqin <letaoqin@amd.com>
    b39f07f1
fp16_transfer_bf16.cpp 5.35 KB