"external/vscode:/vscode.git/clone" did not exist on "52c3fe05be9b6cfc0602918bf3f5177cf6713290"
  1. 26 Apr, 2022 1 commit
    • ltqin's avatar
      Implement MI200 FP16 Denorm fix inside threadwise copy (#191) · b39f07f1
      ltqin authored
      
      
      * start convert
      
      * using buffer load
      
      * add kernel transfer fun
      
      * using asm for transfer
      
      * add transpose_half_to_bhalf_2x2
      
      * add TypeMap struct
      
      * add LDSDataType to v2r3 and v2r4r2
      
      * change convert fun name
      
      * remove asm in half transfer to bhalf
      
      * fix bug for type_convert
      
      * cshuffle_v1 add LDSDataType
      
      * add ldstype for gridegemm v2r4
      
      * add lds datat ype to v3r1 2 3
      
      * init complete
      
      * fix function name
      
      * remove comments
      
      * format
      
      * fix for merge develop
      Co-authored-by: default avatarltqin <letaoqin@amd.com>
      b39f07f1