1. 05 Sep, 2021 2 commits
  2. 31 Aug, 2021 1 commit
    • ltqin's avatar
      Backward weight v4r4r2 with xdlops (#18) · 627d8ef3
      ltqin authored
      
      
      * start
      
      * modify transformat
      
      * modify device convolutiion
      
      * modify host
      
      * added host conv bwd and wrw
      
      * remove bwd, seperate wrw
      
      * clean
      
      * hacall k to zero
      
      * out log
      
      * fixed
      
      * fixed
      
      * change to (out in wei)
      
      * input hack
      
      * hack to out
      
      * format
      
      * fix by comments
      
      * change wei hacks(wei transform has not merge)
      
      * fix program once issue
      
      * fix review comment
      
      * fix vector load issue
      
      * tweak
      Co-authored-by: default avatarltqin <letaoqin@amd.com>
      Co-authored-by: default avatarJing Zhang <jizhan@amd.com>
      Co-authored-by: default avatarChao Liu <chao.liu2@amd.com>
      627d8ef3
  3. 27 Aug, 2021 1 commit
    • Chao Liu's avatar
      Misc fixes (#24) · 10bb8110
      Chao Liu authored
      * use cast_pointer_to_generic_address_space() in v6r1 kernel wrapper, DynamcBuffer and buffer_load take customized invalid-element-value, add buffer_load/store for fp64
      
      * use remove_cvref_t
      10bb8110
  4. 25 Aug, 2021 1 commit
  5. 23 Aug, 2021 2 commits
  6. 19 Aug, 2021 2 commits
    • zjing14's avatar
      refactor dynamic xdlops iGemm (#13) · a2ad6d35
      zjing14 authored
      * xdlops refactor
      
      * fixed commnt
      
      * clean xdlops_gemm
      
      * add make c into xldops-gemm
      
      * change mfma_info
      
      * refactor xdlops, hide c desc
      
      * clean
      
      * clean
      
      * clean
      
      * apply hacks changes to v4r4r4_nhwc
      
      * rename hacks and use single stage adapter
      
      * enable fp16 mfma
      a2ad6d35
    • zjing14's avatar
      Added host_conv_wrw for verification (#15) · ba6f79a7
      zjing14 authored
      * added host conv wrw
      ba6f79a7
  7. 16 Aug, 2021 4 commits
  8. 13 Aug, 2021 3 commits
  9. 11 Aug, 2021 2 commits
  10. 10 Aug, 2021 8 commits
  11. 09 Aug, 2021 8 commits
  12. 08 Aug, 2021 1 commit
  13. 07 Aug, 2021 2 commits
  14. 06 Aug, 2021 3 commits