1. 05 Sep, 2021 1 commit
    • Chao Liu's avatar
      GEMM driver and kernel (#29) · 19613902
      Chao Liu authored
      * add gemm driver
      
      * tweak
      
      * add gemm kernel: mk_kn_mn and km_kn_mn
      
      * tweak
      
      * add GEMM km_nk_mn
      
      * fix comment
      19613902
  2. 31 Aug, 2021 1 commit
    • ltqin's avatar
      Backward weight v4r4r2 with xdlops (#18) · 627d8ef3
      ltqin authored
      
      
      * start
      
      * modify transformat
      
      * modify device convolutiion
      
      * modify host
      
      * added host conv bwd and wrw
      
      * remove bwd, seperate wrw
      
      * clean
      
      * hacall k to zero
      
      * out log
      
      * fixed
      
      * fixed
      
      * change to (out in wei)
      
      * input hack
      
      * hack to out
      
      * format
      
      * fix by comments
      
      * change wei hacks(wei transform has not merge)
      
      * fix program once issue
      
      * fix review comment
      
      * fix vector load issue
      
      * tweak
      Co-authored-by: default avatarltqin <letaoqin@amd.com>
      Co-authored-by: default avatarJing Zhang <jizhan@amd.com>
      Co-authored-by: default avatarChao Liu <chao.liu2@amd.com>
      627d8ef3
  3. 23 Aug, 2021 1 commit
    • zjing14's avatar
      Xdlops refactor fix (#22) · 9d3f634a
      zjing14 authored
      * added constexpr ahead of adptor; clean unused driver; rename M/NPerWave to M/NPerXDL
      
      * fixed bwd
      
      * fixed comment
      9d3f634a
  4. 19 Aug, 2021 1 commit
    • zjing14's avatar
      refactor dynamic xdlops iGemm (#13) · a2ad6d35
      zjing14 authored
      * xdlops refactor
      
      * fixed commnt
      
      * clean xdlops_gemm
      
      * add make c into xldops-gemm
      
      * change mfma_info
      
      * refactor xdlops, hide c desc
      
      * clean
      
      * clean
      
      * clean
      
      * apply hacks changes to v4r4r4_nhwc
      
      * rename hacks and use single stage adapter
      
      * enable fp16 mfma
      a2ad6d35
  5. 13 Aug, 2021 1 commit
  6. 11 Aug, 2021 2 commits
  7. 10 Aug, 2021 5 commits
  8. 09 Aug, 2021 6 commits
  9. 07 Aug, 2021 1 commit
    • Chao Liu's avatar
      fix · 61487e0a
      Chao Liu authored
      61487e0a
  10. 06 Aug, 2021 2 commits
  11. 27 Jul, 2021 1 commit
    • Chao Liu's avatar
      [MIOpen Downstream] Initial MIOpen integration (#52) · f63a23ac
      Chao Liu authored
      * update online kernel wrapper bundle all descriptors in a tuple
      
      * change __CONSTANT__ to CONSTANT
      
      * rename
      
      * adding tuning
      
      * added IsValidCompileParameter
      
      * reorginze
      
      * adding tunable for fp16 and int8
      
      * fix kernel compile warning and bug fixes
      
      * suppress warning about cast CONSTANT (address space 4) pointer
      
      * fix building issue
      f63a23ac
  12. 18 Jul, 2021 1 commit