1. 13 Aug, 2021 1 commit
  2. 11 Aug, 2021 2 commits
  3. 10 Aug, 2021 8 commits
  4. 09 Aug, 2021 8 commits
  5. 08 Aug, 2021 1 commit
  6. 07 Aug, 2021 2 commits
  7. 06 Aug, 2021 7 commits
  8. 30 Jul, 2021 3 commits
  9. 28 Jul, 2021 3 commits
  10. 27 Jul, 2021 3 commits
  11. 18 Jul, 2021 1 commit
  12. 17 Jul, 2021 1 commit
    • Add xdlops v4r4r4 into online compilation (#48) · fbdf4332
      zjing14 authored
      
      
      * init for v4r4 xdlops olc
      
      * refactor wrap
      
      * init impl of v4r4 nchw xdlops olc
      
      * tuning
      
      * test perf
      
      * fixed v4r4 nhwc
      
      * tuned v4r4 nhwc
      
      * use gridwise_gemm_xdlops_v2r3
      
      * swap a/b
      
      * add pointer support into offline v2r3
      
      * debugging v4r4r4 transform for olc
      
      * change timer of olc
      
      * refactor v4r4 xdlops nchw olc
      
      * remove transform fun in v4r4 xdlops nhwc olc
      
      Co-authored-by: Chao Liu <chao.liu2@amd.com>