1. 05 Jul, 2021 1 commit
    • DL GEMM fp32/fp16/int8 (#41) · b8b2d0a6
      Chao Liu authored
      * add threadwise copy that copies a tensor in one copy, added kpack to DL GEMM
      
      * add kpack into fwd v4r5 nchw fp32
  2. 10 Jun, 2021 1 commit
  3. 12 May, 2021 1 commit
  4. 11 May, 2021 1 commit
  5. 28 Apr, 2021 1 commit
  6. 13 Apr, 2021 1 commit
  7. 25 Mar, 2021 1 commit
  8. 06 Aug, 2020 1 commit
    • Bwd Data NHWC (#22) · bbcb67d0
      Chao Liu authored
      * fix buffer_store bug
      * remove obsolete kernels
      * add bwd-data-v5r1-nhwc 
  9. 24 Jun, 2020 1 commit
  10. 03 Dec, 2019 1 commit
    • backward data (#7) · 8f5f6496
      Chao Liu authored
      * enabled atomic add in tensor copy
      * added gridwise GEMM
      * added backward data conv using GEMM + atomic
      * added backward data conv using GEMM, no atomic
  11. 04 Nov, 2019 1 commit
  12. 11 Oct, 2019 1 commit
  13. 22 Sep, 2019 1 commit
  14. 21 Sep, 2019 1 commit
  15. 10 Sep, 2019 1 commit
  16. 09 Sep, 2019 1 commit
  17. 02 Sep, 2019 1 commit
  18. 19 Jun, 2019 2 commits
  19. 13 Jun, 2019 1 commit