  1. 13 Apr, 2021 1 commit
  2. 25 Mar, 2021 1 commit
  3. 29 Jul, 2020 1 commit
  4. 24 Jun, 2020 1 commit
  5. 20 Jan, 2020 1 commit
    • Added bwd data v3r1 v4r1, tweaking v1 (#10) · c5da0377
      Chao Liu authored
      * Added bwd data v3r1: breaks the computation down into a series of load-balanced GEMMs and launches them in a single kernel
      * Added bwd data v4r1: like v3r1, but launches the GEMMs in multiple kernels
      * Tweaked v1r1 and v1r2 (atomic) on AMD GPU
  6. 03 Dec, 2019 1 commit
    • backward data (#7) · 8f5f6496
      Chao Liu authored
      * enabled atomic add in tensor copy
      * added gridwise GEMM
      * added backward data conv using GEMM + atomic
      * added backward data conv using GEMM, no atomic
  7. 04 Nov, 2019 1 commit
  8. 11 Oct, 2019 1 commit