1. 20 Jan, 2020 1 commit
      Added bwd data v3r1 v4r1, tweaking v1 (#10) · c5da0377
      Chao Liu authored
      * Added bwd data v3r1: breaks the computation down into a series of load-balanced GEMMs and launches them in a single kernel
      * Added bwd data v4r1: like v3r1, but launches the GEMMs in multiple kernels
      * Tweaked v1r1 and v1r2 (atomic) on AMD GPU
  2. 29 Jul, 2019 1 commit
  3. 27 Jun, 2019 1 commit
  4. 12 Jun, 2019 1 commit
  5. 03 Apr, 2019 1 commit
  6. 02 Apr, 2019 1 commit
  7. 15 Feb, 2019 2 commits
  8. 14 Feb, 2019 1 commit
  9. 05 Nov, 2018 1 commit