"docs/vscode:/vscode.git/clone" did not exist on "8bde6a543ba00adf2f7e330fbfebd624c876ab4d"
  1. 09 Aug, 2021 1 commit
  2. 18 Jul, 2021 1 commit
  3. 05 Jul, 2021 1 commit
    • Chao Liu's avatar
      DL GEMM fp32/fp16/int8 (#41) · b8b2d0a6
      Chao Liu authored
      * add threadwise copy the copy a tensor in one copy, added kpack to DL GEMM
      
      * add kpack into fwd v4r5 nchw fp32
      b8b2d0a6
  4. 11 May, 2021 1 commit
  5. 24 Jun, 2020 1 commit
  6. 20 Jan, 2020 1 commit
    • Chao Liu's avatar
      Added bwd data v3r1 v4r1, tweaking v1 (#10) · c5da0377
      Chao Liu authored
      * Added bwd data v3r1: breaking down compute into a series of load balanced GEMM, and launch in a single kernel
      * Added bwd data v4r1: like v3r1, but launch GEMMs in multiple kernels
      * Tweaked v1r1  and v1r2 (atomic) on AMD GPU
      c5da0377
  7. 05 Jul, 2019 1 commit
  8. 13 Jun, 2019 1 commit
  9. 12 Jun, 2019 2 commits
  10. 11 Jun, 2019 1 commit
  11. 01 Apr, 2019 1 commit
  12. 15 Feb, 2019 3 commits
  13. 14 Feb, 2019 1 commit