1. 23 Oct, 2021 1 commit
  2. 18 Oct, 2021 1 commit
  3. 16 Oct, 2021 1 commit
  4. 14 Oct, 2021 2 commits
  5. 13 Oct, 2021 1 commit
  6. 08 Oct, 2021 2 commits
  7. 07 Oct, 2021 1 commit
  8. 06 Oct, 2021 1 commit
  9. 02 Oct, 2021 1 commit
  10. 30 Sep, 2021 1 commit
  11. 28 Sep, 2021 1 commit
  12. 24 Sep, 2021 2 commits
  13. 08 Sep, 2021 1 commit
    • Masaki Kozuki's avatar
      enable ninja (#1164) · 9ce0a10f
      Masaki Kozuki authored
      - passing include directories to `CUDAExtension`'s `include_dirs` argument
      - removing `-I/path/to/dir` arguments from `extra_compile_args`
      9ce0a10f
  14. 04 Sep, 2021 1 commit
    • Burc Eryilmaz's avatar
      fix CUBLAS guards (#1162) · 54b93919
      Burc Eryilmaz authored
      
      
      * support for fused dense layer with cublasLt, fusion in both fprop and bprop
      
      * fix typo causing syntax error
      
      * add fused GEMM+gelu+GEMM modue
      
      * fix typo for workspace size
      
      * update cublas check for 11600
      
      * add tests for fused dense layer
      
      * fix CUDA 10.x path
      
      * safer guard around CUBLAS constants, remove unreferenced variable
      
      * more guard changes
      
      * guard against cublas version instead of cuda
      Co-authored-by: default avatarSukru Eryilmaz <seryilmaz@computelab-dgx1v-32.nvidia.com>
      54b93919
  15. 02 Sep, 2021 13 commits
  16. 01 Sep, 2021 5 commits
  17. 31 Aug, 2021 3 commits
  18. 30 Aug, 2021 1 commit
  19. 20 Aug, 2021 1 commit