1. 30 May, 2023 2 commits
  2. 29 May, 2023 2 commits
  3. 11 May, 2023 3 commits
  4. 27 Apr, 2023 2 commits
  5. 26 Apr, 2023 3 commits
  6. 30 Mar, 2023 1 commit
  7. 10 Aug, 2022 1 commit
  8. 14 Jun, 2022 1 commit
  9. 11 Jun, 2022 4 commits
  10. 26 May, 2022 1 commit
  11. 25 May, 2022 1 commit
  12. 02 Apr, 2021 2 commits
  13. 01 Apr, 2021 1 commit
  14. 31 Mar, 2021 3 commits
  15. 30 Mar, 2021 3 commits
  16. 27 Mar, 2021 2 commits
  17. 26 Mar, 2021 1 commit
  18. 25 Mar, 2021 1 commit
  19. 24 Mar, 2021 1 commit
  20. 18 Mar, 2021 2 commits
  21. 16 Mar, 2021 3 commits
    • 1-bit Adam v2 (#817) · 68c8481b
      Conglong Li authored
      Authors: @awan-10 @conglongli @samyam @jeffra
      
      What's new:
      
      NCCL-based implementation which provides better performance and usability compared to the MPI-based implementation.
      Add support for momentum masks for parameters whose gradients are constantly zero during training.
      Bug fixes (e.g., #813).
      
      * NCCL-based 1-bit Adam + Code Refactor for Comm. Backends (#594)
      
      * NCCL based 1-bit Implementation + Refactor to add communication backends (#593)
      
      * add nccl 1-bit optim.
      
      * temporary commit to save stuff.
      
      * Use dist collectives instead of mpi routines.
      
      * remove old code for comm.
      
      * Fix bugs. Still does not work.
      
      * modify to test the nccl side code path
      
      * Initial gather impl. Works intra-node.
      
      * Updates to comm. phase 2. nccl comm. passed the tests.
      
      * refactor code to introduce nccl/mpi as backends for onebit adam.
      
      * Refactor updates to test/engine.
      
      * Fix compile/runtime errors.
      
      * simplify support for nccl/mpi backends.
      
      * Add missing file
      
      *...
    • bump version 0.3.13 · 12a53b43
      Jeff Rasley authored
    • Olatunji Ruwase's avatar
      7bcd72a2