1. 04 Apr, 2024 1 commit
  2. 02 Feb, 2024 1 commit
  3. 20 Apr, 2023 1 commit
  4. 11 Apr, 2023 2 commits
  5. 10 Mar, 2023 1 commit
  6. 28 Feb, 2023 1 commit
  7. 23 Feb, 2023 1 commit
  8. 15 Feb, 2023 1 commit
  9. 04 Feb, 2023 1 commit
  10. 12 Dec, 2022 1 commit
    • Min Xu's avatar
      [test] ci py 3.11 tests (#1099) · 4a98000c
      Min Xu authored
      
      
      * [test] ci py 3.11 tests
      Co-authored-by: default avatarMin Xu <min.xu.public@gmail.com>
      
      * fixed setup.py
      
      * fixed ci config
      
      * fixed ci config's python 3.11 version
      
      * fixed torch installs on cpu
      
      * update pygit2 for 3.11
      
      * we don't run benchmark on cpu, so no need to install the benchmark reqs
      
      * update torch install
      
      * try to install torchvision
      
      * numpy version 311
      
      * fix cpu test dependency installation
      
      * pip git install cmd fix
      
      * bypass some tests in 3.11. failure due to packages they use haven't been updated for 3.11 yet
      Co-authored-by: default avatarMin Xu <min.xu.public@gmail.com>
      4a98000c
  11. 11 Dec, 2022 3 commits
  12. 05 Dec, 2022 1 commit
  13. 21 Oct, 2022 1 commit
  14. 05 Oct, 2022 3 commits
  15. 30 Sep, 2022 1 commit
  16. 25 Sep, 2022 1 commit
  17. 24 Sep, 2022 3 commits
  18. 23 Sep, 2022 6 commits
    • Min Xu's avatar
      [fix] SDP syncing buffers during gradient accumulation (#1075) · bfd57ff3
      Min Xu authored
      
      
      - Fixes from Benjamin.
      
      Original commit msg:
        - Fixes #1041. I just had a minute or two, hoping that it's enough :)
      Co-authored-by: default avatarMin Xu <min.xu.public@gmail.com>
      bfd57ff3
    • Min Xu's avatar
      disable code cov (#1077) · abfa7193
      Min Xu authored
      
      Co-authored-by: default avatarMin Xu <min.xu.public@gmail.com>
      abfa7193
    • Min Xu's avatar
      disable codecov (#1076) · 72fcabec
      Min Xu authored
      
      Co-authored-by: default avatarMin Xu <min.xu.public@gmail.com>
      72fcabec
    • Anupam Bhatnagar's avatar
      0.4.10 release · 6f03e415
      Anupam Bhatnagar authored
      6f03e415
    • Min Xu's avatar
      [fix] better handling non-flatten in FSDP (#1072) · 429f3d31
      Min Xu authored
      * [fix] better handling non-flatten in FSDP
      
      - see the detailed comment about that backward firing case
      - also minor debugging help in FSDP
      - also minor fix in FPW's state dict
      
      * [feat] disallow reset_parameters by default
      
      * [feat] adding fsdp_instances API - useful in check wrapping by user code
      
      * [fix] one line fix but more than a day of debugging
      
      * fixed the case of loading combined check with empty fsdp instances
      
      * fixed another bug around state loading the root/nonroot module full param caching due to not resharding after forward
      
      * [feat] support .half and .float better
      
      * fixed a bug in gather optim state losses extra keys from the original state_dict
      
      * fixed a test failure in mixed precision
      
      * fixed another bug affecting no_sync grad acc
      
      * fixed a bug and a test in fsdp optim state
      
      * fixed another corner case
      
      * added a comment
      
      * skip ssd offload tests
      
      * skip fsdp one for ssd overload
      
      Co-authored-by: Mi...
      429f3d31
    • Min Xu's avatar
      [fix] don't import ProcessGroup eagerly (#1074) · 47ce21ac
      Min Xu authored
      
      
      * [fix] don't import ProcessGroup eagerly
      
      - move the import into typing since it is only used for type checking
      - fixes #1057
      
      * more fixes
      
      * one more
      
      * tested at least
      Co-authored-by: default avatarMin Xu <min.xu.public@gmail.com>
      47ce21ac
  19. 13 Sep, 2022 3 commits
  20. 10 Sep, 2022 1 commit
  21. 07 Sep, 2022 4 commits
  22. 26 Aug, 2022 1 commit
  23. 25 Aug, 2022 1 commit
    • Min Xu's avatar
      [chore] update nightly version (#1064) · 15d4cf15
      Min Xu authored
      
      
      * update nightly version
      
      * update wgit to use numpy for load/store
      
      - this is introduced with new nightly torch version, which made torch.save() not
        producing deterministic bytes
      - this make tensor<->numpy conversion and then do the save/load to avoid that issues.
      
      * fixed tests
      Co-authored-by: default avatarMin Xu <min.xu.public@gmail.com>
      15d4cf15