1. 11 Dec, 2022 2 commits
  2. 05 Dec, 2022 1 commit
  3. 21 Oct, 2022 1 commit
  4. 05 Oct, 2022 3 commits
  5. 30 Sep, 2022 1 commit
  6. 25 Sep, 2022 1 commit
  7. 24 Sep, 2022 3 commits
  8. 23 Sep, 2022 6 commits
    • Min Xu's avatar
      [fix] SDP syncing buffers during gradient accumulation (#1075) · bfd57ff3
      Min Xu authored
      
      
      - Fixes from Benjamin.
      
      Original commit msg:
        - Fixes #1041. I just had a minute or two, hoping that it's enough :)
      Co-authored-by: default avatarMin Xu <min.xu.public@gmail.com>
      bfd57ff3
    • Min Xu's avatar
      disable code cov (#1077) · abfa7193
      Min Xu authored
      
      Co-authored-by: default avatarMin Xu <min.xu.public@gmail.com>
      abfa7193
    • Min Xu's avatar
      disable codecov (#1076) · 72fcabec
      Min Xu authored
      
      Co-authored-by: default avatarMin Xu <min.xu.public@gmail.com>
      72fcabec
    • Anupam Bhatnagar's avatar
      0.4.10 release · 6f03e415
      Anupam Bhatnagar authored
      6f03e415
    • Min Xu's avatar
      [fix] better handling non-flatten in FSDP (#1072) · 429f3d31
      Min Xu authored
      * [fix] better handling non-flatten in FSDP
      
      - see the detailed comment about that backward firing case
      - also minor debugging help in FSDP
      - also minor fix in FPW's state dict
      
      * [feat] disallow reset_parameters by default
      
      * [feat] adding fsdp_instances API - useful in check wrapping by user code
      
      * [fix] one line fix but more than a day of debugging
      
      * fixed the case of loading combined check with empty fsdp instances
      
      * fixed another bug around state loading the root/nonroot module full param caching due to not resharding after forward
      
      * [feat] support .half and .float better
      
      * fixed a bug in gather optim state losses extra keys from the original state_dict
      
      * fixed a test failure in mixed precision
      
      * fixed another bug affecting no_sync grad acc
      
      * fixed a bug and a test in fsdp optim state
      
      * fixed another corner case
      
      * added a comment
      
      * skip ssd offload tests
      
      * skip fsdp one for ssd overload
      
      Co-authored-by: Mi...
      429f3d31
    • Min Xu's avatar
      [fix] don't import ProcessGroup eagerly (#1074) · 47ce21ac
      Min Xu authored
      
      
      * [fix] don't import ProcessGroup eagerly
      
      - move the import into typing since it is only used for type checking
      - fixes #1057
      
      * more fixes
      
      * one more
      
      * tested at least
      Co-authored-by: default avatarMin Xu <min.xu.public@gmail.com>
      47ce21ac
  9. 13 Sep, 2022 3 commits
  10. 10 Sep, 2022 1 commit
  11. 07 Sep, 2022 4 commits
  12. 26 Aug, 2022 1 commit
  13. 25 Aug, 2022 1 commit
    • Min Xu's avatar
      [chore] update nightly version (#1064) · 15d4cf15
      Min Xu authored
      
      
      * update nightly version
      
      * update wgit to use numpy for load/store
      
      - this is introduced with new nightly torch version, which made torch.save() not
        producing deterministic bytes
      - this make tensor<->numpy conversion and then do the save/load to avoid that issues.
      
      * fixed tests
      Co-authored-by: default avatarMin Xu <min.xu.public@gmail.com>
      15d4cf15
  14. 11 Aug, 2022 1 commit
  15. 08 Aug, 2022 2 commits
  16. 03 Aug, 2022 1 commit
  17. 31 Jul, 2022 1 commit
    • Riyasat Ohib's avatar
      Implmentation of dense_sst_to_dst and sst_dst_to_dense (#1048) · c1dada48
      Riyasat Ohib authored
      [Feat] Implements dense_sst_to_dst and sst_dst_to_dense methods and adds tests
      
      1. Implements the dense_sst_to_dst and sst_dst_to_dense method.
      2. Adds tests for perfect reconstruction with all top-k across different dims.
      3. Adds tests for the two new methods.
      c1dada48
  18. 29 Jul, 2022 1 commit
  19. 28 Jul, 2022 1 commit
  20. 27 Jul, 2022 1 commit
    • Riyasat Ohib's avatar
      [Feat] dense to sst implementation (#1034) · 608492af
      Riyasat Ohib authored
      * [Feat] dense to sst implementation
      1. Implementation of dense_to_sst function.
      2. calculating the threshold for both the cases of top-k-element and top-k-percentage (fraction)
      3. assertions to verify that the top_k_elements is smaller than the numel along the same dim
      4. top_k_percent to top-k conversion
      5. When calculating SST, now the real part of the complex dense_freq is used instead of the magnitudes.
      
      * [Feat, Tests] transform method addition, handling of top_k_element None case
      1. Addition of a transform method
      2. Adds code to handle the dim=None case for top_k_element
      
      * [Feat, Refactor] Reorganizations, new assertions and fixes.
      1. XOR for validation that both of topk percent and element are not set, or both simultaneously unset. One and only one is set.
      3. Distills topk and percent both to topk using unified helper function .
      5. Adds a scatter topk values function to scatter values for SST and in future DST.
      6. Validation for percentage...
      608492af
  21. 26 Jul, 2022 4 commits