1. 20 Oct, 2021 1 commit
    • Quentin Duval's avatar
      [feat] layer memory tracking (#808) · ad92220c
      Quentin Duval authored
      
      
      * [feat] layer memory tracking
      
      * [feat] layer memory tracking (add tests in CI)
      
      * [feat] layer memory tracking: doc typos
      
      * [feat] layer memory tracking: mypy fixes
      
      * [feat] layer memory tracking: fixes for FSDP all gather tracking on pytorch 1.9 and above
      
      * [feat] layer memory tracking: lint
      
      * [feat] layer memory tracking: mypy
      Co-authored-by: default avatarQuentinDuval <QuentinDuval@users.noreply.github.com>
      ad92220c
  2. 22 Sep, 2021 1 commit
    • tmarkstrum's avatar
      Switch default branch from master to main (#807) · b09ddb2d
      tmarkstrum authored
      * update master branch to main
      
      * added FAQ about updating the branch from master to main
      
      * fixed some false positive correction
      
      * added what is new section
      
      * fixed the quoted code area
      
      * added release what is new section
      
      * added a step in release.md
      
      * fixed a word
      b09ddb2d
  3. 21 Sep, 2021 1 commit
  4. 20 Sep, 2021 1 commit
  5. 10 Aug, 2021 1 commit
    • Rahul Iyer's avatar
      Fix pre-commit hook failures (#756) · 31d600cc
      Rahul Iyer authored
      Pre-commit hook fails when run on all files for three reasons:
      (see trace below)
      
      1. Trailing whitespace on multiple files
      2. mypy fails to load numpy and then subsequently fails to load
      LazyModule from pipe.py
      3. isort sees issues with known_third_party packages
      
      ```
      > pre-commit run --all-files
      
      Trim Trailing Whitespace.................................................Failed
      - hook id: trailing-whitespace
      - exit code: 1
      - files were modified by this hook
      
      Fixing docs/source/conf.py
      Fixing fairscale/experimental/nn/auto_shard.py
      Fixing docs/source/deep_dive/activation_checkpointing.rst
      Fixing docs/source/tutorials/pipe.rst
      Fixing docs/source/installation_instructions.rst
      Fixing docs/source/deep_dive/pipeline_parallelism.rst
      Fixing docs/source/tutorials/activation_checkpointing.rst
      Fixing docs/source/tutorials/offload_model.rst
      Fixing docs/source/deep_dive/oss_sdp_fsdp.rst
      Fixing docs/source/what_is_fairscale.rst
      Fixing CHANGELOG.md
      Fixing fairscale/experimental/nn/offload.py
      Fixing docs/source/index.rst
      Fixing docs/source/deep_dive/adascale.rst
      Fixing README.md
      Fixing docs/source/tutorials/oss.rst
      Fixing docs/source/deep_dive/offload.rst
      
      Check python ast.........................................................Passed
      Check for merge conflicts................................................Passed
      Don't commit to branch...................................................Passed
      Check for added large files..............................................Passed
      Fix End of Files.........................................................Failed
      - hook id: end-of-file-fixer
      - exit code: 1
      - files were modified by this hook
      
      Fixing requirements.txt
      Fixing docs/source/getting_started.rst
      Fixing docs/source/installation_instructions.rst
      Fixing codecov.yml
      Fixing docs/source/deep_dive/adascale.rst
      Fixing docs/source/tutorials/oss.rst
      Fixing docs/source/deep_dive/offload.rst
      
      black....................................................................Passed
      flake8...................................................................Passed
      seed isort known_third_party.............................................Failed
      - hook id: seed-isort-config
      - exit code: 1
      - files were modified by this hook
      isort....................................................................Passed
      mypy.....................................................................Failed
      - hook id: mypy
      - exit code: 2
      
      setup.cfg:45: error: Error importing plugin 'numpy.typing.mypy_plugin': No module named 'numpy'
      Found 1 error in 1 file (checked 197 source files)
      ```
      31d600cc
  6. 01 Aug, 2021 1 commit
  7. 27 Jul, 2021 1 commit
  8. 12 Jul, 2021 1 commit
  9. 28 May, 2021 1 commit
  10. 27 May, 2021 2 commits
  11. 26 May, 2021 1 commit
  12. 18 May, 2021 1 commit
  13. 12 May, 2021 1 commit
    • anj-s's avatar
      [chore] Rename and move checkpoint_activations from misc folder. (#654) · 72c6bab2
      anj-s authored
      * rename files
      
      * add newly renamed file
      
      * rename and move checkpoint activations related files
      
      * add test files to ci list
      
      * fix lint errors
      
      * modify docs
      
      * add changelog
      
      * retain old path for now
      
      * fix lint errors
      
      * add another import test case
      
      * fix merge conflict
      
      * add missing test file
      72c6bab2
  14. 26 Apr, 2021 1 commit
  15. 19 Apr, 2021 1 commit
  16. 15 Apr, 2021 1 commit
  17. 13 Apr, 2021 1 commit
  18. 02 Apr, 2021 4 commits
  19. 25 Mar, 2021 1 commit
  20. 18 Mar, 2021 1 commit
  21. 09 Mar, 2021 2 commits
  22. 08 Mar, 2021 1 commit
  23. 25 Feb, 2021 1 commit
  24. 23 Feb, 2021 2 commits
  25. 19 Feb, 2021 1 commit
  26. 25 Jan, 2021 1 commit
  27. 28 Dec, 2020 1 commit
  28. 16 Dec, 2020 1 commit
    • Min Xu's avatar
      [feat]: AdaScale work with lr_scheduler and tests, examples (#229) · d65cd838
      Min Xu authored
      * [doc]: AdaScale example and notes
      
      * formatted notes correctly as suggested by Benjamin
      
      * added feature and unit test to make sure lr_scheduler works
      
      * update the example with lr_scheduler
      
      * fixed doc with "make html"
      
      * addressed Mike's suggestions
      d65cd838
  29. 09 Dec, 2020 1 commit
  30. 05 Dec, 2020 1 commit
  31. 04 Dec, 2020 1 commit
  32. 01 Dec, 2020 1 commit
  33. 27 Nov, 2020 1 commit
  34. 21 Nov, 2020 1 commit
    • Benjamin Lefaudeux's avatar
      [feat] ShardedDataParallel with autoreduce (#157) · ad933b34
      Benjamin Lefaudeux authored
      * rewrite using autograd and Variable execution queue to make the reduce automatic
      * share buckets with OSS to remove duplication
      * some speed still likely on the table since the speed vs. bucketing does not match expectations, could be a follow up
      ad933b34