- 26 Mar, 2021 1 commit
-
-
Min Xu authored
- added DDP equivalency test - added rmf, state_dict_norm functions to testing utils - added more debugging output to objects_are_equal
-
- 25 Mar, 2021 3 commits
-
-
Benjamin Lefaudeux authored
-
Benjamin Lefaudeux authored
* re-activating unit test * removing changed that slipped in
-
Sam Shleifer authored
Co-authored-by:Min Xu <24926999+min-xu-ai@users.noreply.github.com>
-
- 22 Mar, 2021 1 commit
-
-
Benjamin Lefaudeux authored
-
- 20 Mar, 2021 1 commit
-
-
Myle Ott authored
* Add new test for weight init (fails) * Set FSDP.compute_device so summon_full_params works before module moves to CUDA * Override FSDP.apply to enable custom weight init
-
- 19 Mar, 2021 3 commits
-
-
Benjamin Lefaudeux authored
* param buckets * unifying the buckets
-
msbaines authored
-
msbaines authored
-
- 18 Mar, 2021 9 commits
-
-
Benjamin Lefaudeux authored
-
Min Xu authored
-
Benjamin Lefaudeux authored
* extracting the buckets in a dedicated class, fixing the resize_ bug * adding a unit test * copyright
-
Myle Ott authored
-
Benjamin Lefaudeux authored
* enabling disabled tests
-
Min Xu authored
* [feat] FSDP: add auto_wrap_bn - add an utility function to handle wrapping of BN * changelog
-
Min Xu authored
* [feature] FSDP: enable pytorch SyncBN - not fully validated yet but at least not asserting - this enables VISSL to move forward with its next PR * add the test file * changelog and lint * addressed comment
-
Benjamin Lefaudeux authored
-
Benjamin Lefaudeux authored
-
- 17 Mar, 2021 4 commits
-
-
anj-s authored
* debugging statements * fix index inputs and streams * fix lint errors * remove print * lint errors * address comments * lint error Co-authored-by:Anjali Sridhar <anj@devfair0443.h2.fair>
-
Min Xu authored
-
Benjamin Lefaudeux authored
* Deactivating buckets for a single rank, not crashing but not useful
-
Benjamin Lefaudeux authored
-
- 15 Mar, 2021 2 commits
-
-
Benjamin Lefaudeux authored
* extending the current state_dict interface, make it possible to do everything in a single call, and to checkpoint on all ranks
-
Benjamin Lefaudeux authored
-
- 12 Mar, 2021 2 commits
-
-
Min Xu authored
* FSDP: multi-pass autograd graph and mixed precision - added BACKWARD_PRE/POST checking - better assert_state - fixed issue of backward hook misfiring * fix * cleanup * Update fairscale/nn/data_parallel/fully_sharded_data_parallel.py Co-authored-by:
Myle Ott <myleott@fb.com> Co-authored-by:
Myle Ott <myleott@fb.com>
-
msbaines authored
-
- 11 Mar, 2021 2 commits
-
-
Benjamin Lefaudeux authored
* Adding a hard sync barrier before the broadcast, mostly useful for Gloo actually, NCCL is synced behind the scene * adding a proper unit test * adding a unit test for https://github.com/facebookresearch/fairscale/pull/510
-
Benjamin Lefaudeux authored
-
- 10 Mar, 2021 1 commit
-
-
Siddharth Goyal authored
* experimental: Add spectrain support * Address review comments * Address review comments
-
- 09 Mar, 2021 9 commits
-
-
Benjamin Lefaudeux authored
-
Benjamin Lefaudeux authored
* seemingly fix flakyness for gloo by checking all coms handles
-
anj-s authored
* smal fix, remove unused flags * remove usused flag * add back max_batch flag * adding back lazy_construction * adding back lazy_construction * add missing device arg Co-authored-by:Anjali Sridhar <anj@devfair0443.h2.fair>
-
Min Xu authored
* [chore] 0.3.1 release - mainly because vissl needs the new version - added a doc on release steps * Update CHANGELOG.md Co-authored-by:
anj-s <32556631+anj-s@users.noreply.github.com> * review comments Co-authored-by:
anj-s <32556631+anj-s@users.noreply.github.com>
-
Myle Ott authored
-
brett koonce authored
-
Myle Ott authored
-
Benjamin Lefaudeux authored
-
Sam Shleifer authored
-
- 08 Mar, 2021 2 commits
-
-
Myle Ott authored
-
Sam Shleifer authored
* Document FSDP tips and tricks in a separate file
-