- 11 Apr, 2023 2 commits
-
-
Dale Evans authored
All the examples were deleted from the repo as part of issue #712
-
gregor-soniox authored
-
- 10 Mar, 2023 1 commit
-
-
mrbaozi authored
-
- 28 Feb, 2023 1 commit
-
-
Min Xu authored
- fixes #1107 Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
- 23 Feb, 2023 1 commit
-
-
Nikita Shulga authored
Just use `torch.inf`, as torch._six is gone after https://github.com/pytorch/pytorch/pull/94709
-
- 15 Feb, 2023 1 commit
-
-
Junyeol Ryu authored
* [fix] typo in wikitext2_data.py * [fix] typo and code duplication in fsdp.py
-
- 04 Feb, 2023 1 commit
-
-
Ikko Eltociear Ashimine authored
heirarchy -> hierarchy
-
- 12 Dec, 2022 1 commit
-
-
Min Xu authored
* [test] ci py 3.11 tests Co-authored-by:
Min Xu <min.xu.public@gmail.com> * fixed setup.py * fixed ci config * fixed ci config's python 3.11 version * fixed torch installs on cpu * update pygit2 for 3.11 * we don't run benchmark on cpu, so no need to install the benchmark reqs * update torch install * try to install torchvision * numpy version 311 * fix cpu test dependency installation * pip git install cmd fix * bypass some tests in 3.11. failure due to packages they use haven't been updated for 3.11 yet Co-authored-by:
Min Xu <min.xu.public@gmail.com>
-
- 11 Dec, 2022 3 commits
-
-
Min Xu authored
Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
Anupam Bhatnagar authored
-
Min Xu authored
Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
- 05 Dec, 2022 1 commit
-
-
Changyu Gao authored
* Fix gradient accumulation Add ``is_scaled_loss`` flag to support both scaled / unscaled loss Fix ``test_grad_accum`` and``test_set_num_gradients_to_accumulate`` * Add a method to scale grad for grad_accum using unscaled loss - Revert the changes in `step` method - Add a method `scale_grad_by_num_grads_to_accum`to handle gradient accumulation using unscaled loss more explicitly - Add gradient tests * Implement _compute_corr_mean_between_grads * Improve tests and comments * Use ubuntu-20.04 instead of latest Use ubuntu-20.04 to fix the `arch x64 not found` issue [Version 3.10 with arch x64 not found actions/setup-python#401](https://github.com/actions/setup-python/issues/401) * Switch flake8 from gitlab to github Flake8 was moved to Github See discussions https://www.reddit.com/r/Python/comments/yvfww8/flake8_took_down_the_gitlab_repository_in_favor/ * Fix scikit-learn package * Update PyTorch versions * Resolve comments from Min * Minor fix * Disable broken tests for new versions of PyTorch
-
- 21 Oct, 2022 1 commit
-
-
Min Xu authored
Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
- 05 Oct, 2022 3 commits
-
-
Changyu Gao authored
* Fix gradient accumulation - Add ``is_scaled_loss`` flag to support both scaled / unscaled loss - Add a method `scale_grad_by_num_grads_to_accum`to handle gradient accumulation using unscaled loss more explicitly - Fix ``test_grad_accum`` and``test_set_num_gradients_to_accumulate`` - Add tests for gradient
-
Anupam Bhatnagar authored
-
Min Xu authored
Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
- 30 Sep, 2022 1 commit
-
-
Anupam Bhatnagar authored
-
- 25 Sep, 2022 1 commit
-
-
Min Xu authored
Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
- 24 Sep, 2022 3 commits
-
-
Min Xu authored
* simlificed the readme * clean up ssd offload * try to fix readthedocs Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
Min Xu authored
* tmp * test again * test again * add new test * clean up * add test file to the testlist * more comments * add changelog Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
Min Xu authored
Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
- 23 Sep, 2022 6 commits
-
-
Min Xu authored
- Fixes from Benjamin. Original commit msg: - Fixes #1041. I just had a minute or two, hoping that it's enough :) Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
Min Xu authored
Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
Min Xu authored
Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
Anupam Bhatnagar authored
-
Min Xu authored
* [fix] better handling non-flatten in FSDP - see the detailed comment about that backward firing case - also minor debugging help in FSDP - also minor fix in FPW's state dict * [feat] disallow reset_parameters by default * [feat] adding fsdp_instances API - useful in check wrapping by user code * [fix] one line fix but more than a day of debugging * fixed the case of loading combined check with empty fsdp instances * fixed another bug around state loading the root/nonroot module full param caching due to not resharding after forward * [feat] support .half and .float better * fixed a bug in gather optim state losses extra keys from the original state_dict * fixed a test failure in mixed precision * fixed another bug affecting no_sync grad acc * fixed a bug and a test in fsdp optim state * fixed another corner case * added a comment * skip ssd offload tests * skip fsdp one for ssd overload Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
Min Xu authored
* [fix] don't import ProcessGroup eagerly - move the import into typing since it is only used for type checking - fixes #1057 * more fixes * one more * tested at least Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
- 13 Sep, 2022 3 commits
- 10 Sep, 2022 1 commit
-
-
Min Xu authored
* [minor] [FSDP] add a better for pure fp16 * [minor] [wrap] add a flag to help fsdp pure fp16 wrapping
-
- 07 Sep, 2022 4 commits
-
-
Min Xu authored
-
Min Xu authored
* [feat] add random_sparse_mask api * correct test skip Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
Anupam Bhatnagar authored
-
Min Xu authored
* [fix]: add a context for supporting state_dict from a non-FSDP parent module * formatting Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
- 26 Aug, 2022 1 commit
-
-
Min Xu authored
* [feat] support sst disabled and dst disabled cases * added tests Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
- 25 Aug, 2022 1 commit
-
-
Min Xu authored
* update nightly version * update wgit to use numpy for load/store - this is introduced with new nightly torch version, which made torch.save() not producing deterministic bytes - this make tensor<->numpy conversion and then do the save/load to avoid that issues. * fixed tests Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
- 11 Aug, 2022 1 commit
-
-
Min Xu authored
* added a profiling class * no more type ignore after merging main * fixed a int/round bug * add unit tests * skip if no cuda for a test Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
- 08 Aug, 2022 2 commits
-
-
Min Xu authored
* update examples and comment * fixed issue with fft/ifft only doing the last dim * fixed a int/round bug; fixed tests * add cuda tests * add atol and rtol * skip cuda test correctly Co-authored-by:Min Xu <min.xu.public@gmail.com>
-
Crutcher Dunnavant authored
-