- 09 Feb, 2020 4 commits
-
-
Shaden Smith authored
-
Jeff Rasley authored
-
dependabot[bot] authored
Bumps [tensorflow-gpu](https://github.com/tensorflow/tensorflow) from 1.14.0 to 1.15.2.
- [Release notes](https://github.com/tensorflow/tensorflow/releases)
- [Changelog](https://github.com/tensorflow/tensorflow/blob/master/RELEASE.md)
- [Commits](https://github.com/tensorflow/tensorflow/compare/v1.14.0...v1.15.2)
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
-
- 08 Feb, 2020 3 commits
-
-
Shaden Smith authored
-
Jeff Rasley authored
Add Azure tutorial text and scripts
-
Shaden Smith authored
-
- 07 Feb, 2020 6 commits
-
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Samyam Rajbhandari authored
adding figure caption
-
Jeff Rasley authored
* Added features.md and Getting Started guide.
* trimming newlines
* Adding hostfile discussion
* include/exclude emphasis
* Edits overview
* minor edits on the overview
* minor edits
* Fixing broken GPT2 links
* Clarify our relationship with Megatron-LM
* Update README.md
* Update README.md
* minor edits
* fix formatting
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: yuxionghe <yuxhe@microsoft.com>
-
Samyam Rajbhandari authored
* simplifying the batch config, using a single assert to test for validity and allowing for specifying only the micro batch size
* Simplifying Batch Config, Adding ability to specify batch using just micro_batch, and adding a bunch of unit tests
* ran formatting
* Typo fixes and added the config file
* reformatting
* path fixes
* removing print statements
-
Shaden Smith authored
* Ported DeepSpeed overview.
* Renamed subsection
* Formatting table of contents
* initial import of Megatron tutorial
* Grammatical edits, formatting, and paths.
* formatting and data download instructions
* formatting tutorial
* formatting tutorial
* formatting tutorial
* formatting tutorial
* formatting tutorial
* formatting tutorial
* new perf chart
* removing TODO
* adding pointer to tutorial
* edits
* azure to low bandwidth
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
-
- 06 Feb, 2020 5 commits
-
-
eltonzheng authored
* add tutorial doc for CIFAR-10 model
* pass pre-commit
* update cifar link to DeepScaleExamples
* simplify the running command
* change deepspeed.pt to deepspeed
-
Olatunji Ruwase authored
Unit tests for add_XXX_arguments
-
Olatunji Ruwase authored
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
-
Jeff Rasley authored
* Azure pipelines build badge
* Update README.md
* add license badge
-
- 05 Feb, 2020 4 commits
-
-
Jeff Rasley authored
* update examples submodule
* install requirements.txt with install script
* add dockerfile
-
Jeff Rasley authored
-
Shaden Smith authored
* Enables NCCL backend in @distributed_test
* Adds pytest-forked to avoid CUDA re-initialization issue.
* paste typo
* transcription typo
-
Shaden Smith authored
* Adding testing documentation to README.md
* Add pytest to requirements.txt
-
- 04 Feb, 2020 6 commits
-
-
Jeff Rasley authored
update install to use pdcp to distribute wheels
-
Jeff Rasley authored
-
Jeff Rasley authored
* add allreduce test
* comment out set rank to cuda for now
* switched back to gloo
-
Shaden Smith authored
* Adds distributed_test decorator and some unit tests.
* Setting NCCL backend.
* Parametrizes test.
* rank -> local_rank
* Temporarily disable CUDA initialization.
-
Shaden Smith authored
-
Shaden Smith authored
-
- 03 Feb, 2020 12 commits
-
-
Shaden Smith authored
-
Jeff Rasley authored
-
Shaden Smith authored
-
Shaden Smith authored
-
Elton Zheng authored
-
Jeff Rasley authored
-
Shaden Smith authored
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Shaden Smith authored
Fixing file permissions.
-
yuxionghe authored
-