Unverified Commit 4e2dc4e4 authored by Jeff Rasley, committed by GitHub

Add deepspeed.init_distributed to RTD page (#645)


Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
parent 8cea96dd
@@ -12,15 +12,17 @@ def init_distributed(dist_backend="nccl",
                      auto_mpi_discovery=True,
                      distributed_port=TORCH_DISTRIBUTED_DEFAULT_PORT,
                      verbose=True):
-    """
-    Initialize torch.distributed backend, potentially performing MPI discovery if needed
+    """Initialize torch.distributed backend, potentially performing MPI discovery if needed.
+
     Arguments:
-        dist_backend (str): torch distributed backend, e.g., nccl, mpi, gloo
-        auto_mpi_discovery (bool): if distributed environment variables are not set, attempt to discover them from MPI
-        distributed_port (int, optional): torch distributed backend port
-        verbose (bool, optional): verbose logging
+        dist_backend: torch distributed backend, e.g., nccl, mpi, gloo
+        auto_mpi_discovery: if distributed environment variables are not set, attempt to discover them from MPI
+        distributed_port: torch distributed backend port
+        verbose: verbose logging
     """
     required_env = ["RANK", "WORLD_SIZE", "MASTER_ADDR", "MASTER_PORT", "LOCAL_RANK"]
     if auto_mpi_discovery and not all(map(lambda v: v in os.environ, required_env)):
         if verbose:
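For reference, a minimal sketch of how the function documented above might be called; this example is not part of the commit, and assumes a launcher (or MPI discovery) supplies the usual distributed environment variables:

```python
import torch
import deepspeed

# Set up torch.distributed early. If RANK, WORLD_SIZE, MASTER_ADDR,
# MASTER_PORT, and LOCAL_RANK are not already set by a launcher,
# auto_mpi_discovery=True attempts to discover them from MPI.
deepspeed.init_distributed(dist_backend="nccl")

# torch.distributed is now usable before any DeepSpeed engine exists.
rank = torch.distributed.get_rank()
world_size = torch.distributed.get_world_size()
print(f"rank {rank} of {world_size} initialized")
```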
@@ -25,7 +25,7 @@ to add DeepSpeed's builtin arguments to your application's parser.
 Training Initialization
 -----------------------
-The entrypoint for all training with DeepSpeed is ``deepspeed.initialize()``.
+The entrypoint for all training with DeepSpeed is ``deepspeed.initialize()``. It will initialize the distributed backend if it is not initialized already.
 Example usage:
@@ -36,3 +36,9 @@ Example usage:
                                          model_parameters=net.parameters())
 
 .. autofunction:: deepspeed.initialize
+
+Distributed Initialization
+--------------------------
+Optional distributed backend initialization separate from ``deepspeed.initialize()``. This is useful in scenarios where the user wants to use torch distributed calls before calling ``deepspeed.initialize()``, such as when using model parallelism, pipeline parallelism, or certain data loader scenarios.
+
+.. autofunction:: deepspeed.init_distributed
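To make the new documentation section concrete, here is a hedged sketch of the pattern it describes (``build_model`` and ``cmd_args`` are hypothetical placeholders, not part of this commit):

```python
import torch.distributed as dist
import deepspeed

# Initialize the distributed backend first, so torch.distributed calls
# can run before the DeepSpeed engine is created.
deepspeed.init_distributed(dist_backend="nccl")

# Example of a torch.distributed call that must precede deepspeed.initialize(),
# e.g., building a process group for model or pipeline parallelism.
mp_group = dist.new_group(ranks=list(range(dist.get_world_size())))

net = build_model()  # hypothetical: construct your torch.nn.Module
model_engine, optimizer, _, _ = deepspeed.initialize(
    args=cmd_args,  # hypothetical: argparse namespace with DeepSpeed args
    model=net,
    model_parameters=net.parameters())
```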