typo (#11152)

* typo
* style

0311ba21 · Stas Bekman · GitHub · 269c9638 · 0311ba21
Commit 0311ba21, authored Apr 08, 2021 by Stas Bekman, committed by GitHub Apr 08, 2021

Showing 1 changed file with 3 additions and 3 deletions

docs/source/main_classes/trainer.rst +3 -3

--- a/docs/source/main_classes/trainer.rst
+++ b/docs/source/main_classes/trainer.rst
@@ -355,9 +355,9 @@ Notes:
  able to use significantly larger batch sizes using the same hardware (e.g. 3x and even bigger) which should lead to
  significantly shorter training time.

-3. To use the second version of Sharded data-parallelism, add ``--sharded_ddp zero_dp_2`` or ``--sharded_ddp zero_dp_3`
-   to the command line arguments, and make sure you have added the distributed launcher ``-m torch.distributed.launch
-   --nproc_per_node=NUMBER_OF_GPUS_YOU_HAVE`` if you haven't been using it already.
+3. To use the second version of Sharded data-parallelism, add ``--sharded_ddp zero_dp_2`` or ``--sharded_ddp
+   zero_dp_3`` to the command line arguments, and make sure you have added the distributed launcher ``-m
+   torch.distributed.launch --nproc_per_node=NUMBER_OF_GPUS_YOU_HAVE`` if you haven't been using it already.

 For example here is how you could use it for ``run_translation.py`` with 2 GPUs: