Unverified Commit 4c3d98dd authored by Stas Bekman, committed by GitHub

[s2s finetune_trainer] add instructions for distributed training (#8884)

parent aa60b230
@@ -213,6 +213,11 @@ To see all the possible command line options, run:
python finetune_trainer.py --help
```
For multi-GPU training, use `torch.distributed.launch`, e.g. with 2 GPUs:
```bash
python -m torch.distributed.launch --nproc_per_node=2 finetune_trainer.py ...
```
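A fuller invocation might look like the sketch below; the model name, data directory, output directory, and the remaining training flags are illustrative assumptions rather than values taken from this commit:
```bash
# Illustrative sketch: a 2-GPU run of finetune_trainer.py.
# MODEL, $DATA_DIR, $OUTPUT_DIR and the flag values are placeholder assumptions.
python -m torch.distributed.launch --nproc_per_node=2 finetune_trainer.py \
    --model_name_or_path facebook/bart-large \
    --data_dir $DATA_DIR \
    --output_dir $OUTPUT_DIR \
    --do_train \
    --num_train_epochs 3 \
    --per_device_train_batch_size 8 \
    --learning_rate 3e-5 \
    --fp16
```
`--nproc_per_node` should match the number of GPUs on the machine; `torch.distributed.launch` starts one process per GPU, and the trainer handles gradient synchronization across them.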
**At the moment, `Seq2SeqTrainer` does not support *with teacher* distillation.**
All `Seq2SeqTrainer`-based fine-tuning scripts are included in the `builtin_trainer` directory.
...