Commit b8b1e442 authored by Steven Basart, committed by GitHub

Rename torch.run to torchrun (#30405)

torch.run does not exist anywhere as far as I can tell.
parent 696ededd
@@ -659,7 +659,7 @@ You could also use the [`Trainer`]'s `--save_on_each_node` argument to automatic
 For [torchrun](https://pytorch.org/docs/stable/elastic/run.html), you have to ssh to each node and run the following command on both of them. The launcher waits until both nodes are synchronized before launching the training.
 ```bash
-python -m torch.run --nproc_per_node=8 --nnode=2 --node_rank=0 --master_addr=hostname1 \
+torchrun --nproc_per_node=8 --nnode=2 --node_rank=0 --master_addr=hostname1 \
 --master_port=9901 your_program.py <normal cl args> --deepspeed ds_config.json
 ```
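The documented command hard-codes `--node_rank=0`, which fits only the first node. As a rough sketch, and assuming the usual static two-node setup where each node passes its own rank, the second node would run the same command with `--node_rank=1`. The helper name `print_launch_cmd` is hypothetical and the script only prints the commands rather than launching anything. It uses the `--nnodes` spelling from the torchrun documentation and leaves out the `<normal cl args>` placeholder.

```shell
#!/bin/sh
# Dry-run sketch: prints the per-node launch command instead of executing it.
# print_launch_cmd is a hypothetical helper, not part of torchrun or transformers.
print_launch_cmd() {
    node_rank="$1"   # assumed: 0 on the first node, 1 on the second
    printf '%s\n' "torchrun --nproc_per_node=8 --nnodes=2 --node_rank=${node_rank} --master_addr=hostname1 --master_port=9901 your_program.py --deepspeed ds_config.json"
}

print_launch_cmd 0   # command to run on the first node (hostname1)
print_launch_cmd 1   # command to run on the second node
```

Only the rank differs between the two printed commands; `--master_addr` and `--master_port` stay the same on both nodes so the launcher can rendezvous.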