Unverified Commit a96de03d authored by Sepehr Sameni's avatar Sepehr Sameni Committed by GitHub
Browse files

Update README.md (#4972)

parent 16f07667
...@@ -20,7 +20,7 @@ the following parameters: ...@@ -20,7 +20,7 @@ the following parameters:
### AlexNet and VGG ### AlexNet and VGG
Since `AlexNet` and the original `VGG` architectures do not include batch Since `AlexNet` and the original `VGG` architectures do not include batch
normalization, the default initial learning rate `--lr 0.1` is to high. normalization, the default initial learning rate `--lr 0.1` is too high.
``` ```
torchrun --nproc_per_node=8 train.py\ torchrun --nproc_per_node=8 train.py\
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment