Unverified Commit ae228fef authored by Philip Meier's avatar Philip Meier Committed by GitHub
Browse files

Add default training parameters to classification refrence README (#1998)

* add default parameters to README

* fix vgg_*_bn
parent e75b4973
...@@ -4,7 +4,31 @@ This folder contains reference training scripts for image classification. ...@@ -4,7 +4,31 @@ This folder contains reference training scripts for image classification.
They serve as a log of how to train specific models, as provide baseline They serve as a log of how to train specific models, as provide baseline
training and evaluation scripts to quickly bootstrap research. training and evaluation scripts to quickly bootstrap research.
Except otherwise noted, all models have been trained on 8x V100 GPUs. Except otherwise noted, all models have been trained on 8x V100 GPUs with
the following parameters:
| Parameter | value |
| ------------------------ | ------ |
| `--batch_size` | `32` |
| `--epochs` | `90` |
| `--lr` | `0.1` |
| `--momentum` | `0.9` |
| `--wd`, `--weight-decay` | `1e-4` |
| `--lr-step-size` | `30` |
| `--lr-gamma` | `0.1` |
### AlexNet and VGG
Since `AlexNet` and the original `VGG` architectures do not include batch
normalization, the default initial learning rate `--lr 0.1` is to high.
```
python main.py --model $MODEL --lr 1e-2
```
Here `$MODEL` is one of `alexnet`, `vgg11`, `vgg13`, `vgg16` or `vgg19`. Note
that `vgg11_bn`, `vgg13_bn`, `vgg16_bn`, and `vgg19_bn` include batch
normalization and thus are trained with the default parameters.
### ResNext-50 32x4d ### ResNext-50 32x4d
``` ```
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment