README.md 987 Bytes
Newer Older
1
# Comparison of Large Batch Training Optimization
2

3
4
5
6
7
8
9
## 🚀Quick Start
Run with synthetic data
```bash
colossalai run --nproc_per_node 4 train.py --config config.py -s
```


10
11
## Prepare Dataset

12
13
We use CIFAR10 dataset in this example. You should invoke the `donwload_cifar10.py` in the tutorial root directory or directly run the `auto_parallel_with_resnet.py`.
The dataset will be downloaded to `colossalai/examples/tutorials/data` by default.
14
15
16
17
18
19
If you wish to use customized directory for the dataset. You can set the environment variable `DATA` via the following command.

```bash
export DATA=/path/to/data
```

20
21
You can also use synthetic data for this tutorial if you don't wish to download the `CIFAR10` dataset by adding the `-s` or `--synthetic` flag to the command.

22
23
24
25

## Run on 2*2 device mesh

```bash
26
# run with cifar10
27
colossalai run --nproc_per_node 4 train.py --config config.py
28
29
30
31

# run with synthetic dataset
colossalai run --nproc_per_node 4 train.py --config config.py -s
```