add readme

f3622eb7 · Sugon_ldc · c141d055 · c141d055 · f3622eb7 · f3622eb7
Commit f3622eb7 authored Jun 17, 2023 by Sugon_ldc
Hide whitespace changes
Inline Side-by-side

Showing with 251 additions and 251 deletions

MobileNetv3.md MobileNetv3.md +0 -82

README.md README.md +82 -169

README_origin.md README_origin.md +169 -0

No files found.
--- a/MobileNetv3.md
+++ b/MobileNetv3.md
-# MobileNetv3_PyTorch算力测试
-
-## 模型介绍
-
-MobileNetV3是一种轻量级的卷积神经网络模型，由Google Brain团队于2019年提出。相对于MobileNetV2，MobileNetV3在准确性和模型大小上都有所提升。MobileNetV3主要采用了以下几种技术：
-
-1. 倒残差结构：MobileNetV3使用了一种新的倒残差结构，通过在瓶颈层中使用两个卷积层和一个可分离卷积层，来提高准确性和收敛速度。
-2. 自适应线性单元（AdaLIN）：MobileNetV3使用了自适应线性单元（AdaLIN）来替代传统的批归一化（Batch Normalization）层，以在准确性和模型大小之间取得平衡。
-3. 利用Squeeze-and-Excitation模块进行通道注意力：MobileNetV3引入了Squeeze-and-Excitation（SE）模块，通过学习每个通道的重要性，对特征图中的每个通道进行加权，从而提高模型的准确性。
-4. 硬件感知：MobileNetV3对不同硬件设备进行了优化，可以在CPU、GPU和移动端等设备上高效地运行。
-
-MobileNetV3可以用于图像分类、目标检测、实例分割等任务，是一种轻量级、高效、准确性较高的卷积神经网络模型，可以在移动设备等资源受限的场景下进行应用。
-
-
-
-## 模型结构
-
-MobileNetV3模型结构主要由三部分组成：特征提取网络、分类器和辅助模块。其中，特征提取网络是MobileNetV3的核心部分，采用了一些创新的设计来提高模型的准确性和效率。
-
-1. 特征提取网络：MobileNetV3的特征提取网络由一系列瓶颈块组成，每个瓶颈块包含一个1x1卷积层、一个可分离卷积层和一个1x1卷积层。MobileNetV3还引入了倒残差结构，通过在瓶颈块中使用两个卷积层和一个可分离卷积层，来提高准确性和收敛速度。此外，MobileNetV3还采用了自适应线性单元（AdaLIN）来替代传统的批归一化（Batch Normalization）层，以在准确性和模型大小之间取得平衡。MobileNetV3还引入了Squeeze-and-Excitation（SE）模块，通过学习每个通道的重要性，对特征图中的每个通道进行加权，从而提高模型的准确性。
-2. 分类器：MobileNetV3的分类器采用了全局平均池化和一些全连接层，将特征提取网络输出的特征映射到各个类别的概率。
-3. 辅助模块：MobileNetV3还引入了一些辅助模块，如多尺度训练、标签平滑等，来提高模型的准确性和泛化能力。
-
-MobileNetV3的模型结构非常适合用于移动设备等资源受限的场景下，因为它具有轻量级、高效和准确性较高的特点。
-
-
-
-## 数据集
-
-本次训练使用的[CIFAR-10数据集](https://www.cs.toronto.edu/~kriz/cifar.html)
-
-CIFAR-10是一个常用的图像分类数据集，由加拿大的加尔加里大学Alex Krizhevsky等人于2009年发布。该数据集包含了10个类别，每个类别包含了6000张32x32大小的彩色图像，共计60000张图像。
-
-CIFAR-10数据集是一个相对较小的数据集，但它包含了各种各样的物体和场景，是一个非常有代表性的数据集。它被广泛应用于图像分类算法的评估和比较，是一个标准的基准测试数据集。
-
-CIFAR-10数据集相对于其他图像数据集，具有以下几个特点：
-
-1. 数据量相对较小，但类别丰富，适合用于训练和测试轻量级的模型。
-2. 图像尺寸较小，可以在较短的时间内处理完成。
-3. 图像质量较低，具有一定的噪声和变形，对算法的鲁棒性和泛化能力提出了一定的挑战。
-
-CIFAR-10数据集是一个非常有用的数据集，既可以用于学术研究，也可以用于实际应用中。
-
-## 训练
-
-提供光源拉取的训练镜像
-
-```
-docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:1.10.0-centos7.6-dtk-22.10-py38-latest
-```
-
-
-
-```
-python -m torch.distributed.run --nproc_per_node 4 train.py --batch-size=128 --mode=small --print-freq=1 --dataset=CIFAR10 --ema-decay=0 --label-smoothing=0 --lr=0.2 --save-epoch-freq=10 --lr-decay=cos --lr-min=0 --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=400 --num-workers=2 --width-multiplier=1  --data-dir /data/ --save-path /data/mobilenetv3out/ --resume ./pretrained/best_model_wts-67.52.pth 2>&1 | tee mobilenetv3_dcu_`date +%Y%m%d%H%M%S`.log
-```
-
-data-dir值为存放数据集的目录，如下图data目录下存放cifar-10-batches-py数据集目录，data-dir值为/data/
-
-![1](D:\Model_Use\MobileNetv3\1.png)
-
--resume 为预训练模型路径
-
-日志使用tee存入log文件中
-
-
-
-## 准确率数据
-
-| 卡数 |  精度  |
-| :--: | :----: |
-|  4   | 0.9144 |
-
-
-
-## 源码仓库及问题反馈
-
-http://developer.hpccube.com/codes/modelzoo/conformer_pytorch.git
-
-## 参考
-
-[https://github.com/showlo/mobilenetv3](https://github.com/showlo/mobilenetv3)
\ No newline at end of file
--- a/README.md
+++ b/README.md
-# MobileNetV3
-An implementation of MobileNetV3 with pyTorch
-
-# Theory
-&emsp;You can find the paper of MobileNetV3 at [Searching for MobileNetV3](https://arxiv.org/abs/1905.02244).
-
-# Prepare data
-
-* CIFAR-10
-* CIFAR-100
-* SVHN
-* Tiny-ImageNet
-* ImageNet: Please move validation images to labeled subfolders, you can use the script [here](https://raw.githubusercontent.com/soumith/imagenetloader.torch/master/valprep.sh).
-
-# Train
-
-* Train from scratch:
-
-```
-CUDA_VISIBLE_DEVICES=3 python train.py --batch-size=128 --mode=small \
--print-freq=100 --dataset=CIFAR100 --ema-decay=0 --label-smoothing=0.1 \
--lr=0.3 --save-epoch-freq=1000 --lr-decay=cos --lr-min=0 \
--warmup-epochs=5 --weight-decay=6e-5 --num-epochs=200 --width-multiplier=1 \
-nbd -zero-gamma -mixup
-```
-
-where the meaning of the parameters are as followed:
-
-```
-batch-size
-mode: using MobileNetV3-Small(if set to small) or MobileNetV3-Large(if set to large).
-dataset: which dataset to use(CIFAR10, CIFAR100, SVHN, TinyImageNet or ImageNet).
-ema-decay: decay of EMA, if set to 0, do not use EMA.
-label-smoothing: $epsilon$ using in label smoothing, if set to 0, do not use label smoothing.
-lr-decay: learning rate decay schedule, step or cos.
-lr-min: min lr in cos lr decay.
-warmup-epochs: warmup epochs using in cos lr deacy.
-num-epochs: total training epochs.
-nbd: no bias decay.
-zero-gamma: zero $gamma$ of last BN in each block.
-mixup: using Mixup.
-```
-
-# Pretrained models
-
-&emsp;We have provided the pretrained MobileNetV3-Small model in `pretrained`.
-
-# Experiments
-
-## Training setting
-
-### on ImageNet
-
-```
-CUDA_VISIBLE_DEVICES=5 python train.py --batch-size=128 --mode=small --print-freq=2000 --dataset=imagenet \
--ema-decay=0.99 --label-smoothing=0.1 --lr=0.1 --save-epoch-freq=50 --lr-decay=cos --lr-min=0 --warmup-epochs=5 \
--weight-decay=1e-5 --num-epochs=250 --num-workers=2 --width-multiplier=1 -dali -nbd -mixup -zero-gamma -save
-```
-
-### on CIFAR-10
-
-```
-CUDA_VISIBLE_DEVICES=1 python train.py --batch-size=128 --mode=small --print-freq=100 --dataset=CIFAR10\
-  --ema-decay=0 --label-smoothing=0 --lr=0.35 --save-epoch-freq=1000 --lr-decay=cos --lr-min=0\
-  --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=400 --num-workers=2 --width-multiplier=1
-```
-
-### on CIFAR-100
-
-```
-CUDA_VISIBLE_DEVICES=1 python train.py --batch-size=128 --mode=small --print-freq=100 --dataset=CIFAR100\
-  --ema-decay=0 --label-smoothing=0 --lr=0.35 --save-epoch-freq=1000 --lr-decay=cos --lr-min=0\
-  --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=400 --num-workers=2 --width-multiplier=1
-```
-
-&emsp;Using more tricks：
-```
-CUDA_VISIBLE_DEVICES=1 python train.py --batch-size=128 --mode=small --print-freq=100 --dataset=CIFAR100\
-  --ema-decay=0.999 --label-smoothing=0.1 --lr=0.35 --save-epoch-freq=1000 --lr-decay=cos --lr-min=0\
-  --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=400 --num-workers=2 --width-multiplier=1\
-  -zero-gamma -nbd -mixup
-```
-
-### on SVHN
-
-```
-CUDA_VISIBLE_DEVICES=3 python train.py --batch-size=128 --mode=small --print-freq=1000 --dataset=SVHN\
-  --ema-decay=0 --label-smoothing=0 --lr=0.35 --save-epoch-freq=1000 --lr-decay=cos --lr-min=0\
-  --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=20 --num-workers=2 --width-multiplier=1
-```
-
-### on Tiny-ImageNet
-
-```
-CUDA_VISIBLE_DEVICES=7 python train.py --batch-size=128 --mode=small --print-freq=100 --dataset=tinyimagenet\
-  --data-dir=/media/data2/chenjiarong/ImageData/tiny-imagenet --ema-decay=0 --label-smoothing=0 --lr=0.15\
-  --save-epoch-freq=1000 --lr-decay=cos --lr-min=0 --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=200\
-  --num-workers=2 --width-multiplier=1 -dali
-```
-
-&emsp;Using more tricks：
-```
-CUDA_VISIBLE_DEVICES=7 python train.py --batch-size=128 --mode=small --print-freq=100 --dataset=tinyimagenet\
-  --data-dir=/media/data2/chenjiarong/ImageData/tiny-imagenet --ema-decay=0.999 --label-smoothing=0.1 --lr=0.15\
-  --save-epoch-freq=1000 --lr-decay=cos --lr-min=0 --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=200\
-  --num-workers=2 --width-multiplier=1 -dali -nbd -mixup
-```
-
-## MobileNetV3-Large
-
-### on ImageNet
-
-|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
-| -----------  | --------- | ---------- | --------- | --------- |
-| Offical 1.0  | 219 M     | 5.4  M     | 75.2%     |     -     |
-| Ours    1.0  | 216.6 M   | 5.47 M     | -         |     -     |
-
-### on CIFAR-10
-
-|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
-| -----------  | --------- | ---------- | --------- | --------- |
-| Ours    1.0  | 66.47 M   | 4.21 M     | -         |     -     |
-
-### on CIFAR-100
-
-|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
-| -----------  | --------- | ---------- | --------- | --------- |
-| Ours    1.0  | 66.58 M   | 4.32 M     | -         |     -     |
-
-## MobileNetV3-Small
-
-### on ImageNet
-
-|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
-| -----------  | --------- | ---------- | --------- | --------- |
-| Offical 1.0  | 56.5 M    | 2.53 M     | 67.4%     |     -     |
-| Ours    1.0  | 56.51 M   | 2.53 M     | 67.52%    | 87.58%    |
-
-&emsp;The pretrained model with top-1 accuracy 67.52% is provided in the folder [pretrained](https://github.com/ShowLo/MobileNetV3/tree/master/pretrained).
-
-### on CIFAR-10 (Average accuracy of 5 runs)
-
-|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
-| -----------  | --------- | ---------- | --------- | --------- |
-| Ours    1.0  |  17.51 M  |   1.52 M   |   92.97%  |     -     |
-
-### on CIFAR-100 (Average accuracy of 5 runs)
-
-|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
-| -----------  | --------- | ---------- | --------- | --------- |
-| Ours    1.0  |  17.60 M  |   1.61 M   |  73.69%   |  92.31%   |
-| More Tricks  |   same    |    same    |  76.24%   |  92.58%   |
-
-### on SVHN (Average accuracy of 5 runs)
-
-|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
-| -----------  | --------- | ---------- | --------- | --------- |
-| Ours    1.0  |  17.51 M  |   1.52 M   |   97.92%  |     -     |
-
-### on Tiny-ImageNet (Average accuracy of 5 runs)
-
-|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
-| -----------  | --------- | ---------- | --------- | --------- |
-| Ours    1.0  |  51.63 M  |   1.71 M   |  59.32%   |  81.38%   |
-| More Tricks  |   same    |    same    |  62.62%   |  84.04%   |
-
-## Dependency
-
-&emsp;This project uses Python 3.7 and PyTorch 1.1.0. The FLOPs and Parameters and measured using [torchsummaryX](https://github.com/nmhkahn/torchsummaryX).
+# MobileNetv3_PyTorch算力测试
+
+## 模型介绍
+
+MobileNetV3是一种轻量级的卷积神经网络模型，由Google Brain团队于2019年提出。相对于MobileNetV2，MobileNetV3在准确性和模型大小上都有所提升。MobileNetV3主要采用了以下几种技术：
+
+1. 倒残差结构：MobileNetV3使用了一种新的倒残差结构，通过在瓶颈层中使用两个卷积层和一个可分离卷积层，来提高准确性和收敛速度。
+2. 自适应线性单元（AdaLIN）：MobileNetV3使用了自适应线性单元（AdaLIN）来替代传统的批归一化（Batch Normalization）层，以在准确性和模型大小之间取得平衡。
+3. 利用Squeeze-and-Excitation模块进行通道注意力：MobileNetV3引入了Squeeze-and-Excitation（SE）模块，通过学习每个通道的重要性，对特征图中的每个通道进行加权，从而提高模型的准确性。
+4. 硬件感知：MobileNetV3对不同硬件设备进行了优化，可以在CPU、GPU和移动端等设备上高效地运行。
+
+MobileNetV3可以用于图像分类、目标检测、实例分割等任务，是一种轻量级、高效、准确性较高的卷积神经网络模型，可以在移动设备等资源受限的场景下进行应用。
+
+
+
+## 模型结构
+
+MobileNetV3模型结构主要由三部分组成：特征提取网络、分类器和辅助模块。其中，特征提取网络是MobileNetV3的核心部分，采用了一些创新的设计来提高模型的准确性和效率。
+
+1. 特征提取网络：MobileNetV3的特征提取网络由一系列瓶颈块组成，每个瓶颈块包含一个1x1卷积层、一个可分离卷积层和一个1x1卷积层。MobileNetV3还引入了倒残差结构，通过在瓶颈块中使用两个卷积层和一个可分离卷积层，来提高准确性和收敛速度。此外，MobileNetV3还采用了自适应线性单元（AdaLIN）来替代传统的批归一化（Batch Normalization）层，以在准确性和模型大小之间取得平衡。MobileNetV3还引入了Squeeze-and-Excitation（SE）模块，通过学习每个通道的重要性，对特征图中的每个通道进行加权，从而提高模型的准确性。
+2. 分类器：MobileNetV3的分类器采用了全局平均池化和一些全连接层，将特征提取网络输出的特征映射到各个类别的概率。
+3. 辅助模块：MobileNetV3还引入了一些辅助模块，如多尺度训练、标签平滑等，来提高模型的准确性和泛化能力。
+
+MobileNetV3的模型结构非常适合用于移动设备等资源受限的场景下，因为它具有轻量级、高效和准确性较高的特点。
+
+
+
+## 数据集
+
+本次训练使用的[CIFAR-10数据集](https://www.cs.toronto.edu/~kriz/cifar.html)
+
+CIFAR-10是一个常用的图像分类数据集，由加拿大的加尔加里大学Alex Krizhevsky等人于2009年发布。该数据集包含了10个类别，每个类别包含了6000张32x32大小的彩色图像，共计60000张图像。
+
+CIFAR-10数据集是一个相对较小的数据集，但它包含了各种各样的物体和场景，是一个非常有代表性的数据集。它被广泛应用于图像分类算法的评估和比较，是一个标准的基准测试数据集。
+
+CIFAR-10数据集相对于其他图像数据集，具有以下几个特点：
+
+1. 数据量相对较小，但类别丰富，适合用于训练和测试轻量级的模型。
+2. 图像尺寸较小，可以在较短的时间内处理完成。
+3. 图像质量较低，具有一定的噪声和变形，对算法的鲁棒性和泛化能力提出了一定的挑战。
+
+CIFAR-10数据集是一个非常有用的数据集，既可以用于学术研究，也可以用于实际应用中。
+
+## 训练
+
+提供光源拉取的训练镜像
+
+```
+docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:1.10.0-centos7.6-dtk-22.10-py38-latest
+```
+
+
+
+```
+python -m torch.distributed.run --nproc_per_node 4 train.py --batch-size=128 --mode=small --print-freq=1 --dataset=CIFAR10 --ema-decay=0 --label-smoothing=0 --lr=0.2 --save-epoch-freq=10 --lr-decay=cos --lr-min=0 --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=400 --num-workers=2 --width-multiplier=1  --data-dir /data/ --save-path /data/mobilenetv3out/ --resume ./pretrained/best_model_wts-67.52.pth 2>&1 | tee mobilenetv3_dcu_`date +%Y%m%d%H%M%S`.log
+```
+
+data-dir值为存放数据集的目录，如下图data目录下存放cifar-10-batches-py数据集目录，data-dir值为/data/
+
+![1](D:\Model_Use\MobileNetv3\1.png)
+
+--resume 为预训练模型路径
+
+日志使用tee存入log文件中
+
+
+
+## 准确率数据
+
+| 卡数 |  精度  |
+| :--: | :----: |
+|  4   | 0.9144 |
+
+
+
+## 源码仓库及问题反馈
+
+http://developer.hpccube.com/codes/modelzoo/conformer_pytorch.git
+
+## 参考
+
+[https://github.com/showlo/mobilenetv3](https://github.com/showlo/mobilenetv3)
\ No newline at end of file
--- a/README_origin.md
+++ b/README_origin.md
+# MobileNetV3
+An implementation of MobileNetV3 with pyTorch
+
+# Theory
+&emsp;You can find the paper of MobileNetV3 at [Searching for MobileNetV3](https://arxiv.org/abs/1905.02244).
+
+# Prepare data
+
+* CIFAR-10
+* CIFAR-100
+* SVHN
+* Tiny-ImageNet
+* ImageNet: Please move validation images to labeled subfolders, you can use the script [here](https://raw.githubusercontent.com/soumith/imagenetloader.torch/master/valprep.sh).
+
+# Train
+
+* Train from scratch:
+
+```
+CUDA_VISIBLE_DEVICES=3 python train.py --batch-size=128 --mode=small \
+--print-freq=100 --dataset=CIFAR100 --ema-decay=0 --label-smoothing=0.1 \
+--lr=0.3 --save-epoch-freq=1000 --lr-decay=cos --lr-min=0 \
+--warmup-epochs=5 --weight-decay=6e-5 --num-epochs=200 --width-multiplier=1 \
+-nbd -zero-gamma -mixup
+```
+
+where the meaning of the parameters are as followed:
+
+```
+batch-size
+mode: using MobileNetV3-Small(if set to small) or MobileNetV3-Large(if set to large).
+dataset: which dataset to use(CIFAR10, CIFAR100, SVHN, TinyImageNet or ImageNet).
+ema-decay: decay of EMA, if set to 0, do not use EMA.
+label-smoothing: $epsilon$ using in label smoothing, if set to 0, do not use label smoothing.
+lr-decay: learning rate decay schedule, step or cos.
+lr-min: min lr in cos lr decay.
+warmup-epochs: warmup epochs using in cos lr deacy.
+num-epochs: total training epochs.
+nbd: no bias decay.
+zero-gamma: zero $gamma$ of last BN in each block.
+mixup: using Mixup.
+```
+
+# Pretrained models
+
+&emsp;We have provided the pretrained MobileNetV3-Small model in `pretrained`.
+
+# Experiments
+
+## Training setting
+
+### on ImageNet
+
+```
+CUDA_VISIBLE_DEVICES=5 python train.py --batch-size=128 --mode=small --print-freq=2000 --dataset=imagenet \
+--ema-decay=0.99 --label-smoothing=0.1 --lr=0.1 --save-epoch-freq=50 --lr-decay=cos --lr-min=0 --warmup-epochs=5 \
+--weight-decay=1e-5 --num-epochs=250 --num-workers=2 --width-multiplier=1 -dali -nbd -mixup -zero-gamma -save
+```
+
+### on CIFAR-10
+
+```
+CUDA_VISIBLE_DEVICES=1 python train.py --batch-size=128 --mode=small --print-freq=100 --dataset=CIFAR10\
+  --ema-decay=0 --label-smoothing=0 --lr=0.35 --save-epoch-freq=1000 --lr-decay=cos --lr-min=0\
+  --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=400 --num-workers=2 --width-multiplier=1
+```
+
+### on CIFAR-100
+
+```
+CUDA_VISIBLE_DEVICES=1 python train.py --batch-size=128 --mode=small --print-freq=100 --dataset=CIFAR100\
+  --ema-decay=0 --label-smoothing=0 --lr=0.35 --save-epoch-freq=1000 --lr-decay=cos --lr-min=0\
+  --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=400 --num-workers=2 --width-multiplier=1
+```
+
+&emsp;Using more tricks：
+```
+CUDA_VISIBLE_DEVICES=1 python train.py --batch-size=128 --mode=small --print-freq=100 --dataset=CIFAR100\
+  --ema-decay=0.999 --label-smoothing=0.1 --lr=0.35 --save-epoch-freq=1000 --lr-decay=cos --lr-min=0\
+  --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=400 --num-workers=2 --width-multiplier=1\
+  -zero-gamma -nbd -mixup
+```
+
+### on SVHN
+
+```
+CUDA_VISIBLE_DEVICES=3 python train.py --batch-size=128 --mode=small --print-freq=1000 --dataset=SVHN\
+  --ema-decay=0 --label-smoothing=0 --lr=0.35 --save-epoch-freq=1000 --lr-decay=cos --lr-min=0\
+  --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=20 --num-workers=2 --width-multiplier=1
+```
+
+### on Tiny-ImageNet
+
+```
+CUDA_VISIBLE_DEVICES=7 python train.py --batch-size=128 --mode=small --print-freq=100 --dataset=tinyimagenet\
+  --data-dir=/media/data2/chenjiarong/ImageData/tiny-imagenet --ema-decay=0 --label-smoothing=0 --lr=0.15\
+  --save-epoch-freq=1000 --lr-decay=cos --lr-min=0 --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=200\
+  --num-workers=2 --width-multiplier=1 -dali
+```
+
+&emsp;Using more tricks：
+```
+CUDA_VISIBLE_DEVICES=7 python train.py --batch-size=128 --mode=small --print-freq=100 --dataset=tinyimagenet\
+  --data-dir=/media/data2/chenjiarong/ImageData/tiny-imagenet --ema-decay=0.999 --label-smoothing=0.1 --lr=0.15\
+  --save-epoch-freq=1000 --lr-decay=cos --lr-min=0 --warmup-epochs=5 --weight-decay=6e-5 --num-epochs=200\
+  --num-workers=2 --width-multiplier=1 -dali -nbd -mixup
+```
+
+## MobileNetV3-Large
+
+### on ImageNet
+
+|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
+| -----------  | --------- | ---------- | --------- | --------- |
+| Offical 1.0  | 219 M     | 5.4  M     | 75.2%     |     -     |
+| Ours    1.0  | 216.6 M   | 5.47 M     | -         |     -     |
+
+### on CIFAR-10
+
+|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
+| -----------  | --------- | ---------- | --------- | --------- |
+| Ours    1.0  | 66.47 M   | 4.21 M     | -         |     -     |
+
+### on CIFAR-100
+
+|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
+| -----------  | --------- | ---------- | --------- | --------- |
+| Ours    1.0  | 66.58 M   | 4.32 M     | -         |     -     |
+
+## MobileNetV3-Small
+
+### on ImageNet
+
+|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
+| -----------  | --------- | ---------- | --------- | --------- |
+| Offical 1.0  | 56.5 M    | 2.53 M     | 67.4%     |     -     |
+| Ours    1.0  | 56.51 M   | 2.53 M     | 67.52%    | 87.58%    |
+
+&emsp;The pretrained model with top-1 accuracy 67.52% is provided in the folder [pretrained](https://github.com/ShowLo/MobileNetV3/tree/master/pretrained).
+
+### on CIFAR-10 (Average accuracy of 5 runs)
+
+|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
+| -----------  | --------- | ---------- | --------- | --------- |
+| Ours    1.0  |  17.51 M  |   1.52 M   |   92.97%  |     -     |
+
+### on CIFAR-100 (Average accuracy of 5 runs)
+
+|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
+| -----------  | --------- | ---------- | --------- | --------- |
+| Ours    1.0  |  17.60 M  |   1.61 M   |  73.69%   |  92.31%   |
+| More Tricks  |   same    |    same    |  76.24%   |  92.58%   |
+
+### on SVHN (Average accuracy of 5 runs)
+
+|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
+| -----------  | --------- | ---------- | --------- | --------- |
+| Ours    1.0  |  17.51 M  |   1.52 M   |   97.92%  |     -     |
+
+### on Tiny-ImageNet (Average accuracy of 5 runs)
+
+|              | Madds     | Parameters | Top1-acc  | Top5-acc  |
+| -----------  | --------- | ---------- | --------- | --------- |
+| Ours    1.0  |  51.63 M  |   1.71 M   |  59.32%   |  81.38%   |
+| More Tricks  |   same    |    same    |  62.62%   |  84.04%   |
+
+## Dependency
+
+&emsp;This project uses Python 3.7 and PyTorch 1.1.0. The FLOPs and Parameters and measured using [torchsummaryX](https://github.com/nmhkahn/torchsummaryX).