texture.rst 3.6 KB
Newer Older
Hang Zhang's avatar
v1.0.1  
Hang Zhang committed
1
2
3
Deep TEN: Deep Texture Encoding Network Example
===============================================

Hang Zhang's avatar
v0.1.0  
Hang Zhang committed
4
.. image:: ../_static/img/cvpr17.svg
Hang Zhang's avatar
v1.0.1  
Hang Zhang committed
5
6
7
8
9
10
11
12
13
        :width: 100%
        :align: left

In this section, we show an example of training/testing Encoding-Net for texture recognition on MINC-2500 dataset. Comparing to original Torch implementation, we use *different learning rate* for pre-trained base network and encoding layer (10x), disable color jittering after reducing lr and adopt much *smaller training image size* (224 instead of 352). 


Test Pre-trained Model
----------------------

Hang Zhang's avatar
v0.2.0  
Hang Zhang committed
14
- Clone the GitHub repo::
Hang Zhang's avatar
v1.0.1  
Hang Zhang committed
15
16
17

    git clone git@github.com:zhanghang1989/PyTorch-Encoding.git

Hang Zhang's avatar
v0.2.0  
Hang Zhang committed
18
19
- Install PyTorch Encoding (if not yet). Please follow the installation guide `Installing PyTorch Encoding <../notes/compile.html>`_.

Hang Zhang's avatar
Hang Zhang committed
20
- Download the `MINC-2500 <http://opensurfaces.cs.cornell.edu/publications/minc/>`_ dataset to ``$HOME/data/minc-2500/`` folder. Download pre-trained model (training `curve`_ as bellow, pre-trained on train-1 split using single training size of 224, with an error rate of :math:`19.98\%` using single crop on test-1 set)::
Hang Zhang's avatar
v1.0.1  
Hang Zhang committed
21

Hang Zhang's avatar
v0.2.0  
Hang Zhang committed
22
    cd PyTorch-Encoding/experiments/recognition
Hang Zhang's avatar
v1.0.1  
Hang Zhang committed
23
24
25
26
27
28
29
30
31
    bash model/download_models.sh

.. _curve:

.. image:: ../_static/img/deep_ten_curve.svg
    :width: 70%

- Test pre-trained model on MINC-2500::

Hang Zhang's avatar
Hang Zhang committed
32
    >>> python main.py --dataset minc --model deepten --nclass 23 --resume model/minc.pth.tar --eval
Hang Zhang's avatar
v1.0.1  
Hang Zhang committed
33
34
35
36
37
38
39
    # Teriminal Output:
    #[======================================== 23/23 ===================================>...]    Step: 104ms | Tot: 3s256ms | Loss: 0.719 | Err: 19.983% (1149/5750) 


Train Your Own Model
--------------------

Hang Zhang's avatar
v0.1.0  
Hang Zhang committed
40
- Example training command for training above model::
Hang Zhang's avatar
v1.0.1  
Hang Zhang committed
41

Hang Zhang's avatar
v0.2.0  
Hang Zhang committed
42
    python main.py --model deepten --nclass 23 --model deepten --batch-size 64 --lr 0.01 --epochs 60 
Hang Zhang's avatar
v1.0.1  
Hang Zhang committed
43

Hang Zhang's avatar
v0.2.0  
Hang Zhang committed
44
- Detail training options::
Hang Zhang's avatar
v1.0.1  
Hang Zhang committed
45
46
47
48

  -h, --help            show this help message and exit
  --dataset DATASET     training dataset (default: cifar10)
  --model MODEL         network model type (default: densenet)
Hang Zhang's avatar
v0.2.0  
Hang Zhang committed
49
  --backbone BACKBONE   backbone name (default: resnet50)
Hang Zhang's avatar
v1.0.1  
Hang Zhang committed
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
  --batch-size N        batch size for training (default: 128)
  --test-batch-size N   batch size for testing (default: 1000)
  --epochs N            number of epochs to train (default: 300)
  --start_epoch N       the epoch number to start (default: 0)
  --lr LR               learning rate (default: 0.1)
  --momentum M          SGD momentum (default: 0.9)
  --weight-decay M      SGD weight decay (default: 1e-4)
  --no-cuda             disables CUDA training
  --plot                matplotlib
  --seed S              random seed (default: 1)
  --resume RESUME       put the path to resuming file if needed
  --checkname           set the checkpoint name
  --eval                evaluating

.. todo::
    Provide example code for extracting features.

Extending the Software
----------------------

Hang Zhang's avatar
v0.2.0  
Hang Zhang committed
70
This code is well written, easy to use and extendable for your own models or datasets:
Hang Zhang's avatar
v1.0.1  
Hang Zhang committed
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93

- Write your own Dataloader ``mydataset.py`` to ``dataset/`` folder

- Write your own Model ``mymodel.py`` to ``model/`` folder

- Run the program::

    python main.py --dataset mydataset --model mymodel


Citation
--------

.. note::
    * Hang Zhang, Jia Xue, and Kristin Dana. "Deep TEN: Texture Encoding Network." *The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017*::

        @InProceedings{Zhang_2017_CVPR,
        author = {Zhang, Hang and Xue, Jia and Dana, Kristin},
        title = {Deep TEN: Texture Encoding Network},
        booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
        month = {July},
        year = {2017}
        }