image.md 3.75 KB
Newer Older
Kai Chen's avatar
Kai Chen committed
1
2
## Image

youkaichao's avatar
youkaichao committed
3
This module provides some image processing methods, which requires `opencv` to be installed.
Kai Chen's avatar
Kai Chen committed
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28

### Read/Write/Show
To read or write images files, use `imread` or `imwrite`.

```python
import mmcv

img = mmcv.imread('test.jpg')
img = mmcv.imread('test.jpg', flag='grayscale')
img_ = mmcv.imread(img) # nothing will happen, img_ = img
mmcv.imwrite(img, 'out.jpg')
```

To read images from bytes

```python
with open('test.jpg', 'rb') as f:
    data = f.read()
img = mmcv.imfrombytes(data)
```

To show an image file or a loaded image

```python
mmcv.imshow('tests/data/color.jpg')
Kai Chen's avatar
Kai Chen committed
29
# this is equivalent to 
Kai Chen's avatar
Kai Chen committed
30
31
32
33
34
35

for i in range(10):
    img = np.random.randint(256, size=(100, 100, 3), dtype=np.uint8)
    mmcv.imshow(img, win_name='test image', wait_time=200)
```

Kai Chen's avatar
Kai Chen committed
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
### Color space conversion
Supported conversion methods:
- bgr2gray
- gray2bgr
- bgr2rgb
- rgb2bgr
- bgr2hsv
- hsv2bgr

```python
img = mmcv.imread('tests/data/color.jpg')
img1 = mmcv.bgr2rgb(img)
img2 = mmcv.rgb2gray(img1)
img3 = mmcv.bgr2hsv(img)
```

Kai Chen's avatar
Kai Chen committed
52
### Resize
Kai Chen's avatar
Kai Chen committed
53
54
55
There are three resize methods. All `imresize_*` methods have an argument `return_scale`,
if this argument is `False`, then the return value is merely the resized image, otherwise
is a tuple `(resized_img, scale)`.
Kai Chen's avatar
Kai Chen committed
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71

```python
# resize to a given size
mmcv.imresize(img, (1000, 600), return_scale=True)

# resize to the same size of another image
mmcv.imresize_like(img, dst_img, return_scale=False)

# resize by a ratio
mmcv.imrescale(img, 0.5)

# resize so that the max edge no longer than 1000, short edge no longer than 800
# without changing the aspect ratio
mmcv.imrescale(img, (1000, 800))
```

Kai Chen's avatar
Kai Chen committed
72
73
74
75
76
77
### Rotate
To rotate an image by some angle, use `imrotate`. The center can be specified,
which is the center of original image by default. There are two modes of rotating,
one is to keep the image size unchanged so that some parts of the image will be
cropped after rotating, the other is to extend the image size to fit the rotated
image.
Kai Chen's avatar
Kai Chen committed
78
79
80

```python
img = mmcv.imread('tests/data/color.jpg')
Kai Chen's avatar
Kai Chen committed
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108

# rotate the image clockwise by 30 degrees.
img_ = mmcv.imrotate(img, 30)

# rotate the image counterclockwise by 90 degrees.
img_ = mmcv.imrotate(img, -90)

# rotate the image clockwise by 30 degrees, and rescale it by 1.5x at the same time.
img_ = mmcv.imrotate(img, 30, scale=1.5)

# rotate the image clockwise by 30 degrees, with (100, 100) as the center.
img_ = mmcv.imrotate(img, 30, center=(100, 100))

# rotate the image clockwise by 30 degrees, and extend the image size.
img_ = mmcv.imrotate(img, 30, auto_bound=True)
```

### Flip
To flip an image, use `imflip`.

```python
img = mmcv.imread('tests/data/color.jpg')

# flip the image horizontally
mmcv.imflip(img)

# flip the image vertically
mmcv.imflip(img, direction='vertical')
Kai Chen's avatar
Kai Chen committed
109
110
111
```

### Crop
Kai Chen's avatar
Kai Chen committed
112
`imcrop` can crop the image with one or some regions, represented as (x1, y1, x2, y2).
Kai Chen's avatar
Kai Chen committed
113
114
115
116
117

```python
import mmcv
import numpy as np

118
img = mmcv.imread('tests/data/color.jpg')
Kai Chen's avatar
Kai Chen committed
119
120
121

# crop the region (10, 10, 100, 120)
bboxes = np.array([10, 10, 100, 120])
122
patch = mmcv.imcrop(img, bboxes)
Kai Chen's avatar
Kai Chen committed
123
124

# crop two regions (10, 10, 100, 120) and (0, 0, 50, 50)
Kai Chen's avatar
Kai Chen committed
125
bboxes = np.array([[10, 10, 100, 120], [0, 0, 50, 50]])
126
patches = mmcv.imcrop(img, bboxes)
Kai Chen's avatar
Kai Chen committed
127

Kai Chen's avatar
Kai Chen committed
128
# crop two regions, and rescale the patches by 1.2x
129
patches = mmcv.imcrop(img, bboxes, scale_ratio=1.2)
Kai Chen's avatar
Kai Chen committed
130
131
132
```

### Padding
Kai Chen's avatar
Kai Chen committed
133
134
There are two methods `impad` and `impad_to_multiple` to pad an image to the
specific size with given values.
Kai Chen's avatar
Kai Chen committed
135
136

```python
137
img = mmcv.imread('tests/data/color.jpg')
Kai Chen's avatar
Kai Chen committed
138
139

# pad the image to (1000, 1200) with all zeros
140
img_ = mmcv.impad(img, (1000, 1200), pad_val=0)
Kai Chen's avatar
Kai Chen committed
141
142

# pad the image to (1000, 1200) with different values for three channels.
143
img_ = mmcv.impad(img, (1000, 1200), pad_val=[100, 50, 200])
Kai Chen's avatar
Kai Chen committed
144
145
146

# pad an image so that each edge is a multiple of some value.
img_ = mmcv.impad_to_multiple(img, 32)
147
```