README.md 1.91 KB
Newer Older
Jing Zhang's avatar
Jing Zhang committed
1
# Instructions for ```pool2d_fwd``` Example
Qianfeng's avatar
Qianfeng committed
2
3
4
5
6
7
8
9
10
11
12
13
14
15

## Docker script
```bash
docker run                                                                   \
-it                                                                          \
--rm                                                                         \
--privileged                                                                 \
--group-add sudo                                                             \
-w /root/workspace                                                           \
-v ${PATH_TO_LOCAL_WORKSPACE}:/root/workspace                                \
rocm/tensorflow:rocm4.3.1-tf2.6-dev                                          \
/bin/bash
```

Jing Zhang's avatar
Jing Zhang committed
16
## Build ```pool2d_fwd```
Qianfeng's avatar
Qianfeng committed
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
```bash
mkdir build && cd build
```

```bash
# Need to specify target ID, example below is gfx908
cmake                                                                  \
-D BUILD_DEV=OFF                                                       \
-D CMAKE_BUILD_TYPE=Release                                            \
-D CMAKE_CXX_FLAGS="-DCK_AMD_GPU_GFX908 --amdgpu-target=gfx908 -O3 "   \
-D CMAKE_CXX_COMPILER=/opt/rocm/bin/hipcc                              \
-D CMAKE_PREFIX_PATH=/opt/rocm                                         \
..
```

```bash
Jing Zhang's avatar
Jing Zhang committed
33
 make -j pool2d_fwd
Qianfeng's avatar
Qianfeng committed
34
35
```

Jing Zhang's avatar
Jing Zhang committed
36
## Run ```pool2d_fwd```
Qianfeng's avatar
Qianfeng committed
37
38
39
40
```bash
#arg1: verification (0=no, 1=yes)
#arg2: initialization (0=no init, 1=integer value, 2=decimal value)
#arg3: run kernel # of times (>1)
Jing Zhang's avatar
Jing Zhang committed
41
42
#arg4 to 15: N, C, Y, X, Hi, Wi, Sy, Sx, LeftPy, LeftPx, RightPy, RightPx
./example/pool2d_fwd 1 1 10
Qianfeng's avatar
Qianfeng committed
43
44
```

Jing Zhang's avatar
Jing Zhang committed
45
Result 
Qianfeng's avatar
Qianfeng committed
46
```
Jing Zhang's avatar
Jing Zhang committed
47
48
49
in_n_c_hi_wi: dim 4, lengths {128, 192, 71, 71}, strides {967872, 1, 13632, 192}
out_n_c_ho_wo: dim 4, lengths {128, 192, 36, 36}, strides {248832, 1, 6912, 192}
launch_and_time_kernel: grid_dim {124416, 1, 1}, block_dim {64, 1, 1} 
Jing Zhang's avatar
Jing Zhang committed
50
Warm up
Jing Zhang's avatar
Jing Zhang committed
51
Start running 10 times...
Jing Zhang's avatar
Jing Zhang committed
52
Perf: 0.415453 ms, 1.37996 TFlops, 749.726 GB/s
Jing Zhang's avatar
Jing Zhang committed
53
error: 0
Jing Zhang's avatar
Jing Zhang committed
54
max_diff: 0, 1, 1
Qianfeng's avatar
Qianfeng committed
55
```