Merge pull request #165 from sshaoshuai/master

Add visualization codes and a quick demo.

Merge pull request #165 from sshaoshuai/master
Add visualization codes and a quick demo.
45311136 · Shaoshuai Shi · GitHub · 9f5d201e · b8d2131a · 45311136
Unverified Commit 45311136 authored Jul 17, 2020 by Shaoshuai Shi Committed by GitHub Jul 17, 2020
5 changed files
--- a/README.md
+++ b/README.md
@@ -2,11 +2,13 @@

 # OpenPCDet

+
 ## Introduction
 `OpenPCDet` is a clear, simple, self-contained open source project for LiDAR-based 3D object detection. 

 It is also the official code release of [`[Part-A^2 net]`](https://arxiv.org/abs/1907.03670) and [`[PV-RCNN]`](https://arxiv.org/abs/1912.13192). 

+
 ### What does `OpenPCDet` toolbox do?

 Note that we have upgrated `PCDet` from `v0.1` to `v0.2` with pretty new structures to support various datasets and models.
@@ -48,6 +50,16 @@ Contributions are also welcomed.
 - [x] Support GPU version 3D IoU calculation and rotated NMS 


+
+
+## ChangeLog
+[2020-07-17] Add simple visualization codes and a quick demo to test with custom data. 
+
+[2020-06-24] `OpenPCDet` v0.2.0 is released with pretty new structures to support more models and datasets. 
+
+[2020-03-16] `OpenPCDet` v0.1.0 is released. 
+
+
 ## Model Zoo

 ### KITTI 3D Object Detection Baselines
@@ -71,10 +83,15 @@ More datasets are on the way.
 Please refer to [INSTALL.md](docs/INSTALL.md) for installation and dataset preparation.


+## Quick Demo
+Please refer to [DEMO.md](docs/DEMO.md) for a quick demo to test with a pretrained model and 
+visualize the predicted results on your custom data or the original KITTI data.
+
 ## Get Started

 Please refer to [GETTING_STARTED.md](docs/GETTING_STARTED.md) to learn more usage about this project.

+
 ## License

 `OpenPCDet` is released under the [Apache 2.0 license](LICENSE).

--- a/docs/DEMO.md
+++ b/docs/DEMO.md
+## Quick Demo
+
+Here we provide a quick demo to test a pretrained model on the custom point cloud data and visualize the predicted results. 
+
+We suppose you already followed the [INSTALL.md](INSTALL.md) to install the `OpenPCDet` repo successfully. 
+
+1. Download the provided pretrained models as shown in the [README.md](../README.md). 
+
+2. Make sure you have already installed the `mayavi` visualization tools. If not, you could install it as follows:
+   ```
+   pip install mayavi
+   ```
+
+3. Prepare you custom point cloud data (skip this step if you use the original KITTI data). 
+   * You need to transform the coordinate of your custom point cloud to 
+the unified normative coordinate of `OpenPCDet`, that is, x-axis points towards to front direction, 
+y-axis points towards to the left direction, and z-axis points towards to the top direction.
+   * (Optional) the z-axis origin of your point cloud coordinate should be about 1.6m above the ground surface, 
+   since currently the provided models are trained on the KITTI dataset. 
+   * Set the intensity information, and save your transformed custom data to `numpy file`: 
+   ```python
+   # Transform your point cloud data
+   ...
+   
+   # Save it to the file. 
+   # The shape of points should be (num_points, 4), that is [x, y, z, intensity],  
+   # If you doesn't have the intensity information, just set them to zeros. 
+   # If you have the intensity information, you should normalize them to [0, 1].
+   points[:, 3] = 0 
+   np.save(`my_data.npy`, points) 
+   ```      
+  
+4. Run the demo with a pretrained model (e.g. PV-RCNN) and your custom point cloud data as follows:
+```shell
+python demo.py --cfg_file cfgs/kitti_models/pv_rcnn.yaml \
+    --ckpt pv_rcnn_8369.pth \
+    --data_path ${POINT_CLOUD_DATA}
+```
+Here `${POINT_CLOUD_DATA}` could be the following format: 
+* Your transformed custom data with a single numpy file like `my_data.npy`.  
+* Your transformed custom data with a directory to test with multiple point cloud data.
+* The original KITTI `.bin` data within `data/kitti`, like `data/kitti/training/velodyne/000008.bin`.   
+
+Then you could see the predicted results with visualized point cloud as follows:
+
+<p align="center">
+  <img src="demo.png" width="99%">
+</p>
--- a/docs/demo.png
+++ b/docs/demo.png
--- a/tools/demo.py
+++ b/tools/demo.py
+import torch
+import argparse
+import glob
+import numpy as np
+from pathlib import Path
+from pcdet.datasets import DatasetTemplate
+from pcdet.models import build_network, load_data_to_gpu
+from pcdet.config import cfg, cfg_from_yaml_file
+from pcdet.utils import common_utils
+from visual_utils import visualize_utils as V
+import mayavi.mlab as mlab
+
+
+class DemoDataset(DatasetTemplate):
+    def __init__(self, dataset_cfg, class_names, training=True, root_path=None, logger=None, ext='.bin'):
+        """
+        Args:
+            root_path:
+            dataset_cfg:
+            class_names:
+            training:
+            logger:
+        """
+        super().__init__(
+            dataset_cfg=dataset_cfg, class_names=class_names, training=training, root_path=root_path, logger=logger
+        )
+        self.root_path = root_path
+        self.ext = ext
+        data_file_list = glob.glob(str(root_path / f'*{self.ext}')) if self.root_path.is_dir() else [self.root_path]
+
+        data_file_list.sort()
+        self.sample_file_list = data_file_list
+
+    def __len__(self):
+        return len(self.sample_file_list)
+
+    def __getitem__(self, index):
+        if self.ext == '.bin':
+            points = np.fromfile(self.sample_file_list[index], dtype=np.float32).reshape(-1, 4)
+        elif self.ext == '.npy':
+            points = np.load(self.sample_file_list[index])
+        else:
+            raise NotImplementedError
+
+        input_dict = {
+            'points': points,
+            'frame_id': index,
+        }
+
+        data_dict = self.prepare_data(data_dict=input_dict)
+        return data_dict
+
+
+def parse_config():
+    parser = argparse.ArgumentParser(description='arg parser')
+    parser.add_argument('--cfg_file', type=str, default='cfgs/kitti_models/second.yaml',
+                        help='specify the config for demo')
+    parser.add_argument('--data_path', type=str, default='demo_data',
+                        help='specify the point cloud data file or directory')
+    parser.add_argument('--ckpt', type=str, default=None, help='specify the pretrained model')
+    parser.add_argument('--ext', type=str, default='.bin', help='specify the extension of your point cloud data file')
+
+    args = parser.parse_args()
+
+    cfg_from_yaml_file(args.cfg_file, cfg)
+
+    return args, cfg
+
+
+def main():
+    args, cfg = parse_config()
+    logger = common_utils.create_logger()
+    logger.info('-----------------Quick Demo of OpenPCDet-------------------------')
+    demo_dataset = DemoDataset(
+        dataset_cfg=cfg.DATA_CONFIG, class_names=cfg.CLASS_NAMES, training=False,
+        root_path=Path(args.data_path), ext=args.ext, logger=logger
+    )
+    logger.info(f'Total number of samples: \t{len(demo_dataset)}')
+
+    model = build_network(model_cfg=cfg.MODEL, num_class=len(cfg.CLASS_NAMES), dataset=demo_dataset)
+    model.load_params_from_file(filename=args.ckpt, logger=logger, to_cpu=True)
+    model.cuda()
+    model.eval()
+    with torch.no_grad():
+        for idx, data_dict in enumerate(demo_dataset):
+            logger.info(f'Visualized sample index: \t{idx + 1}')
+            data_dict = demo_dataset.collate_batch([data_dict])
+            load_data_to_gpu(data_dict)
+            pred_dicts, _ = model.forward(data_dict)
+
+            V.draw_scenes(
+                points=data_dict['points'][:, 1:], ref_boxes=pred_dicts[0]['pred_boxes'],
+                ref_scores=pred_dicts[0]['pred_scores'], ref_labels=pred_dicts[0]['pred_labels']
+            )
+            mlab.show(stop=True)
+
+    logger.info('Demo done.')
+
+
+if __name__ == '__main__':
+    main()
\ No newline at end of file
--- a/tools/visual_utils/visualize_utils.py
+++ b/tools/visual_utils/visualize_utils.py
+import numpy as np
+import mayavi.mlab as mlab
+from pcdet.utils import box_utils
+
+box_colormap = [
+    [1, 1, 1],
+    [0, 1, 0],
+    [0, 1, 1],
+    [1, 1, 0],
+]
+
+
+def visualize_pts(pts, fig=None, bgcolor=(0, 0, 0), fgcolor=(1.0, 1.0, 1.0),
+                  show_intensity=False, size=(600, 600), draw_origin=True):
+    if not isinstance(pts, np.ndarray):
+        pts = pts.cpu().numpy()
+    if fig is None:
+        fig = mlab.figure(figure=None, bgcolor=bgcolor, fgcolor=fgcolor, engine=None, size=size)
+
+    if show_intensity:
+        G = mlab.points3d(pts[:, 0], pts[:, 1], pts[:, 2], pts[:, 3], mode='point',
+                          colormap='gnuplot', scale_factor=1, figure=fig)
+    else:
+        G = mlab.points3d(pts[:, 0], pts[:, 1], pts[:, 2], mode='point',
+                          colormap='gnuplot', scale_factor=1, figure=fig)
+    if draw_origin:
+        mlab.points3d(0, 0, 0, color=(1, 1, 1), mode='cube', scale_factor=0.2)
+        mlab.plot3d([0, 3], [0, 0], [0, 0], color=(0, 0, 1), tube_radius=0.1)
+        mlab.plot3d([0, 0], [0, 3], [0, 0], color=(0, 1, 0), tube_radius=0.1)
+        mlab.plot3d([0, 0], [0, 0], [0, 3], color=(1, 0, 0), tube_radius=0.1)
+
+    return fig
+
+
+def draw_grid(x1, y1, x2, y2, fig, tube_radius=None, color=(0.5, 0.5, 0.5)):
+    mlab.plot3d([x1, x1], [y1, y2], [0, 0], color=color, tube_radius=tube_radius, line_width=1, figure=fig)
+    mlab.plot3d([x2, x2], [y1, y2], [0, 0], color=color, tube_radius=tube_radius, line_width=1, figure=fig)
+    mlab.plot3d([x1, x2], [y1, y1], [0, 0], color=color, tube_radius=tube_radius, line_width=1, figure=fig)
+    mlab.plot3d([x1, x2], [y2, y2], [0, 0], color=color, tube_radius=tube_radius, line_width=1, figure=fig)
+    return fig
+
+
+def draw_multi_grid_range(fig, grid_size=20, bv_range=(-60, -60, 60, 60)):
+    for x in range(bv_range[0], bv_range[2], grid_size):
+        for y in range(bv_range[1], bv_range[3], grid_size):
+            fig = draw_grid(x, y, x + grid_size, y + grid_size, fig)
+
+    return fig
+
+
+def draw_scenes(points, gt_boxes=None, ref_boxes=None, ref_scores=None, ref_labels=None):
+    if not isinstance(points, np.ndarray):
+        points = points.cpu().numpy()
+    if ref_boxes is not None and not isinstance(ref_boxes, np.ndarray):
+        ref_boxes = ref_boxes.cpu().numpy()
+    if gt_boxes is not None and not isinstance(gt_boxes, np.ndarray):
+        gt_boxes = gt_boxes.cpu().numpy()
+    if ref_scores is not None and not isinstance(ref_scores, np.ndarray):
+        ref_scores = ref_scores.cpu().numpy()
+    if ref_labels is not None and not isinstance(ref_labels, np.ndarray):
+        ref_labels = ref_labels.cpu().numpy()
+
+    fig = visualize_pts(points)
+    fig = draw_multi_grid_range(fig, bv_range=(0, -40, 80, 40))
+    if gt_boxes is not None:
+        corners3d = box_utils.boxes_to_corners_3d(gt_boxes)
+        fig = draw_corners3d(corners3d, fig=fig, color=(0, 0, 1), max_num=100)
+
+    if ref_boxes is not None:
+        ref_corners3d = box_utils.boxes_to_corners_3d(ref_boxes)
+        if ref_labels is None:
+            fig = draw_corners3d(ref_corners3d, fig=fig, color=(0, 1, 0), cls=ref_scores, max_num=100)
+        else:
+            for k in range(ref_labels.min(), ref_labels.max() + 1):
+                cur_color = tuple(box_colormap[k % len(box_colormap)])
+                mask = (ref_labels == k)
+                fig = draw_corners3d(ref_corners3d[mask], fig=fig, color=cur_color, cls=ref_scores[mask], max_num=100)
+    mlab.view(azimuth=-179, elevation=54.0, distance=104.0, roll=90.0)
+    return fig
+
+
+def draw_corners3d(corners3d, fig, color=(1, 1, 1), line_width=2, cls=None, tag='', max_num=500, tube_radius=None):
+    """
+    :param corners3d: (N, 8, 3)
+    :param fig:
+    :param color:
+    :param line_width:
+    :param cls:
+    :param tag:
+    :param max_num:
+    :return:
+    """
+    import mayavi.mlab as mlab
+    num = min(max_num, len(corners3d))
+    for n in range(num):
+        b = corners3d[n]  # (8, 3)
+
+        if cls is not None:
+            if isinstance(cls, np.ndarray):
+                mlab.text3d(b[6, 0], b[6, 1], b[6, 2], '%.2f' % cls[n], scale=(0.3, 0.3, 0.3), color=color, figure=fig)
+            else:
+                mlab.text3d(b[6, 0], b[6, 1], b[6, 2], '%s' % cls[n], scale=(0.3, 0.3, 0.3), color=color, figure=fig)
+
+        for k in range(0, 4):
+            i, j = k, (k + 1) % 4
+            mlab.plot3d([b[i, 0], b[j, 0]], [b[i, 1], b[j, 1]], [b[i, 2], b[j, 2]], color=color, tube_radius=tube_radius,
+                        line_width=line_width, figure=fig)
+
+            i, j = k + 4, (k + 1) % 4 + 4
+            mlab.plot3d([b[i, 0], b[j, 0]], [b[i, 1], b[j, 1]], [b[i, 2], b[j, 2]], color=color, tube_radius=tube_radius,
+                        line_width=line_width, figure=fig)
+
+            i, j = k, k + 4
+            mlab.plot3d([b[i, 0], b[j, 0]], [b[i, 1], b[j, 1]], [b[i, 2], b[j, 2]], color=color, tube_radius=tube_radius,
+                        line_width=line_width, figure=fig)
+
+        i, j = 0, 5
+        mlab.plot3d([b[i, 0], b[j, 0]], [b[i, 1], b[j, 1]], [b[i, 2], b[j, 2]], color=color, tube_radius=tube_radius,
+                    line_width=line_width, figure=fig)
+        i, j = 1, 4
+        mlab.plot3d([b[i, 0], b[j, 0]], [b[i, 1], b[j, 1]], [b[i, 2], b[j, 2]], color=color, tube_radius=tube_radius,
+                    line_width=line_width, figure=fig)
+
+    return fig