# Panoptic Segmentation ## Description Panoptic Segmentation combines the two distinct vision tasks - semantic segmentation and instance segmentation. These tasks are unified such that, each pixel in the image is assigned the label of the class it belongs to, and also the instance identifier of the object it is a part of. ## Environment setup The code can be run on multiple GPUs or TPUs with different distribution strategies. See the TensorFlow distributed training [guide](https://www.tensorflow.org/guide/distributed_training) for an overview of `tf.distribute`. The code is compatible with TensorFlow 2.6+. See requirements.txt for all prerequisites. ```bash $ git clone https://github.com/tensorflow/models.git $ cd models $ pip3 install -r official/requirements.txt $ export PYTHONPATH=$(pwd) ``` ## Preparing Dataset ```bash $ ./official/vision/beta/data/process_coco_panoptic.sh ``` ## Launch Training ```bash $ export MODEL_DIR="gs://" $ export TPU_NAME="" $ export ANNOTATION_FILE="gs://" $ export TRAIN_DATA="gs://" $ export EVAL_DATA="gs://" $ export OVERRIDES="task.validation_data.input_path=${EVAL_DATA},\ task.train_data.input_path=${TRAIN_DATA},\ task.annotation_file=${ANNOTATION_FILE},\ runtime.distribution_strategy=tpu" $ python3 train.py \ --experiment panoptic_fpn_coco \ --config_file configs/experiments/r50fpn_1x_coco.yaml \ --mode train \ --model_dir $MODEL_DIR \ --tpu $TPU_NAME \ --params_override=$OVERRIDES ``` ## Launch Evaluation ```bash $ export MODEL_DIR="gs://" $ export NUM_GPUS="" $ export PRECISION="" $ export ANNOTATION_FILE="gs://" $ export TRAIN_DATA="gs://" $ export EVAL_DATA="gs://" $ export OVERRIDES="task.validation_data.input_path=${EVAL_DATA}, \ task.train_data.input_path=${TRAIN_DATA}, \ task.annotation_file=${ANNOTATION_FILE}, \ runtime.distribution_strategy=mirrored, \ runtime.mixed_precision_dtype=$PRECISION, \ runtime.num_gpus=$NUM_GPUS" $ python3 train.py \ --experiment panoptic_fpn_coco \ --config_file configs/experiments/r50fpn_1x_coco.yaml \ --mode eval \ --model_dir $MODEL_DIR \ --params_override=$OVERRIDES ``` **Note**: The [PanopticSegmentationGenerator](https://github.com/tensorflow/models/blob/ac7f9e7f2d0508913947242bad3e23ef7cae5a43/official/projects/panoptic/modeling/layers/panoptic_segmentation_generator.py#L22) layer uses dynamic shapes and hence generating panoptic masks is not supported on Cloud TPUs. Running evaluation on Cloud TPUs is not supported for the same reason. However, training is supported on both Cloud TPUs and GPUs. ## Pretrained Models ### Panoptic FPN Backbone | Schedule | Experiment name | Box mAP | Mask mAP | Overall PQ | Things PQ | Stuff PQ | Checkpoints :------------| :----------- | :---------------------------| ------- | ---------- | ---------- | --------- | -------- | ------------: ResNet-50 | 1x | `panoptic_fpn_coco` | 38.19 | 34.25 | 39.14 | 45.42 | 29.65 | [ckpt](gs://tf_model_garden/vision/panoptic/panoptic_fpn/panoptic_fpn_1x) ResNet-50 | 3x | `panoptic_fpn_coco` | 40.64 | 36.29 | 40.91 | 47.68 | 30.69 | [ckpt](gs://tf_model_garden/vision/panoptic/panoptic_fpn/panoptic_fpn_3x) **Note**: Here 1x schedule refers to ~12 epochs ### Panoptic Deeplab Backbone | Experiment name | Overall PQ | Things PQ | Stuff PQ | Checkpoints :---------------------| :-------------------------------| ---------- | --------- | -------- | ------------: Dilated ResNet-50 | `panoptic_deeplab_resnet_coco` | 36.80 | 37.51 | 35.73 | [ckpt](gs://tf_model_garden/vision/panoptic/panoptic_deeplab/coco/resnet50) Dilated ResNet-101 | `panoptic_deeplab_resnet_coco` | 38.39 | 39.47 | 36.75 | [ckpt](gs://tf_model_garden/vision/panoptic/panoptic_deeplab/coco/resnet101) MobileNetV3 Large | `panoptic_deeplab_mobilenetv3_large_coco` | 30.50 | 30.10 | 31.10 | [ckpt](gs://tf_model_garden/vision/panoptic/panoptic_deeplab/coco/mobilenetv3_large) MobileNetV3 Small | `panoptic_deeplab_mobilenetv3_small_coco` | 25.06 | 23.46 | 27.48 | [ckpt](gs://tf_model_garden/vision/panoptic/panoptic_deeplab/coco/mobilenetv3_small) ___ ## Citation ``` @misc{kirillov2019panoptic, title={Panoptic Feature Pyramid Networks}, author={Alexander Kirillov and Ross Girshick and Kaiming He and Piotr Dollár}, year={2019}, eprint={1901.02446}, archivePrefix={arXiv}, primaryClass={cs.CV} } @article{Cheng2020PanopticDeepLabAS, title={Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation}, author={Bowen Cheng and Maxwell D. Collins and Yukun Zhu and Ting Liu and Thomas S. Huang and Hartwig Adam and Liang-Chieh Chen}, journal={2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, year={2020}, pages={12472-12482} } ```