Update weights & fix layer decay bug (#118)

Co-authored-by: zhiqil <zhiqil@nvidia.com>

Unverified Commit dbf29e61 authored Apr 20, 2023 by Zhiqi Li Committed by GitHub Apr 20, 2023
2 changed files
--- a/autonomous_driving/occupancy_prediction/README.md
+++ b/autonomous_driving/occupancy_prediction/README.md



+<div id="top" align="center">
+
+# CVPR 2023 3D Occupancy Prediction Challenge
+**The world's First 3D Occupancy Benchmark for Scene Perception in Autonomous Driving.**
+
+
+
+

-# InternImage-based Baseline for Occupancy Prediction
+<a href="#devkit">
+  <img alt="devkit: v0.1.0" src="https://img.shields.io/badge/devkit-v0.1.0-blueviolet"/>
+</a>
+<a href="#license">
+  <img alt="License: Apache2.0" src="https://img.shields.io/badge/license-Apache%202.0-blue.svg"/>
+</a>
+
+<img src="./figs/occupanc_1.gif" width="696px">
+
+</div>
+
+## InternImage-based Baseline for CVPR23 Occupancy Prediction Challenge!!!!

 We improve our baseline with a more powerful image backbone: **InternImage**, which shows its excellent ability within a series of leaderboards and benchmarks, such as *COCO* and *nuScenes*.


-#### 1. Openmmlab packages requirements
+#### openmmlab packages requirements
 ```bash
 torch==1.12 # recommend
 mmcv-full>=1.5.0
@@ -17,13 +36,13 @@ timm
 numpy==1.22
 ```

-### 2. Install DCNv3 for InternImage
+### Install DCNv3 for InternImage
 ```bash
 cd projects/mmdet3d_plugin/bevformer/backbones/ops_dcnv3
 bash make.sh # requires torch>=1.10
 ```

-### 3. Train with InternImage-Small
+### Train with InternImage-Small

 ```bash
 ./tools/dist_train.sh projects/configs/bevformer/bevformer_intern-s_occ.py 8 # consumes less than 14G memory
@@ -32,8 +51,13 @@ bash make.sh # requires torch>=1.10
 Notes: InternImage provides abundant pre-trained model weights that can be used!!!


+### Performance compared to baseline
+
+model name|weight| mIoU | others | barrier | bicycle | bus | car | construction_vehicle | motorcycle | pedestrian | traffic_cone | trailer |  truck | driveable_surface | other_flat | sidewalk | terrain | manmade | vegetation | 
+----|:----------:| :--: | :--: | :--: | :--: | :--: | :--: | :--: | :--: | :--: | :--: | :--: | :--: | :--: | :--: | :----------------------: | :---: | :------: | :------: |
+bevformer_intern-s_occ|[Google Drive](https://drive.google.com/file/d/1LV9K8hrskKf51xY1wbqTKzK7WZmVXEV_/view?usp=sharing)| 25.11 | 6.93 | 35.57 | 10.40 | 35.97 | 41.23 | 13.72 | 20.30 | 21.10 | 18.34 | 19.18 | 28.64 | 49.82 | 30.74 | 31.00 | 27.44 | 19.29 | 17.29 | 
+bevformer_base_occ|[Google Drive](https://drive.google.com/file/d/1NyoiosafAmne1qiABeNOPXR-P-y0i7_I/view?usp=share_link)| 23.67 | 5.03 | 38.79 | 9.98 | 34.41 | 41.09 | 13.24 | 16.50 | 18.15 | 17.83 | 18.66 | 27.70 | 48.95 | 27.73 | 29.08 | 25.38 | 15.41 | 14.46 | 

-# CVPR2023 Occupancy Prediction Challenge Information
 ## Table of Contents
 - [CVPR 2023 Occupancy Prediction Challenge](#cvpr-2023-occupancy-prediction-challenge)
  - [Introduction](#introduction)

--- a/autonomous_driving/occupancy_prediction/projects/mmdet3d_plugin/bevformer/backbones/custom_layer_decay_optimizer_constructor.py
+++ b/autonomous_driving/occupancy_prediction/projects/mmdet3d_plugin/bevformer/backbones/custom_layer_decay_optimizer_constructor.py
@@ -16,12 +16,12 @@ from mmdet.utils import get_root_logger


 def get_num_layer_for_swin(var_name, num_max_layer, depths):
-    if var_name.startswith("backbone.patch_embed"):
+    if var_name.startswith("img_backbone.patch_embed"):
        return 0
    elif "level_embeds" in var_name:
        return 0
-    elif var_name.startswith("backbone.layers") or var_name.startswith(
-            "backbone.levels"):
+    elif var_name.startswith("img_backbone.layers") or var_name.startswith(
+            "img_backbone.levels"):
        if var_name.split('.')[3] not in ['downsample', 'norm']:
            stage_id = int(var_name.split('.')[2])
            layer_id = int(var_name.split('.')[4])
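
The prefix change above matters because `str.startswith` is an exact prefix test: a parameter registered under `img_backbone.` never matches a check written for `backbone.`, so before the fix no backbone parameter could reach the `patch_embed` or stage branches. The sketch below illustrates only that matching behaviour. It is not the function from this commit: `branch_for` and the parameter names are hypothetical, and the rest of `get_num_layer_for_swin` (including what it returns when nothing matches) is not shown in this diff.

```python
def branch_for(var_name, prefix):
    """Report which branch of a prefix-based lookup a parameter name would take.

    Hypothetical helper: it mirrors only the startswith checks visible in the diff.
    """
    if var_name.startswith(prefix + ".patch_embed"):
        return "patch_embed"
    if var_name.startswith(prefix + ".layers") or var_name.startswith(
            prefix + ".levels"):
        return "stage"
    return "fallthrough"


# Illustrative parameter names (assumed, not taken from the real model).
names = [
    "img_backbone.patch_embed.conv1.weight",
    "img_backbone.levels.0.blocks.1.norm1.weight",
    "pts_bbox_head.cls_branch.weight",
]

old = [branch_for(n, "backbone") for n in names]      # prefix before the fix
new = [branch_for(n, "img_backbone") for n in names]  # prefix after the fix

for name, o, n in zip(names, old, new):
    print(f"{name}: old={o}, new={n}")
```

With the old prefix every name falls through, whereas with the new prefix the two backbone parameters reach their intended branches and the head parameter still falls through.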