README.md 2.12 KB
Newer Older
wofmanaf's avatar
wofmanaf committed
1
2
3
4
5
6
7
8
9
# CrowdHuman

## Introduction

Introduced by Shao et al. in [CrowdHuman: A Benchmark for Detecting Human in a Crowd](https://arxiv.org/pdf/1805.00123.pdf)

CrowdHuman is a benchmark dataset to better evaluate detectors in crowd scenarios. The CrowdHuman dataset is large, rich-annotated and contains high diversity. CrowdHuman contains 15000, 4370 and 5000 images for training, validation, and testing, respectively. There are a total of 470K human instances from train and validation subsets and 23 persons per image, with various kinds of occlusions in the dataset. Each human instance is annotated with a head bounding-box, human visible-region bounding-box and human full-body bounding-box. We hope our dataset will serve as a solid baseline and help promote future research in human detection tasks.

## Prepare the data
zhe chen's avatar
zhe chen committed
10

wofmanaf's avatar
wofmanaf committed
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
Download the original dataset from [CrowdHuman](https://www.crowdhuman.org/download.html). Then convert annotations by detection/tools/create_crowd_anno.py

- Data Tree of CrowdHuman should look like:
  ```bash
  $ tree CrowdHuman
  CrowdHuman
  ├── annotations
  │   ├── annotation_train.json
  │   ├── annotation_train.odgt
  │   ├── annotation_val.json
  │   ├── annotation_val.odgt
  │   └── ...
  └── Images
      ├── 1074488,79b360006b38332b.jpg
      ├── 1074488,79d54000c6f9d9e5.jpg
      └── ...

zhe chen's avatar
zhe chen committed
28
  ```
wofmanaf's avatar
wofmanaf committed
29

zhe chen's avatar
zhe chen committed
30
## Model Zoo
wofmanaf's avatar
wofmanaf committed
31
32
33

### Cascade Mask R-CNN + InternImage

zhe chen's avatar
zhe chen committed
34
35
36
|    backbone    | schd | box mAP | mask mAP | train speed | train time | #param | FLOPs |                          Config                          | Download |
| :------------: | :--: | :-----: | :------: | :---------: | :--------: | :----: | :---: | :------------------------------------------------------: | :------: |
| InternImage-XL |  3x  |   TBD   |   TBD    |     TBD     |    TBD     |  TBD   |  TBD  | [config](./cascade_internimage_xl_fpn_3x_crowd_human.py) |   TBD    |
wofmanaf's avatar
wofmanaf committed
37
38
39

- Training speed is measured with A100 GPUs using current code and may be faster than the speed in logs.
- Some logs are our recent newly trained ones. There might be slight differences between the results in logs and our paper.