Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
dcnv3
Commits
6be127ee
"vscode:/vscode.git/clone" did not exist on "5820c7505c598091fdb6ef27166af57f8246bdd3"
Commit
6be127ee
authored
Mar 16, 2023
by
Wenhai Wang
Committed by
zhe chen
Mar 16, 2023
Browse files
Update README.md
parent
43e508f4
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
29 additions
and
19 deletions
+29
-19
README.md
README.md
+29
-19
No files found.
README.md
View file @
6be127ee
...
...
@@ -159,28 +159,36 @@
-
[x] InternImage-T/S/B/L/XL 语义分割模型
##
开源模型
##
相关开源项目
-
目标检测和实例分割:
[
COCO
](
detection/configs/coco/
)
-
语义分割:
[
ADE20K
](
segmentation/configs/ade20k/
)
,
[
Cityscapes
](
segmentation/configs/cityscapes/
)
-
图文检索、图像描述和视觉问答:
[
Uni-Perceiver
](
https://github.com/fundamentalvision/Uni-Perceiver
)
-
3D感知:
[
BEVFormer
](
https://github.com/fundamentalvision/BEVFormer
)
## 开源视觉预训练模型
| name | pretrain data | pre-training resolution | #param | FLOPs | Download |
| :------------: | :--------: | :--------: | :-----: | :---: | :-----------------: |
| InternImage-L | ImageNet-22K | 384x384 | 223M | 108G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_l_22k_192to384.pth
)
|
| InternImage-XL | ImageNet-22K | 384x384 | 335M | 163G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_xl_22k_192to384.pth
)
|
| InternImage-H | Joint 427M | 384x384 | 1.08B | 1478G | (2023/03/16) |
| InternImage-G | - | 384x384 | 3B | (2023/03/16) | (2023/03/16) |
## 经典视觉任务性能
**ImageNet图像分类**
| name | pretrain | resolution | acc@1 | #param | FLOPs | 22K model | 1K model |
| :------------: | :----------: | :--------: | :---: | :-----: | :---: | :-----------------: | :-----------------: |
| InternImage-T | ImageNet-1K | 224x224 | 83.5 | 30M | 5G | - |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_t_1k_224.pth
)
\|
[
cfg
](
classification/configs/internimage_t_1k_224.yaml
)
|
| InternImage-S | ImageNet-1K | 224x224 | 84.2 | 50M | 8G | - |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_s_1k_224.pth
)
\|
[
cfg
](
classification/configs/internimage_s_1k_224.yaml
)
|
| InternImage-B | ImageNet-1K | 224x224 | 84.9 | 97M | 16G | - |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_b_1k_224.pth
)
\|
[
cfg
](
classification/configs/internimage_b_1k_224.yaml
)
|
| InternImage-L | ImageNet-22K | 384x384 | 87.7 | 223M | 108G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_l_22k_192to384.pth
)
|
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_l_22kto1k_384.pth
)
\|
[
cfg
](
classification/configs/internimage_l_22kto1k_384.yaml
)
|
| InternImage-XL | ImageNet-22K | 384x384 | 88.0 | 335M | 163G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_xl_22k_192to384.pth
)
|
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_xl_22kto1k_384.pth
)
\|
[
cfg
](
classification/configs/internimage_xl_22kto1k_384.yaml
)
|
| InternImage-H | Joint 427M | 640x640 | 89.6 | 1.08B | 1478G | TODO |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_h_jointto1k_640.pth
)
\|
[
cfg
](
classification/configs/internimage_h_jointto1k_640.yaml
)
|
| InternImage-G | Joint 427M | 512x512 | 90.1 | 3B | TODO | TODO |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_g_jointto1k_512.pth
)
\|
[
cfg
](
classification/configs/internimage_g_jointto1k_512.yaml
)
|
## ImageNet-1K图像分类
| name | resolution | acc@1 | #param | FLOPs | Download |
| :------------: | :--------: | :---: | :-----: | :---: | :-----------------: |
| InternImage-T | 224x224 | 83.5 | 30M | 5G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_t_1k_224.pth
)
\|
[
cfg
](
classification/configs/internimage_t_1k_224.yaml
)
|
| InternImage-S | 224x224 | 84.2 | 50M | 8G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_s_1k_224.pth
)
\|
[
cfg
](
classification/configs/internimage_s_1k_224.yaml
)
|
| InternImage-B | 224x224 | 84.9 | 97M | 16G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_b_1k_224.pth
)
\|
[
cfg
](
classification/configs/internimage_b_1k_224.yaml
)
|
| InternImage-L | 384x384 | 87.7 | 223M | 108G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_l_22kto1k_384.pth
)
\|
[
cfg
](
classification/configs/internimage_l_22kto1k_384.yaml
)
|
| InternImage-XL | 384x384 | 88.0 | 335M | 163G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_xl_22kto1k_384.pth
)
\|
[
cfg
](
classification/configs/internimage_xl_22kto1k_384.yaml
)
|
| InternImage-H | 640x640 | 89.6 | 1.08B | 1478G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_h_jointto1k_640.pth
)
\|
[
cfg
](
classification/configs/internimage_h_jointto1k_640.yaml
)
|
| InternImage-G | 512x512 | 90.1 | 3B | (2023/03/16) |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/internimage_g_jointto1k_512.pth
)
\|
[
cfg
](
classification/configs/internimage_g_jointto1k_512.yaml
)
|
**COCO目标检测和实例分割**
## COCO目标检测和实例分割
| backbone | method | schd | box mAP | mask mAP | #param | FLOPs | Download |
| :------------: | :----------------: | :---------: | :-----: | :------: | :-----: | :---: | :---: |
...
...
@@ -196,11 +204,11 @@
| InternImage-XL | Cascade | 3x | 56.2 | 48.8 | 387M | 1782G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/cascade_internimage_xl_fpn_1x_coco.pth
)
\|
[
cfg
](
detection/configs/coco/cascade_internimage_xl_fpn_3x_coco.py
)
|
| backbone | method | box mAP (val/test) | #param | FLOPs | Download |
| :------------: | :----------------: | :---------: | :------: | :-----: | :---: |
| :------------: | :----------------: | :---------: | :------: | :-----: | :---
--
: |
| InternImage-H | DINO (TTA) | 65.0 / 65.4 | 2.18B | TODO | TODO |
| InternImage-G | DINO (TTA) | 65.3 / 65.5 | 3B | TODO | TODO |
**
ADE20K语义分割
**
##
ADE20K语义分割
| backbone | method | resolution | mIoU (ss/ms) | #param | FLOPs | Download |
| :------------: | :--------: | :--------: | :----------: | :-----: | :---: | :---: |
...
...
@@ -210,10 +218,12 @@
| InternImage-L | UperNet | 640x640 | 53.9 / 54.1 | 256M | 2526G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_l_640_160k_ade20k.pth
)
\|
[
cfg
](
segmentation/configs/ade20k/upernet_internimage_l_640_160k_ade20k.py
)
|
| InternImage-XL | UperNet | 640x640 | 55.0 / 55.3 | 368M | 3142G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_xl_640_160k_ade20k.pth
)
\|
[
cfg
](
segmentation/configs/ade20k/upernet_internimage_xl_640_160k_ade20k.py
)
|
| InternImage-H | UperNet | 896x896 | 59.9 / 60.3 | 1.12B | 3566G |
[
ckpt
](
https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_h_896_160k_ade20k.pth
)
\|
[
cfg
](
segmentation/configs/ade20k/upernet_internimage_h_896_160k_ade20k.py
)
|
| InternImage-H | Mask2Former | 896x896 | 62.5 / 62.9 | 1.31B | 4635G | ckpt
\|
cfg |
| InternImage-H | Mask2Former | 896x896 | 62.5 / 62.9 | 1.31B | 4635G | TODO |
## 模型推理速度
**模型推理速度**
[
TensorRT
](
classification/export.py
)
| name | resolution | #param | FLOPs | Batch 1 FPS(TensorRT) |
| :------------: | :--------: | :-----: | :---: | :-------------------: |
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment