.. _models_new:

Models and pre-trained weights - New
####################################

.. note::

    These are the new models docs, documenting the new multi-weight API.
    TODO: Once all is done, remove the "- New" part in the title above, and
    rename this file as models.rst


The ``torchvision.models`` subpackage contains definitions of models for addressing
different tasks, including: image classification, pixelwise semantic
segmentation, object detection, instance segmentation, person
keypoint detection, video classification, and optical flow.

.. note::
    Backward compatibility is guaranteed for loading a serialized
    ``state_dict`` into a model created using an older PyTorch version.
    By contrast, loading entire saved models or serialized
    ``ScriptModules`` (serialized using older versions of PyTorch)
    may not preserve the historic behaviour. Refer to the following
    `documentation
    <https://pytorch.org/docs/stable/notes/serialization.html#id6>`_.
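
As a minimal sketch of the ``state_dict`` workflow described in the note
above, the snippet below saves only the parameters and buffers of a model and
restores them into a freshly constructed instance. The choice of ``resnet18``
and of an in-memory buffer are assumptions made to keep the example
self-contained; a file path can be passed to ``torch.save`` and ``torch.load``
instead.

.. code-block:: python

    import io

    import torch
    from torchvision.models import resnet18

    # Build a model without pre-trained weights (nothing is downloaded).
    model = resnet18(weights=None)

    # Serialize only the state_dict, not the whole model object.
    buffer = io.BytesIO()
    torch.save(model.state_dict(), buffer)
    buffer.seek(0)

    # Re-create the architecture from code, then load the saved tensors.
    restored = resnet18(weights=None)
    state_dict = torch.load(buffer)
    result = restored.load_state_dict(state_dict)

    # Both lists are empty when every key matched.
    print(result.missing_keys, result.unexpected_keys)

Because the architecture is rebuilt from the current source code and only
tensors are read back, this pattern does not rely on pickled class
definitions, unlike saving the entire model object.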


Classification
==============

.. currentmodule:: torchvision.models

The following classification models are available, with or without pre-trained
weights:

.. toctree::
   :maxdepth: 1

   models/alexnet
   models/convnext
   models/densenet
   models/efficientnet
   models/efficientnetv2
   models/googlenet
   models/inception
   models/mobilenetv2
   models/mobilenetv3
   models/regnet
   models/resnet
   models/resnext
   models/shufflenetv2
   models/squeezenet
   models/swin_transformer
   models/vgg
   models/vision_transformer
   models/wide_resnet
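
The following is a minimal sketch of building a classification model with or
without pre-trained weights. It assumes torchvision 0.13 or later, where model
builders accept a ``weights`` argument, and uses ResNet-50 purely as an
illustration. The random tensor stands in for a real image, and the
pre-trained variant is left commented out because it downloads a checkpoint
on first use.

.. code-block:: python

    import torch
    from torchvision.models import resnet50, ResNet50_Weights

    # Without pre-trained weights: parameters are randomly initialised.
    model = resnet50(weights=None)
    model.eval()

    # Selecting a weights entry does not download anything by itself.
    weights = ResNet50_Weights.DEFAULT
    preprocess = weights.transforms()        # preprocessing tied to these weights
    categories = weights.meta["categories"]  # class names for these weights

    # Placeholder input: a random uint8 image in (C, H, W) layout.
    img = torch.randint(0, 256, (3, 300, 400), dtype=torch.uint8)
    batch = preprocess(img).unsqueeze(0)

    with torch.no_grad():
        logits = model(batch)

    print(batch.shape, logits.shape, len(categories))

    # With pre-trained weights (downloads the checkpoint on first use):
    # model = resnet50(weights=weights)

Taking the preprocessing transforms from the weights entry keeps resizing and
normalisation consistent with how those weights were evaluated.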


Table of all available classification weights
---------------------------------------------

Accuracies are reported on ImageNet:

.. include:: generated/classification_table.rst

Semantic Segmentation
=====================

.. currentmodule:: torchvision.models.segmentation

The following semantic segmentation models are available, with or without
pre-trained weights:

.. toctree::
   :maxdepth: 1

   models/deeplabv3
   models/fcn
   models/lraspp

Table of all available semantic segmentation weights
----------------------------------------------------

All models are evaluated on COCO val2017:

.. include:: generated/segmentation_table.rst



Object Detection, Instance Segmentation and Person Keypoint Detection
=====================================================================

.. currentmodule:: torchvision.models.detection

The following detection models are available, with or without pre-trained
weights:

.. toctree::
   :maxdepth: 1

   models/faster_rcnn
   models/fcos
   models/mask_rcnn
   models/retinanet
   models/ssdlite

Table of all available detection weights
----------------------------------------

Box mAPs are reported on COCO:

.. include:: generated/detection_table.rst


Video Classification
====================

.. currentmodule:: torchvision.models.video

The following video classification models are available, with or without
pre-trained weights:

.. toctree::
   :maxdepth: 1

   models/video_resnet

Table of all available video classification weights
---------------------------------------------------

Accuracies are reported on Kinetics-400:

.. include:: generated/video_table.rst