index.md 1.29 KB
Newer Older
1
(multi-modality)=
2

3
# Multi-Modality
4

5
```{eval-rst}
6
.. currentmodule:: vllm.multimodal
7
```
8

9
10
11
12
vLLM provides experimental support for multi-modal models through the {mod}`vllm.multimodal` package.

Multi-modal inputs can be passed alongside text and token prompts to [supported models](#supported-mm-models)
via the `multi_modal_data` field in {class}`vllm.inputs.PromptType`.
13

14
Looking to add your own multi-modal model? Please follow the instructions listed [here](#enabling-multimodal-inputs).
15

16

17
## Module Contents
18

19
```{eval-rst}
20
.. automodule:: vllm.multimodal
21
```
22

23
### Registry
24

25
```{eval-rst}
26
.. autodata:: vllm.multimodal.MULTIMODAL_REGISTRY
27
```
28

29
```{eval-rst}
30
31
32
.. autoclass:: vllm.multimodal.MultiModalRegistry
    :members:
    :show-inheritance:
33
```
34

35
### Base Classes
36

37
```{eval-rst}
38
39
40
.. automodule:: vllm.multimodal.base
    :members:
    :show-inheritance:
41
```
42

43
### Input Classes
44

45
```{eval-rst}
46
.. automodule:: vllm.multimodal.inputs
47
48
    :members:
    :show-inheritance:
49
```
50

51
### Audio Classes
52

53
```{eval-rst}
54
.. automodule:: vllm.multimodal.audio
55
56
    :members:
    :show-inheritance:
57
```
58

59
60
### Image Classes

61
```{eval-rst}
62
.. automodule:: vllm.multimodal.image
63
64
    :members:
    :show-inheritance:
65
```
66

67
### Video Classes
68

69
```{eval-rst}
70
.. automodule:: vllm.multimodal.video
71
72
    :members:
    :show-inheritance:
73
```