multimodal_index.rst 1.3 KB
Newer Older
1
2
.. _multi_modality:

3
4
5
6
7
8
9
10
11
12
Multi-Modality
==============

.. currentmodule:: vllm.multimodal
    
vLLM provides experimental support for multi-modal models through the :mod:`vllm.multimodal` package.

:class:`vllm.inputs.PromptStrictInputs` accepts an additional attribute ``multi_modal_data``
which allows you to pass in multi-modal input alongside text and token prompts.

13
By default, vLLM models do not support multi-modal inputs. To enable multi-modal support for a model, please follow :ref:`the guide for adding a new multimodal model. <adding_a_new_multimodal_model>`.
14
15

# TODO: Add more instructions on how to do that once embeddings is in.
16

17
18
19
20
21
22
23
24
Guides
++++++

.. toctree::
   :maxdepth: 1

   adding_multimodal_model

25
26
27
28
29
30
31
32
Module Contents
+++++++++++++++

.. automodule:: vllm.multimodal

Registry
--------

33
.. autodata:: vllm.multimodal.MULTIMODAL_REGISTRY
34
35
36
37
38
39
40
41

.. autoclass:: vllm.multimodal.MultiModalRegistry
    :members:
    :show-inheritance:

Base Classes
------------

42
.. autoclass:: vllm.multimodal.MultiModalDataDict
43
44
45
    :members:
    :show-inheritance:

46
47
48
49
.. autoclass:: vllm.multimodal.MultiModalInputs
    :members:
    :show-inheritance:

50
51
52
53
54
55
56
57
58
59
.. autoclass:: vllm.multimodal.MultiModalPlugin
    :members:
    :show-inheritance:

Image Classes
-------------

.. automodule:: vllm.multimodal.image
    :members:
    :show-inheritance: