.. _multi_modality:

Multi-Modality
==============

.. currentmodule:: vllm.multimodal
    
vLLM provides experimental support for multi-modal models through the :mod:`vllm.multimodal` package.

:class:`vllm.inputs.PromptStrictInputs` accepts an additional attribute ``multi_modal_data``
which allows you to pass in multi-modal input alongside text and token prompts.
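
The prompt structure described above can be sketched as a plain dictionary. This is a minimal illustration, not a definitive usage guide: the helper name ``build_multimodal_prompt`` is hypothetical, and the ``"image"`` key and the prompt template are assumptions that depend on the installed vLLM version and the target model.

.. code-block:: python

    from typing import Any, Dict


    def build_multimodal_prompt(prompt: str, image: Any) -> Dict[str, Any]:
        """Bundle a text prompt with image data under ``multi_modal_data``.

        ``image`` is assumed to be an object that the model's image plugin
        accepts (for example, a ``PIL.Image.Image``).
        """
        return {
            "prompt": prompt,
            # Assumption: "image" is the built-in key for image inputs.
            "multi_modal_data": {"image": image},
        }


    # Placeholder standing in for a real image object.
    placeholder_image = object()
    inputs = build_multimodal_prompt(
        "USER: <image>\nWhat is in this picture?\nASSISTANT:",
        placeholder_image,
    )

    # With vLLM installed and a multi-modal model loaded as ``llm``,
    # the dictionary would then be passed to the engine, e.g.:
    #   outputs = llm.generate(inputs)

Keeping the multi-modal payload in a separate ``multi_modal_data`` entry leaves the text prompt unchanged, so the same prompt can be reused with or without the extra input.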

.. note::
   ``multi_modal_data`` can accept keys and values beyond the built-in ones, as long as a customized plugin is registered through
   :class:`vllm.multimodal.MULTIMODAL_REGISTRY`.

By default, vLLM models do not support multi-modal inputs. To enable multi-modal support for a model, follow :ref:`the guide for adding a new multi-modal model <adding_a_new_multimodal_model>`.

.. TODO: Add more instructions on how to do that once embedding support is in.

Guides
++++++

.. toctree::
   :maxdepth: 1

   adding_multimodal_model

Module Contents
+++++++++++++++

.. automodule:: vllm.multimodal

Registry
--------

.. autodata:: vllm.multimodal.MULTIMODAL_REGISTRY

.. autoclass:: vllm.multimodal.MultiModalRegistry
    :members:
    :show-inheritance:

Base Classes
------------

.. autoclass:: vllm.multimodal.MultiModalDataDict
    :members:
    :show-inheritance:

.. autoclass:: vllm.multimodal.MultiModalInputs
    :members:
    :show-inheritance:

.. autoclass:: vllm.multimodal.MultiModalPlugin
    :members:
    :show-inheritance:

Image Classes
-------------

.. automodule:: vllm.multimodal.image
    :members:
    :show-inheritance: