multimodal_index.rst 1.35 KB
Newer Older
1
2
.. _multi_modality:

3
4
5
6
7
8
9
Multi-Modality
==============

.. currentmodule:: vllm.multimodal
    
vLLM provides experimental support for multi-modal models through the :mod:`vllm.multimodal` package.

10
11
Multi-modal input can be passed alongside text and token prompts to :ref:`supported models <supported_vlms>`
via the ``multi_modal_data`` field in :class:`vllm.inputs.PromptStrictInputs`.
12

13
14
.. note::
   ``multi_modal_data`` can accept keys and values beyond the builtin ones, as long as a customized plugin is registered through 
15
   the :class:`~vllm.multimodal.MULTIMODAL_REGISTRY`.
16

17
To implement a new multi-modal model in vLLM, please follow :ref:`this guide <enabling_multimodal_inputs>`.
18

19
20
..
  TODO: Add more instructions on how to add new plugins once embeddings is in.
21

22
23
24
25
26
27
28
29
Module Contents
+++++++++++++++

.. automodule:: vllm.multimodal

Registry
--------

30
.. autodata:: vllm.multimodal.MULTIMODAL_REGISTRY
31
32
33
34
35
36
37
38

.. autoclass:: vllm.multimodal.MultiModalRegistry
    :members:
    :show-inheritance:

Base Classes
------------

39
.. autoclass:: vllm.multimodal.MultiModalDataDict
40
41
42
    :members:
    :show-inheritance:

43
44
45
46
.. autoclass:: vllm.multimodal.MultiModalInputs
    :members:
    :show-inheritance:

47
48
49
50
51
52
53
54
55
56
.. autoclass:: vllm.multimodal.MultiModalPlugin
    :members:
    :show-inheritance:

Image Classes
-------------

.. automodule:: vllm.multimodal.image
    :members:
    :show-inheritance: