2024-07-03 11:34:00 +08:00
.. _multi_modality:
2024-06-03 13:56:41 +08:00
Multi-Modality
==============
.. currentmodule :: vllm.multimodal
vLLM provides experimental support for multi-modal models through the :mod: `vllm.multimodal` package.
:class: `vllm.inputs.PromptStrictInputs` accepts an additional attribute `` multi_modal_data ``
which allows you to pass in multi-modal input alongside text and token prompts.
2024-07-03 11:34:00 +08:00
By default, vLLM models do not support multi-modal inputs. To enable multi-modal support for a model, please follow :ref: `the guide for adding a new multimodal model. <adding_a_new_multimodal_model>` .
2024-07-02 00:57:09 -07:00
# TODO: Add more instructions on how to do that once embeddings is in.
2024-06-03 13:56:41 +08:00
2024-07-03 11:34:00 +08:00
Guides
++++++
.. toctree ::
:maxdepth: 1
adding_multimodal_model
2024-06-03 13:56:41 +08:00
Module Contents
+++++++++++++++
.. automodule :: vllm.multimodal
Registry
--------
2024-06-28 20:09:56 +08:00
.. autodata :: vllm.multimodal.MULTIMODAL_REGISTRY
2024-06-03 13:56:41 +08:00
.. autoclass :: vllm.multimodal.MultiModalRegistry
:members:
:show-inheritance:
Base Classes
------------
2024-07-02 00:57:09 -07:00
.. autoclass :: vllm.multimodal.MultiModalDataDict
2024-06-03 13:56:41 +08:00
:members:
:show-inheritance:
2024-07-03 11:34:00 +08:00
.. autoclass :: vllm.multimodal.MultiModalInputs
:members:
:show-inheritance:
2024-06-03 13:56:41 +08:00
.. autoclass :: vllm.multimodal.MultiModalPlugin
:members:
:show-inheritance:
Image Classes
-------------
.. automodule :: vllm.multimodal.image
:members:
:show-inheritance: