(multi-modality)=

# Multi-Modality

```{eval-rst}
.. currentmodule:: vllm.multimodal
```

vLLM provides experimental support for multi-modal models through the {mod}`vllm.multimodal` package.

Multi-modal inputs can be passed alongside text and token prompts to [supported models](#supported-mm-models)
via the `multi_modal_data` field in {class}`vllm.inputs.PromptType`.

Currently, vLLM only has built-in support for image data. You can extend vLLM to process additional modalities
by following [this guide](#adding-multimodal-plugin).

Looking to add your own multi-modal model? Please follow the instructions listed [here](#enabling-multimodal-inputs).
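
As a rough illustration of the `multi_modal_data` field described above, the sketch below builds a dictionary-style prompt that carries one image next to the text prompt. The helper names `build_image_prompt` and `run_example` are hypothetical, and the model name and prompt template are assumptions chosen for illustration; the correct template depends on the model you load.

```python
from typing import Any


def build_image_prompt(text: str, image: Any) -> dict:
    """Return a dict-style prompt carrying one image next to the text prompt."""
    return {
        "prompt": text,
        # Built-in support currently covers image data, keyed by "image".
        "multi_modal_data": {"image": image},
    }


def run_example(image_path: str, model: str = "llava-hf/llava-1.5-7b-hf") -> str:
    """Generate text for one image. Needs vllm, Pillow, and a suitable GPU."""
    # Imported lazily so build_image_prompt works without vLLM installed.
    from PIL import Image
    from vllm import LLM

    llm = LLM(model=model)  # model name is an assumption, not a recommendation
    image = Image.open(image_path)

    # The prompt template below is an assumption; check the model's own format.
    prompt = build_image_prompt(
        "USER: <image>\nWhat is the content of this image?\nASSISTANT:",
        image,
    )
    outputs = llm.generate(prompt)
    return outputs[0].outputs[0].text
```

Keeping prompt construction separate from model loading lets the dictionary shape be checked without a GPU; `run_example` is only meant to show where that dictionary is handed to the engine.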

## Guides

```{toctree}
:maxdepth: 1

adding_multimodal_plugin
```

## Module Contents

```{eval-rst}
.. automodule:: vllm.multimodal
```

### Registry

```{eval-rst}
.. autodata:: vllm.multimodal.MULTIMODAL_REGISTRY
```

```{eval-rst}
.. autoclass:: vllm.multimodal.MultiModalRegistry
    :members:
    :show-inheritance:
```

### Base Classes

```{eval-rst}
.. autodata:: vllm.multimodal.NestedTensors
```

```{eval-rst}
.. autodata:: vllm.multimodal.BatchedTensorInputs
```

```{eval-rst}
.. autoclass:: vllm.multimodal.MultiModalDataBuiltins
    :members:
    :show-inheritance:
```

```{eval-rst}
.. autodata:: vllm.multimodal.MultiModalDataDict
```

```{eval-rst}
.. autoclass:: vllm.multimodal.MultiModalKwargs
    :members:
    :show-inheritance:
```

```{eval-rst}
.. autoclass:: vllm.multimodal.MultiModalPlugin
    :members:
    :show-inheritance:
```

### Image Classes

```{eval-rst}
.. automodule:: vllm.multimodal.image
    :members:
    :show-inheritance:
```