vllm/docs/source/design/multimodal/multimodal_index.md

84 lines
1.6 KiB
Markdown
Raw Normal View History

(multi-modality)=
# Multi-Modality
2024-06-03 13:56:41 +08:00
```{eval-rst}
2024-06-03 13:56:41 +08:00
.. currentmodule:: vllm.multimodal
```
2024-06-03 13:56:41 +08:00
vLLM provides experimental support for multi-modal models through the {mod}`vllm.multimodal` package.
Multi-modal inputs can be passed alongside text and token prompts to [supported models](#supported-mm-models)
via the `multi_modal_data` field in {class}`vllm.inputs.PromptType`.
2024-06-03 13:56:41 +08:00
Currently, vLLM only has built-in support for image data. You can extend vLLM to process additional modalities
by following [this guide](#adding-multimodal-plugin).
Looking to add your own multi-modal model? Please follow the instructions listed [here](#enabling-multimodal-inputs).
## Guides
```{toctree}
:maxdepth: 1
adding_multimodal_plugin
```
## Module Contents
2024-06-03 13:56:41 +08:00
```{eval-rst}
2024-06-03 13:56:41 +08:00
.. automodule:: vllm.multimodal
```
2024-06-03 13:56:41 +08:00
### Registry
2024-06-03 13:56:41 +08:00
```{eval-rst}
.. autodata:: vllm.multimodal.MULTIMODAL_REGISTRY
```
2024-06-03 13:56:41 +08:00
```{eval-rst}
2024-06-03 13:56:41 +08:00
.. autoclass:: vllm.multimodal.MultiModalRegistry
:members:
:show-inheritance:
```
2024-06-03 13:56:41 +08:00
### Base Classes
2024-06-03 13:56:41 +08:00
```{eval-rst}
.. autodata:: vllm.multimodal.NestedTensors
```
```{eval-rst}
.. autodata:: vllm.multimodal.BatchedTensorInputs
```
```{eval-rst}
.. autoclass:: vllm.multimodal.MultiModalDataBuiltins
2024-06-03 13:56:41 +08:00
:members:
:show-inheritance:
```
2024-06-03 13:56:41 +08:00
```{eval-rst}
.. autodata:: vllm.multimodal.MultiModalDataDict
```
```{eval-rst}
.. autoclass:: vllm.multimodal.MultiModalKwargs
:members:
:show-inheritance:
```
```{eval-rst}
2024-06-03 13:56:41 +08:00
.. autoclass:: vllm.multimodal.MultiModalPlugin
:members:
:show-inheritance:
```
2024-06-03 13:56:41 +08:00
### Image Classes
2024-06-03 13:56:41 +08:00
```{eval-rst}
2024-06-03 13:56:41 +08:00
.. automodule:: vllm.multimodal.image
:members:
:show-inheritance:
```