74 lines
1.3 KiB
Markdown
Raw Normal View History

(multi-modality)=
# Multi-Modality
2024-06-03 13:56:41 +08:00
```{eval-rst}
2024-06-03 13:56:41 +08:00
.. currentmodule:: vllm.multimodal
```
2024-06-03 13:56:41 +08:00
vLLM provides experimental support for multi-modal models through the {mod}`vllm.multimodal` package.
Multi-modal inputs can be passed alongside text and token prompts to [supported models](#supported-mm-models)
via the `multi_modal_data` field in {class}`vllm.inputs.PromptType`.
2024-06-03 13:56:41 +08:00
Looking to add your own multi-modal model? Please follow the instructions listed [here](#enabling-multimodal-inputs).
## Module Contents
2024-06-03 13:56:41 +08:00
```{eval-rst}
2024-06-03 13:56:41 +08:00
.. automodule:: vllm.multimodal
```
2024-06-03 13:56:41 +08:00
### Registry
2024-06-03 13:56:41 +08:00
```{eval-rst}
.. autodata:: vllm.multimodal.MULTIMODAL_REGISTRY
```
2024-06-03 13:56:41 +08:00
```{eval-rst}
2024-06-03 13:56:41 +08:00
.. autoclass:: vllm.multimodal.MultiModalRegistry
:members:
:show-inheritance:
```
2024-06-03 13:56:41 +08:00
### Base Classes
2024-06-03 13:56:41 +08:00
```{eval-rst}
.. automodule:: vllm.multimodal.base
:members:
:show-inheritance:
```
### Input Classes
```{eval-rst}
.. automodule:: vllm.multimodal.inputs
2024-06-03 13:56:41 +08:00
:members:
:show-inheritance:
```
2024-06-03 13:56:41 +08:00
### Audio Classes
```{eval-rst}
.. automodule:: vllm.multimodal.audio
:members:
:show-inheritance:
```
### Image Classes
```{eval-rst}
.. automodule:: vllm.multimodal.image
2024-06-03 13:56:41 +08:00
:members:
:show-inheritance:
```
2024-06-03 13:56:41 +08:00
### Video Classes
2024-06-03 13:56:41 +08:00
```{eval-rst}
.. automodule:: vllm.multimodal.video
2024-06-03 13:56:41 +08:00
:members:
:show-inheritance:
```