2025-01-06 21:40:31 +08:00
|
|
|
(new-model)=
|
|
|
|
|
|
|
|
# Adding a New Model
|
|
|
|
|
2025-01-10 22:30:25 +08:00
|
|
|
This section provides more information on how to integrate a [PyTorch](https://pytorch.org/) model into vLLM.
|
2025-01-06 21:40:31 +08:00
|
|
|
|
|
|
|
```{toctree}
|
|
|
|
:caption: Contents
|
|
|
|
:maxdepth: 1
|
|
|
|
|
|
|
|
basic
|
|
|
|
registration
|
2025-01-11 21:27:24 +08:00
|
|
|
tests
|
2025-01-06 21:40:31 +08:00
|
|
|
multimodal
|
|
|
|
```
|
|
|
|
|
|
|
|
```{note}
|
|
|
|
The complexity of adding a new model depends heavily on the model's architecture.
|
|
|
|
The process is considerably straightforward if the model shares a similar architecture with an existing model in vLLM.
|
|
|
|
However, for models that include new operators (e.g., a new attention mechanism), the process can be a bit more complex.
|
|
|
|
```
|
|
|
|
|
|
|
|
```{tip}
|
|
|
|
If you are encountering issues while integrating your model into vLLM, feel free to open a [GitHub issue](https://github.com/vllm-project/vllm/issues)
|
|
|
|
or ask on our [developer slack](https://slack.vllm.ai).
|
|
|
|
We will be happy to help you out!
|
|
|
|
```
|