README.md 1.19 KB
Newer Older
1
---
2
title: Summary
3
4
5
---
[](){ #new-model }

6
7
!!! important
    Many decoder language models can now be automatically loaded using the [Transformers backend][transformers-backend] without having to implement them in vLLM. See if `vllm serve <model>` works first!
8

9
vLLM models are specialized [PyTorch](https://pytorch.org/) models that take advantage of various [features][compatibility-matrix] to optimize their performance.
10

11
12
13
The complexity of integrating a model into vLLM depends heavily on the model's architecture.
The process is considerably straightforward if the model shares a similar architecture with an existing model in vLLM.
However, this can be more complex for models that include new operators (e.g., a new attention mechanism).
14

15
16
17
18
19
20
Read through these pages for a step-by-step guide:

- [Implementing a Basic Model](basic.md)
- [Registering a Model to vLLM](registration.md)
- [Writing Unit Tests](tests.md)
- [Multi-Modal Support](multimodal.md)
21
22
23
24
25

!!! tip
    If you are encountering issues while integrating your model into vLLM, feel free to open a [GitHub issue](https://github.com/vllm-project/vllm/issues)
    or ask on our [developer slack](https://slack.vllm.ai).
    We will be happy to help you out!