Welcome to vLLM!
================

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).

Documentation
-------------

.. toctree::
   :maxdepth: 1
   :caption: Getting Started

   getting_started/installation
   getting_started/quickstart

.. toctree::
   :maxdepth: 1
   :caption: Models

   models/supported_models
   models/adding_model