(engine-args)=

# Engine Arguments

Engine arguments control the behavior of the vLLM engine.

- For [offline inference](#offline-inference), they are part of the arguments to the `LLM` class.
- For [online serving](#openai-compatible-server), they are part of the arguments to `vllm serve`.
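
As a rough illustration of how the two entry points relate, the sketch below renders `LLM`-style keyword arguments as `vllm serve` flags. The helper `engine_args_to_cli` is hypothetical and not part of vLLM. It assumes each flag is the hyphenated form of the keyword name, which holds for common arguments such as `max_model_len`, `tensor_parallel_size`, and `enforce_eager`, and it simplifies boolean handling by dropping `False` and `None` values. The model name is only a placeholder.

```python
import shlex


def engine_args_to_cli(model: str, **engine_args) -> list[str]:
    """Hypothetical helper: render LLM-style keyword arguments as `vllm serve` flags."""
    argv = ["vllm", "serve", model]
    for name, value in engine_args.items():
        # Assumption: the CLI flag is the keyword name with underscores turned into hyphens.
        flag = "--" + name.replace("_", "-")
        if value is True:
            argv.append(flag)  # boolean switches take no value
        elif value is False or value is None:
            continue  # simplification: unset options are omitted entirely
        else:
            argv.extend([flag, str(value)])
    return argv


engine_args = {"max_model_len": 4096, "tensor_parallel_size": 2, "enforce_eager": True}

# Offline inference would pass the same names as keyword arguments (requires vLLM):
#   from vllm import LLM
#   llm = LLM(model="facebook/opt-125m", **engine_args)

# Online serving would pass them as command-line flags instead.
command = shlex.join(engine_args_to_cli("facebook/opt-125m", **engine_args))
print(command)
```

The reference below remains the authoritative list of names, types, and defaults; check it before relying on any flag produced this way.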

Below, you can find an explanation of every engine argument:

<!--- pyml disable-num-lines 7 no-space-in-emphasis -->
```{eval-rst}
.. argparse::
    :module: vllm.engine.arg_utils
    :func: _engine_args_parser
    :prog: vllm serve
    :nodefaultconst:
```

## Async Engine Arguments

The asynchronous engine, which is used for online serving, accepts additional arguments:

<!--- pyml disable-num-lines 7 no-space-in-emphasis -->
```{eval-rst}
.. argparse::
    :module: vllm.engine.arg_utils
    :func: _async_engine_args_parser
    :prog: vllm serve
    :nodefaultconst:
```