engine_args.md 914 Bytes
Newer Older
1
(engine-args)=
2

3
# Engine Arguments
4

5
6
7
8
9
10
Engine arguments control the behavior of the vLLM engine.

- For [offline inference](#offline-inference), they are part of the arguments to `LLM` class.
- For [online serving](#openai-compatible-server), they are part of the arguments to `vllm serve`.

Below, you can find an explanation of every engine argument:
11

12
<!--- pyml disable-num-lines 7 no-space-in-emphasis -->
13
```{eval-rst}
14
15
16
.. argparse::
    :module: vllm.engine.arg_utils
    :func: _engine_args_parser
17
    :prog: vllm serve
18
    :nodefaultconst:
19
    :markdownhelp:
20
```
21

22
## Async Engine Arguments
23

24
Additional arguments are available to the asynchronous engine which is used for online serving:
25

26
<!--- pyml disable-num-lines 7 no-space-in-emphasis -->
27
```{eval-rst}
28
29
30
.. argparse::
    :module: vllm.engine.arg_utils
    :func: _async_engine_args_parser
31
    :prog: vllm serve
32
    :nodefaultconst:
33
    :markdownhelp:
34
```