engine_args.md 1 KB
Newer Older
1
(engine-args)=
2

3
# Engine Arguments
4

5
6
7
8
9
Engine arguments control the behavior of the vLLM engine.

- For [offline inference](#offline-inference), they are part of the arguments to `LLM` class.
- For [online serving](#openai-compatible-server), they are part of the arguments to `vllm serve`.

10
11
For references to all arguments available from `vllm serve` see the [serve args](#serve-args) documentation.

12
Below, you can find an explanation of every engine argument:
13

14
<!--- pyml disable-num-lines 7 no-space-in-emphasis -->
15
```{eval-rst}
16
17
18
.. argparse::
    :module: vllm.engine.arg_utils
    :func: _engine_args_parser
19
    :prog: vllm serve
20
    :nodefaultconst:
21
    :markdownhelp:
22
```
23

24
## Async Engine Arguments
25

26
Additional arguments are available to the asynchronous engine which is used for online serving:
27

28
<!--- pyml disable-num-lines 7 no-space-in-emphasis -->
29
```{eval-rst}
30
31
32
.. argparse::
    :module: vllm.engine.arg_utils
    :func: _async_engine_args_parser
33
    :prog: vllm serve
34
    :nodefaultconst:
35
    :markdownhelp:
36
```