# Configuration Options

This section lists the most common options for running vLLM.

There are three main levels of configuration, listed from highest to lowest priority:

- [Request parameters](../serving/openai_compatible_server.md#completions-api) and [input arguments](../api/README.md#inference-parameters)
- [Engine arguments](./engine_args.md)
- [Environment variables](./env_vars.md)
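
The priority order above can be pictured as a layered lookup. The sketch below is only an illustration in plain Python, not vLLM's actual resolution logic: the option names (`option_a`, `option_b`, `option_c`) and their values are hypothetical, and it assumes for simplicity that the same option could be set at every level.

```python
from collections import ChainMap

# Hypothetical option names and values, for illustration only.
# Each dict stands in for one configuration level.
env_vars = {"option_a": "from-env", "option_b": "from-env", "option_c": "from-env"}
engine_args = {"option_a": "from-engine", "option_b": "from-engine"}
request_params = {"option_a": "from-request"}

# ChainMap searches its maps left to right, so the first map listed
# has the highest priority and the last one acts as the fallback.
effective = ChainMap(request_params, engine_args, env_vars)

for name in ("option_a", "option_b", "option_c"):
    print(name, "->", effective[name])
```

In this sketch, `option_a` resolves to the request-level value, `option_b` to the engine-level value, and `option_c` falls through to the environment-level value, which mirrors the highest-to-lowest ordering of the list.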