gpu.md 3.47 KB
Newer Older
1
2
3
4
---
toc_depth: 3
---

5
6
7
8
9
10
# GPU

vLLM is a Python library that supports the following GPU variants. Select your GPU type to see vendor specific instructions:

=== "NVIDIA CUDA"

11
    --8<-- "docs/getting_started/installation/gpu.cuda.inc.md:installation"
12
13
14

=== "AMD ROCm"

15
    --8<-- "docs/getting_started/installation/gpu.rocm.inc.md:installation"
16
17
18

=== "Intel XPU"

19
    --8<-- "docs/getting_started/installation/gpu.xpu.inc.md:installation"
20
21
22
23

## Requirements

- OS: Linux
24
- Python: 3.10 -- 3.13
25

26
27
28
!!! note
    vLLM does not support Windows natively. To run vLLM on Windows, you can use the Windows Subsystem for Linux (WSL) with a compatible Linux distribution, or use some community-maintained forks, e.g. [https://github.com/SystemPanic/vllm-windows](https://github.com/SystemPanic/vllm-windows).

29
30
=== "NVIDIA CUDA"

31
    --8<-- "docs/getting_started/installation/gpu.cuda.inc.md:requirements"
32
33
34

=== "AMD ROCm"

35
    --8<-- "docs/getting_started/installation/gpu.rocm.inc.md:requirements"
36
37
38

=== "Intel XPU"

39
    --8<-- "docs/getting_started/installation/gpu.xpu.inc.md:requirements"
40
41
42
43
44
45
46
47
48

## Set up using Python

### Create a new Python environment

--8<-- "docs/getting_started/installation/python_env_setup.inc.md"

=== "NVIDIA CUDA"

49
    --8<-- "docs/getting_started/installation/gpu.cuda.inc.md:set-up-using-python"
50
51
52

=== "AMD ROCm"

53
    --8<-- "docs/getting_started/installation/gpu.rocm.inc.md:set-up-using-python"
54
55
56

=== "Intel XPU"

57
    --8<-- "docs/getting_started/installation/gpu.xpu.inc.md:set-up-using-python"
58

59
### Pre-built wheels {#pre-built-wheels}
60
61
62

=== "NVIDIA CUDA"

63
    --8<-- "docs/getting_started/installation/gpu.cuda.inc.md:pre-built-wheels"
64
65
66

=== "AMD ROCm"

67
    --8<-- "docs/getting_started/installation/gpu.rocm.inc.md:pre-built-wheels"
68
69
70

=== "Intel XPU"

71
    --8<-- "docs/getting_started/installation/gpu.xpu.inc.md:pre-built-wheels"
72
73
74
75
76

### Build wheel from source

=== "NVIDIA CUDA"

77
    --8<-- "docs/getting_started/installation/gpu.cuda.inc.md:build-wheel-from-source"
78
79
80

=== "AMD ROCm"

81
    --8<-- "docs/getting_started/installation/gpu.rocm.inc.md:build-wheel-from-source"
82
83
84

=== "Intel XPU"

85
    --8<-- "docs/getting_started/installation/gpu.xpu.inc.md:build-wheel-from-source"
86
87
88
89
90

## Set up using Docker

### Pre-built images

91
92
93
<!-- markdownlint-disable MD025 -->
# --8<-- [start:pre-built-images]

94
95
=== "NVIDIA CUDA"

96
    --8<-- "docs/getting_started/installation/gpu.cuda.inc.md:pre-built-images"
97
98
99

=== "AMD ROCm"

100
    --8<-- "docs/getting_started/installation/gpu.rocm.inc.md:pre-built-images"
101
102
103

=== "Intel XPU"

104
    --8<-- "docs/getting_started/installation/gpu.xpu.inc.md:pre-built-images"
105

106
107
108
109
# --8<-- [end:pre-built-images]
<!-- markdownlint-enable MD025 -->

<!-- markdownlint-disable MD001 -->
110
### Build image from source
111
112
113
114
<!-- markdownlint-enable MD001 -->

<!-- markdownlint-disable MD025 -->
# --8<-- [start:build-image-from-source]
115
116
117

=== "NVIDIA CUDA"

118
    --8<-- "docs/getting_started/installation/gpu.cuda.inc.md:build-image-from-source"
119
120
121

=== "AMD ROCm"

122
    --8<-- "docs/getting_started/installation/gpu.rocm.inc.md:build-image-from-source"
123
124
125

=== "Intel XPU"

126
    --8<-- "docs/getting_started/installation/gpu.xpu.inc.md:build-image-from-source"
127

128
129
130
# --8<-- [end:build-image-from-source]
<!-- markdownlint-enable MD025 -->

131
132
133
134
## Supported features

=== "NVIDIA CUDA"

135
    --8<-- "docs/getting_started/installation/gpu.cuda.inc.md:supported-features"
136
137
138

=== "AMD ROCm"

139
    --8<-- "docs/getting_started/installation/gpu.rocm.inc.md:supported-features"
140
141
142

=== "Intel XPU"

143
    --8<-- "docs/getting_started/installation/gpu.xpu.inc.md:supported-features"