# Ollama Docker image

### CPU only

```shell
docker run -d -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama
```
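
After the container starts, it can take a moment before the server accepts connections on the published port. A minimal sketch of a readiness check follows; the function name `wait_for_ollama`, the `BASE_URL` variable, and the retry count are illustrative assumptions, not part of Ollama or Docker.

```shell
# Hypothetical helper: poll the published port until the server answers.
# BASE_URL and the default of 30 retries are illustrative, not Ollama settings.
BASE_URL="${BASE_URL:-http://localhost:11434}"

wait_for_ollama() {
    retries="${1:-30}"
    while [ "$retries" -gt 0 ]; do
        # -f makes curl exit non-zero on HTTP errors; -s silences progress output.
        if curl -fs "$BASE_URL/" >/dev/null 2>&1; then
            return 0
        fi
        retries=$((retries - 1))
        sleep 1
    done
    return 1
}
```

For example, `wait_for_ollama 10 && echo "server is up"` waits up to roughly ten seconds before giving up.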

### NVIDIA GPU
Install the [NVIDIA Container Toolkit](https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/install-guide.html#installation).

#### Install with Apt
1.  Configure the repository

    ```shell
    curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey \
        | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg
    curl -s -L https://nvidia.github.io/libnvidia-container/stable/deb/nvidia-container-toolkit.list \
        | sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' \
        | sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list
    sudo apt-get update
    ```

2.  Install the NVIDIA Container Toolkit packages

    ```shell
    sudo apt-get install -y nvidia-container-toolkit
    ```

#### Install with Yum or Dnf
1.  Configure the repository

    ```shell
    curl -s -L https://nvidia.github.io/libnvidia-container/stable/rpm/nvidia-container-toolkit.repo \
        | sudo tee /etc/yum.repos.d/nvidia-container-toolkit.repo
    ```

2.  Install the NVIDIA Container Toolkit packages

    ```shell
    sudo yum install -y nvidia-container-toolkit
    ```

#### Configure Docker to use the NVIDIA driver

```shell
sudo nvidia-ctk runtime configure --runtime=docker
sudo systemctl restart docker
```
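
Once Docker has restarted, it may be worth confirming that the `nvidia` runtime is registered before starting the container. The sketch below assumes that `docker info` lists registered runtimes on a `Runtimes:` line; the helper name `has_nvidia_runtime` is hypothetical.

```shell
# Hypothetical helper: succeed if `docker info` reports an nvidia runtime.
# Assumes the runtimes appear on a single "Runtimes:" line of the output.
has_nvidia_runtime() {
    docker info 2>/dev/null | grep -qi 'runtimes:.*nvidia'
}
```

A typical use is `has_nvidia_runtime || echo "nvidia runtime not registered" >&2`.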

#### Start the container

```shell
docker run -d --gpus=all -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama
```

> [!NOTE]  
> If you're running on an NVIDIA JetPack system, Ollama can't automatically discover the correct JetPack version. Pass the environment variable `JETSON_JETPACK=5` or `JETSON_JETPACK=6` to the container to select version 5 or 6.
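
One way to pass that variable is Docker's `-e` flag on the same `docker run` command. The wrapper below is a sketch only: the function name `run_ollama_jetson` and its argument check are hypothetical, and a given Jetson setup may need additional flags not covered here.

```shell
# Hypothetical wrapper: start the container with JETSON_JETPACK set to 5 or 6.
run_ollama_jetson() {
    version="$1"
    case "$version" in
        5|6) ;;
        *)
            echo "JETSON_JETPACK must be 5 or 6" >&2
            return 2
            ;;
    esac
    # -e sets an environment variable inside the container.
    docker run -d --gpus=all -e "JETSON_JETPACK=$version" \
        -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama
}
```

Calling `run_ollama_jetson 6` starts the container for JetPack 6; any other value than 5 or 6 is rejected before Docker is invoked.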

### AMD GPU

To run Ollama using Docker with AMD GPUs, use the `rocm` tag and the following command:

```shell
docker run -d --device /dev/kfd --device /dev/dri -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama:rocm
```
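
The command above hands `/dev/kfd` and `/dev/dri` to the container, so both must exist on the host. A small pre-flight check could look like the following sketch; the helper name `check_amd_devices` is hypothetical, and it only tests that the paths exist, not that the driver works.

```shell
# Hypothetical helper: verify that the device nodes passed via --device exist.
check_amd_devices() {
    # With no arguments, check the two paths used in the docker run command.
    [ "$#" -gt 0 ] || set -- /dev/kfd /dev/dri
    for dev in "$@"; do
        if [ ! -e "$dev" ]; then
            echo "missing device: $dev" >&2
            return 1
        fi
    done
    return 0
}
```

Running `check_amd_devices` before `docker run` reports the first missing path on stderr and returns a non-zero status.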

### Run model locally

Now you can run a model:

```shell
docker exec -it ollama ollama run llama3.2
```
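
Because the container publishes port 11434, the model can also be queried from the host over HTTP. The sketch below posts to Ollama's `/api/generate` endpoint with `curl`; the function name `ollama_generate` and the `BASE_URL` variable are illustrative assumptions, and the model is assumed to have been pulled already (for example by the `ollama run` command above).

```shell
# Hypothetical helper: send a single non-streaming prompt to /api/generate.
# The prompt is interpolated as-is, so it must not contain double quotes
# or backslashes; this is a sketch, not a robust JSON encoder.
ollama_generate() {
    model="$1"
    prompt="$2"
    curl -s "${BASE_URL:-http://localhost:11434}/api/generate" \
        -d "{\"model\": \"$model\", \"prompt\": \"$prompt\", \"stream\": false}"
}
```

For example, `ollama_generate llama3.2 "Why is the sky blue?"` prints the JSON response from the server.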

### Try different models

More models can be found in the [Ollama library](https://ollama.com/library).