README.md 1.52 KB
Newer Older
Bruce MacDonald's avatar
Bruce MacDonald committed
1
# Ollama
Jeffrey Morgan's avatar
Jeffrey Morgan committed
2

Jeffrey Morgan's avatar
Jeffrey Morgan committed
3
Ollama is a tool for running any large language model on any machine. It's designed to be easy to use and fast, supporting the largest number of models possible by using the fastest loader available for your platform and model.
Jeffrey Morgan's avatar
Jeffrey Morgan committed
4

Jeffrey Morgan's avatar
Jeffrey Morgan committed
5
> _Note: this project is a work in progress. Certain models that can be run with `ollama` are intended for research and/or non-commercial use only._
Jeffrey Morgan's avatar
Jeffrey Morgan committed
6
7

## Install
Jeffrey Morgan's avatar
Jeffrey Morgan committed
8

9
10
Using `pip`:

Jeffrey Morgan's avatar
Jeffrey Morgan committed
11
```
Bruce MacDonald's avatar
Bruce MacDonald committed
12
pip install ollama
Jeffrey Morgan's avatar
Jeffrey Morgan committed
13
14
```

15
16
Using `docker`:

Jeffrey Morgan's avatar
Jeffrey Morgan committed
17
18
19
20
```
docker run ollama/ollama
```

Jeffrey Morgan's avatar
Jeffrey Morgan committed
21
## Quickstart
Jeffrey Morgan's avatar
Jeffrey Morgan committed
22

Jeffrey Morgan's avatar
Jeffrey Morgan committed
23
24
25
26
To run a model, use `ollama run`:

```
ollama run orca-mini-3b
Jeffrey Morgan's avatar
Jeffrey Morgan committed
27
28
```

Jeffrey Morgan's avatar
Jeffrey Morgan committed
29
You can also run models from hugging face:
Bruce MacDonald's avatar
Bruce MacDonald committed
30

Jeffrey Morgan's avatar
Jeffrey Morgan committed
31
32
33
```
ollama run huggingface.co/TheBloke/orca_mini_3B-GGML
```
Jeffrey Morgan's avatar
Jeffrey Morgan committed
34

Jeffrey Morgan's avatar
Jeffrey Morgan committed
35
36
37
38
Or directly via downloaded model files:

```
ollama run ~/Downloads/orca-mini-13b.ggmlv3.q4_0.bin
Jeffrey Morgan's avatar
Jeffrey Morgan committed
39
40
41
42
43
```

## Python SDK

### Example
Bruce MacDonald's avatar
Bruce MacDonald committed
44
45

```python
Jeffrey Morgan's avatar
Jeffrey Morgan committed
46
import ollama
Jeffrey Morgan's avatar
Jeffrey Morgan committed
47
ollama.generate("orca-mini-3b", "hi")
Jeffrey Morgan's avatar
Jeffrey Morgan committed
48
49
```

Jeffrey Morgan's avatar
Jeffrey Morgan committed
50
### `ollama.generate(model, message)`
Jeffrey Morgan's avatar
Jeffrey Morgan committed
51

Jeffrey Morgan's avatar
Jeffrey Morgan committed
52
Generate a completion
Jeffrey Morgan's avatar
Jeffrey Morgan committed
53

Bruce MacDonald's avatar
Bruce MacDonald committed
54
```python
Jeffrey Morgan's avatar
Jeffrey Morgan committed
55
ollama.generate("./llama-7b-ggml.bin", "hi")
Jeffrey Morgan's avatar
Jeffrey Morgan committed
56
57
```

Jeffrey Morgan's avatar
Jeffrey Morgan committed
58
### `ollama.models()`
Jeffrey Morgan's avatar
Jeffrey Morgan committed
59

Jeffrey Morgan's avatar
Jeffrey Morgan committed
60
List available local models
Bruce MacDonald's avatar
Bruce MacDonald committed
61

Jeffrey Morgan's avatar
Jeffrey Morgan committed
62
```python
Bruce MacDonald's avatar
Bruce MacDonald committed
63
models = ollama.models()
Jeffrey Morgan's avatar
Jeffrey Morgan committed
64
65
```

Jeffrey Morgan's avatar
Jeffrey Morgan committed
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
### `ollama.load(model)`

Manually a model for generation

```python
ollama.load("model")
```

### `ollama.unload(model)`

Unload a model

```python
ollama.unload("model")
```

Bruce MacDonald's avatar
Bruce MacDonald committed
82
83
84
### `ollama.pull(model)`

Download a model
Bruce MacDonald's avatar
Bruce MacDonald committed
85
86

```python
Bruce MacDonald's avatar
Bruce MacDonald committed
87
ollama.pull("huggingface.co/thebloke/llama-7b-ggml")
Bruce MacDonald's avatar
Bruce MacDonald committed
88
```
Jeffrey Morgan's avatar
Jeffrey Morgan committed
89

Bruce MacDonald's avatar
Bruce MacDonald committed
90
### `ollama.search(query)`
91

Bruce MacDonald's avatar
Bruce MacDonald committed
92
Search for compatible models that Ollama can run
93

Bruce MacDonald's avatar
Bruce MacDonald committed
94
95
96
```python
ollama.search("llama-7b")
```
Jeffrey Morgan's avatar
Jeffrey Morgan committed
97

Bruce MacDonald's avatar
Bruce MacDonald committed
98
99
100
## Documentation

- [Development](docs/development.md)