- 07 Mar, 2024 3 commits
-
-
Daniel Hiltgen authored
This refines where we extract the LLM libraries to by adding a new OLLAMA_HOME env var, which defaults to `~/.ollama`. The logic was already idempotent, so this should speed up startups after the first time a new release is deployed. It also cleans up after itself.

We now build only a single ROCm version (latest major) on both Windows and Linux. Given the large size of ROCm's tensor files, we split the dependency out: it's bundled into the installer on Windows, and a separate download on Linux. The Linux install script is now smart enough to detect the presence of AMD GPUs and check whether ROCm v6 is already present; if not, it downloads our dependency tar file.

For Linux discovery, we now use sysfs and check each GPU against what ROCm supports so we can degrade to CPU gracefully instead of having llama.cpp+rocm assert/crash on us. For Windows, we now use Go's Windows dynamic library loading logic to access the amdhip64.dll APIs to query GPU information.
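As a quick usage sketch of the override described above (the `/opt/ollama` path is just an illustration; by default nothing needs to be set):

```sh
# By default, LLM libraries are extracted under ~/.ollama.
# Point OLLAMA_HOME elsewhere before starting the server to relocate them.
export OLLAMA_HOME=/opt/ollama
ollama serve
```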
-
Jeffrey Morgan authored
-
John authored
Signed-off-by: hishope <csqiye@126.com>
-
- 05 Mar, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 01 Mar, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 25 Feb, 2024 1 commit
-
-
elthommy authored
Remove unused GPT4all
Use nomic-embed-text as the embedding model
Fix a deprecation warning (__call__)
-
- 22 Feb, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 21 Feb, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 20 Feb, 2024 4 commits
-
-
Jeffrey Morgan authored
Add instructions to get public key on windows
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 19 Feb, 2024 2 commits
-
-
Patrick Devine authored
-
Daniel Hiltgen authored
-
- 16 Feb, 2024 1 commit
-
-
Tristan Rhodes authored
-
- 15 Feb, 2024 2 commits
-
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
This focuses on Windows first, but could be used for Mac and possibly Linux in the future.
-
- 12 Feb, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 09 Feb, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 08 Feb, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 07 Feb, 2024 3 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 06 Feb, 2024 1 commit
-
-
Bruce MacDonald authored
-
- 05 Feb, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 02 Feb, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 29 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
Some users are new to containers and are unsure where the server logs go.
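For anyone landing here with the same question, the standard Docker tooling shows them (assuming the container was started with the name `ollama`):

```sh
# Follow the Ollama server logs from the running container
docker logs -f ollama
```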
-
- 26 Jan, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 22 Jan, 2024 1 commit
-
-
Michael Yang authored
-
- 21 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
The Linux build now supports parallel CPU builds to speed things up. This also exposes AMD GPU targets as an optional setting for advanced users who want to alter our default set.
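A sketch of what a customized source build might look like; `AMDGPU_TARGETS` is the conventional CMake/ROCm spelling, but treat the exact variable name and values here as assumptions and check the build scripts for the real knob:

```sh
# Hypothetical example: restrict the AMD GPU target list during a source build.
# go generate runs the native llama.cpp builds; go build produces the binary.
AMDGPU_TARGETS="gfx1030;gfx1100" go generate ./...
go build .
```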
-
- 20 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
-
- 18 Jan, 2024 3 commits
-
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
A few obvious levels were adjusted, but generally everything was mapped to the "info" level.
-
Daniel Hiltgen authored
-
- 12 Jan, 2024 1 commit
-
-
Tristram Oaten authored
After executing the `userdel ollama` command, I saw this message:

```sh
$ sudo userdel ollama
userdel: group ollama not removed because it has other members.
```

Which reminded me that I had to remove the dangling group too. For completeness, the uninstall instructions should do this too. Thanks!
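The resulting uninstall steps, with the dangling group removed as well:

```sh
sudo userdel ollama
sudo groupdel ollama
```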
-
- 11 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
This reduces the built-in Linux version to not use any vector extensions, which enables the resulting builds to run under Rosetta on macOS in Docker. Then at runtime it checks for the actual CPU vector extensions and loads the best CPU library available.
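To see which vector extensions a Linux host actually reports, and hence roughly which CPU variant the runtime would pick, /proc/cpuinfo can be inspected directly (a minimal check, not the loader's actual logic):

```sh
# List the AVX-family flags this CPU advertises; no output means only
# the baseline no-vector build applies (e.g. under Rosetta).
grep -o 'avx[0-9a-z_]*' /proc/cpuinfo | sort -u
```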
-
- 09 Jan, 2024 1 commit
-
-
Robin Glauser authored
Fixed assistant in the example response.
-