Commits · 1b272d5bcd6dcf9ddad12f3bd00cc94b4f0cb658 · OpenDAS / ollama

26 Mar, 2024 1 commit
- change `github.com/jmorganca/ollama` to `github.com/ollama/ollama` (#3347) · 1b272d5b
  Patrick Devine authored Mar 26, 2024
  
  1b272d5b
23 Mar, 2024 1 commit

Revamp go based integration tests · 949b6c01

Daniel Hiltgen authored Mar 23, 2024

This uplevels the integration tests to run the server which can allow
testing an existing server, or a remote server.

949b6c01

15 Mar, 2024 1 commit

server: replace blob prefix separator from ':' to '-' (#3146) · 703684a8

Blake Mizerany authored Mar 14, 2024

This fixes issues with blob file names that contain ':' characters to be rejected by file systems that do not support them.

703684a8

13 Mar, 2024 1 commit
- Default Keep Alive environment variable (#3094) · 47cfe58a
  Patrick Devine authored Mar 13, 2024
```
---------
Co-authored-by: Chris-AS1 <8493773+Chris-AS1@users.noreply.github.com>
```
  47cfe58a
09 Mar, 2024 5 commits
- Finish unwinding idempotent payload logic · 4a5c9b80
  Daniel Hiltgen authored Mar 08, 2024
```
The recent ROCm change partially removed idempotent
payloads, but the ggml-metal.metal file for mac was still
idempotent.  This finishes switching to always extract
the payloads, and now that idempotentcy is gone, the
version directory is no longer useful.
```
  4a5c9b80
- separate out `isLocalIP` · 5b3fad96
  Jeffrey Morgan authored Mar 09, 2024
  
  5b3fad96
- simplify host checks · bfec2c6e
  Jeffrey Morgan authored Mar 08, 2024
  
  bfec2c6e
- add additional allowed hosts · 5c143af7
  Jeffrey Morgan authored Mar 08, 2024
  
  5c143af7
- add allowed host middleware and remove `workDir` middleware (#3018) · fc8c0445
  Jeffrey Morgan authored Mar 08, 2024
  
  fc8c0445
08 Mar, 2024 3 commits
- decode ggla · 76bdebba
  Michael Yang authored Mar 08, 2024
  
  76bdebba
- fix: allow importing a model from name reference (#3005) · 0cebc79c
  Bruce MacDonald authored Mar 08, 2024
  
  0cebc79c
- Revert "adjust download and upload concurrency based on available bandwidth" (#2995) · fc062059
  Jeffrey Morgan authored Mar 07, 2024
  
  fc062059
07 Mar, 2024 2 commits

Revamp ROCm support · 6c5ccb11

Daniel Hiltgen authored Feb 15, 2024

This refines where we extract the LLM libraries to by adding a new
OLLAMA_HOME env var, that defaults to `~/.ollama` The logic was already
idempotenent, so this should speed up startups after the first time a
new release is deployed. It also cleans up after itself.

We now build only a single ROCm version (latest major) on both windows
and linux. Given the large size of ROCms tensor files, we split the
dependency out. It's bundled into the installer on windows, and a
separate download on windows. The linux install script is now smart and
detects the presence of AMD GPUs and looks to see if rocm v6 is already
present, and if not, then downloads our dependency tar file.

For Linux discovery, we now use sysfs and check each GPU against what
ROCm supports so we can degrade to CPU gracefully instead of having
llama.cpp+rocm assert/crash on us. For Windows, we now use go's windows
dynamic library loading logic to access the amdhip64.dll APIs to query
the GPU information.

6c5ccb11

Convert Safetensors to an Ollama model (#2824) · 2c017ca4
Patrick Devine authored Mar 06, 2024

2c017ca4

01 Mar, 2024 1 commit
- Fix embeddings load model behavior (#2848) · 3b4bab3d
  Jeffrey Morgan authored Feb 29, 2024
  
  3b4bab3d
29 Feb, 2024 1 commit

prepend image tags (#2789) · 0e19476b

Michael Yang authored Feb 29, 2024

instead of appending image tags, prepend them - this generally produces better results

0e19476b

21 Feb, 2024 9 commits
- refactor · 084d8466
  Michael Yang authored Jan 29, 2024
  
  084d8466
- lint · 6a4b9944
  Michael Yang authored Jan 29, 2024
  
  6a4b9944
- use LimitGroup for uploads · bea007de
  Michael Yang authored Jan 26, 2024
  
  bea007de
- adjust group limit based on download speed · 074934be
  Michael Yang authored Jan 26, 2024
  
  074934be
- add new LimitGroup for dynamic concurrency · 0de12368
  Michael Yang authored Jan 26, 2024
  
  0de12368
- refactor download run · 917bd610
  Michael Yang authored Jan 26, 2024
  
  917bd610
- better error message when calling `/api/generate` or `/api/chat` with embedding models · 287ba115
  Jeffrey Morgan authored Feb 20, 2024
  
  287ba115
- Support for `bert` and `nomic-bert` embedding models · 63861f58
  Jeffrey Morgan authored Feb 20, 2024
  
  63861f58
- replace strings buffer with hasher (#2437) · 210b6526
  Michael Yang authored Feb 20, 2024
```
the buffered value is going into the hasher eventually so write directly
to the hasher instead
```
  210b6526
20 Feb, 2024 1 commit
- use http.DefaultClient (#2530) · 897b2134
  Michael Yang authored Feb 20, 2024
```
default client already handles proxy
```
  897b2134
16 Feb, 2024 1 commit
- fix: chat system prompting overrides (#2542) · 88622847
  Bruce MacDonald authored Feb 16, 2024
  
  88622847
15 Feb, 2024 2 commits
- rerefactor · e43648af
  Michael Yang authored Feb 14, 2024
  
  e43648af
- Move hub auth out to new package · f397e0e9
  Daniel Hiltgen authored Feb 05, 2024
  
  f397e0e9
12 Feb, 2024 2 commits
- Fix issues with templating prompt in chat mode (#2460) · 48a273f8
  Jeffrey Morgan authored Feb 12, 2024
  
  48a273f8
- Check image filetype in api handlers (#2467) · 1f9078d6
  Jeffrey Morgan authored Feb 12, 2024
  
  1f9078d6
08 Feb, 2024 1 commit
- Fix hanging issue when sending empty content (#2399) · a0a199b1
  Jeffrey Morgan authored Feb 07, 2024
  
  a0a199b1
07 Feb, 2024 2 commits
- Initial OpenAI `/v1/chat/completions` API compatibility (#2376) · 453f572f
  Jeffrey Morgan authored Feb 07, 2024
  
  453f572f
- fix response on token error · e805ac1d
  Michael Yang authored Feb 07, 2024
  
  e805ac1d
01 Feb, 2024 6 commits
- structured debug prompt · 3d6f4850
  Michael Yang authored Jan 31, 2024
  
  3d6f4850
- use image id · f3761405
  Michael Yang authored Feb 01, 2024
  
  f3761405
- fix tests · e49dc9f3
  Michael Yang authored Feb 01, 2024
  
  e49dc9f3
- remove image tags · d125510b
  Michael Yang authored Feb 01, 2024
  
  d125510b
- account for image projection in token count · fb569880
  Michael Yang authored Feb 01, 2024
  
  fb569880
- use llm.ImageData for chat · d046bee7
  Michael Yang authored Jan 31, 2024
  
  d046bee7