Commits · d4cd6957598ba6a3a1bb4e2660ee24b82e2541da · OpenDAS / ollama

19 Dec, 2023 3 commits

Add cgo implementation for llama.cpp · d4cd6957

Daniel Hiltgen authored Nov 13, 2023

Run the server.cpp directly inside the Go runtime via cgo
while retaining the LLM Go abstractions.

d4cd6957

Update images.go · 5e7fd690
Bruce MacDonald authored Dec 11, 2023

5e7fd690

deprecate ggml · 811b1f03

Bruce MacDonald authored Nov 24, 2023



- remove ggml runner
- automatically pull gguf models when ggml detected
- tell users to update to gguf in the case automatic pull fails

Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>

811b1f03
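
The three bullets of the ggml deprecation above describe a detect, pull, then warn flow. A minimal sketch of that flow follows, assuming a model file is recognised as gguf by its leading four bytes `GGUF` and that anything else is treated as legacy. The names `isGGUF`, `checkModel`, `errLegacyFormat`, `errOffline`, and the `pull` callback are hypothetical and are not taken from the ollama source.

```go
package main

import (
	"bytes"
	"errors"
	"fmt"
)

// ggufMagic is the four-byte marker at the start of a gguf file.
var ggufMagic = []byte("GGUF")

// Hypothetical errors used only by this sketch.
var (
	errLegacyFormat = errors.New("unsupported model format, please update the model to gguf")
	errOffline      = errors.New("registry unreachable")
)

// isGGUF reports whether header starts with the gguf magic bytes.
func isGGUF(header []byte) bool {
	return len(header) >= len(ggufMagic) && bytes.Equal(header[:len(ggufMagic)], ggufMagic)
}

// checkModel accepts gguf headers as-is. For anything else (treated here as
// legacy ggml, which is an assumption) it calls pull to fetch a gguf version
// and, if that fails, returns an error telling the user to update.
func checkModel(header []byte, pull func() error) error {
	if isGGUF(header) {
		return nil
	}
	if err := pull(); err != nil {
		return fmt.Errorf("%w (automatic pull failed: %v)", errLegacyFormat, err)
	}
	return nil
}

func main() {
	legacy := []byte("lmgg\x01\x00\x00\x00") // stand-in for a non-gguf header
	err := checkModel(legacy, func() error { return errOffline })
	fmt.Println(err)
}
```

The pull step is passed in as a callback so the detection logic can be exercised without any network access.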

18 Dec, 2023 2 commits
- send empty messages on last chat response (#1530) · d99fa6ce
  Bruce MacDonald authored Dec 18, 2023
  
  d99fa6ce
- add magic header for unit tests (#1558) · 3948c6ea
  Patrick Devine authored Dec 18, 2023
  
  3948c6ea
15 Dec, 2023 3 commits
- add API create/copy handlers (#1541) · 86b0dd4b
  Patrick Devine authored Dec 15, 2023
  
  86b0dd4b
- add API tests for list handler (#1535) · 0174665d
  Patrick Devine authored Dec 14, 2023
  
  0174665d
- Add unit test of API routes (#1528) · 630518f0
  Patrick Devine authored Dec 14, 2023
  
  630518f0
14 Dec, 2023 1 commit

restore model load duration on generate response (#1524) · 6ee8c801

Bruce MacDonald authored Dec 14, 2023

* restore model load duration on generate response

- set model load duration on generate and chat done response
- calculate createAt time when response created

* remove checkpoints predict opts

* Update routes.go

6ee8c801

13 Dec, 2023 1 commit
- fix tests · 4a1abfe4
  Jeffrey Morgan authored Dec 13, 2023
  
  4a1abfe4
12 Dec, 2023 2 commits
- add image support to the chat api (#1490) · d9e60f63
  Patrick Devine authored Dec 12, 2023
  
  d9e60f63
- Fix issues with `/set template` and `/set system` (#1486) · 0a9d3480
  Jeffrey Morgan authored Dec 12, 2023
  
  0a9d3480
11 Dec, 2023 1 commit

Multimodal support (#1216) · 910e9401

Patrick Devine authored Dec 11, 2023




---------
Co-authored-by: Matt Apperson <mattapperson@Matts-MacBook-Pro.local>

910e9401

10 Dec, 2023 4 commits
- fix `go-staticcheck` warning · 7db5bcf7
  Jeffrey Morgan authored Dec 10, 2023
  
  7db5bcf7
- fix model name returned by `/api/generate` being different than the model name provided · fa2f095b
  Jeffrey Morgan authored Dec 10, 2023
  
  fa2f095b
- fix error on accumulating final chat response · 045b855d
  Jeffrey Morgan authored Dec 10, 2023
  
  045b855d
- fix empty response when receiving runner error · 32064a06
  Jeffrey Morgan authored Dec 10, 2023
  
  32064a06
09 Dec, 2023 1 commit
- Don't expose model information in `/api/generate` · 9e1406e4
  Jeffrey Morgan authored Dec 09, 2023
  
  9e1406e4
08 Dec, 2023 3 commits
- fix: encode full previous prompt in context (#1424) · 7e9405fd
  Bruce MacDonald authored Dec 08, 2023
  
  7e9405fd
- fix: only flush template in chat when current role encountered (#1426) · 3b0b8930
  Bruce MacDonald authored Dec 08, 2023
  
  3b0b8930
- fix: restore modelfile system in prompt template (#1425) · e3f925fc
  Bruce MacDonald authored Dec 08, 2023
  
  e3f925fc
05 Dec, 2023 10 commits

use missingkey in set empty interface when missing · 47d4e226
Bruce MacDonald authored Nov 22, 2023

47d4e226
return model configuration in generate · 5d75505e
Michael Yang authored Dec 01, 2023

5d75505e
load projectors · b9495ea1
Michael Yang authored Nov 30, 2023

b9495ea1
chat api endpoint (#1392) · 195e3d9d
Bruce MacDonald authored Dec 05, 2023

195e3d9d
server: add version handler · 1ebdbd96
Michael Yang authored Oct 12, 2023

1ebdbd96
Revert "chat api (#991)" while context variable is fixed · 00d06619
Jeffrey Morgan authored Dec 04, 2023

This reverts commit 7a0899d6.

00d06619
use NewLayer for CreateBlobHandler · a3737cbd
Michael Yang authored Nov 24, 2023

a3737cbd
add modelfamilies · 998f1785
Michael Yang authored Nov 29, 2023

998f1785

refactor layer creation · 70a93057

Michael Yang authored Nov 22, 2023

previous layer creation was not ideal because:

1. it required reading the input file multiple times, once to calculate
   the sha256 checksum, another to write it to disk, and potentially one
   more to decode the underlying gguf
2. it used io.ReadSeeker, which is prone to user error. if the file isn't
   reset correctly or in the right place, it could end up reading an
   empty file

there is also some brittleness when reading existing layers: writing
the inherited layers can error when reading an already closed file

this commit aims to fix these issues by restructuring layer creation.

1. it now writes the layer to a temporary file and to the hash function
   at the same time, then moves the file to its final location on Commit
2. layers are read only once, when copied to the destination. the
   exception is raw model files, which still require a second read to
   decode the model metadata

70a93057
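
The restructured layer creation described in this commit message can be sketched as below. This is a minimal illustration under stated assumptions, not the code from the commit: the `layer` type, `newLayer`, `Commit`, `demo`, and the digest-based file name are hypothetical. It reads the input once, feeding a temporary file and a SHA-256 hash together through `io.MultiWriter`, and only moves the file into place on `Commit`.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
	"io"
	"os"
	"path/filepath"
	"strings"
)

// layer is a hypothetical type for this sketch; it is not ollama's Layer.
type layer struct {
	Digest string
	Size   int64
	tmp    string // path of the temporary file holding the content
}

// newLayer reads r exactly once, streaming every byte to both a temporary
// file in dir and a SHA-256 hash, so no seek or second pass is needed.
func newLayer(dir string, r io.Reader) (*layer, error) {
	f, err := os.CreateTemp(dir, "layer-*.partial")
	if err != nil {
		return nil, err
	}
	defer f.Close()

	h := sha256.New()
	n, err := io.Copy(io.MultiWriter(f, h), r)
	if err != nil {
		os.Remove(f.Name())
		return nil, err
	}
	return &layer{
		Digest: "sha256:" + hex.EncodeToString(h.Sum(nil)),
		Size:   n,
		tmp:    f.Name(),
	}, nil
}

// Commit moves the temporary file to a digest-named path inside dir.
func (l *layer) Commit(dir string) (string, error) {
	dst := filepath.Join(dir, strings.ReplaceAll(l.Digest, ":", "-"))
	if err := os.Rename(l.tmp, dst); err != nil {
		return "", err
	}
	return dst, nil
}

// demo runs the sketch in a throwaway directory and returns the digest
// and the committed file's content.
func demo() (string, string, error) {
	dir, err := os.MkdirTemp("", "blobs")
	if err != nil {
		return "", "", err
	}
	defer os.RemoveAll(dir)

	l, err := newLayer(dir, strings.NewReader("hello"))
	if err != nil {
		return "", "", err
	}
	path, err := l.Commit(dir)
	if err != nil {
		return "", "", err
	}
	b, err := os.ReadFile(path)
	if err != nil {
		return "", "", err
	}
	return l.Digest, string(b), nil
}

func main() {
	digest, content, err := demo()
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(digest, content)
}
```

Taking a plain `io.Reader` rather than an `io.ReadSeeker` removes the reset-position mistake mentioned in the message, because nothing in the sketch ever rewinds the input.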

split from into one or more models · 2cb0fa7d
Michael Yang authored Nov 24, 2023

2cb0fa7d

04 Dec, 2023 1 commit

chat api (#991) · 7a0899d6

Bruce MacDonald authored Dec 04, 2023

- update chat docs
- add messages chat endpoint
- remove deprecated context and template generate parameters from docs
- context and template are still supported for the time being and will continue to work as expected
- add partial response to chat history

7a0899d6
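
The "add partial response to chat history" item can be illustrated with a small sketch. It assumes chat messages carry a role and a content string, and that a streamed reply arrives as text chunks. The `message` type and `appendPartial` function are hypothetical names for this example, not identifiers from the commit.

```go
package main

import (
	"fmt"
	"strings"
)

// message is a hypothetical stand-in for one chat message.
type message struct {
	Role    string
	Content string
}

// appendPartial joins the chunks streamed so far and records them as an
// assistant message, so an interrupted reply is still kept in the history.
// If nothing was streamed, the history is returned unchanged.
func appendPartial(history []message, chunks []string) []message {
	content := strings.Join(chunks, "")
	if content == "" {
		return history
	}
	return append(history, message{Role: "assistant", Content: content})
}

func main() {
	history := []message{{Role: "user", Content: "why is the sky blue?"}}
	// Only two chunks arrived before the stream stopped.
	history = appendPartial(history, []string{"Because of ", "Rayleigh"})
	for _, m := range history {
		fmt.Printf("%s: %s\n", m.Role, m.Content)
	}
}
```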

01 Dec, 2023 1 commit
- Fix adapter loading from SHA hash · bb80a597
  Joshua Pham authored Dec 01, 2023
  
  bb80a597
30 Nov, 2023 2 commits
- upload: fix PUT retry · 13efd5f2
  Michael Yang authored Nov 29, 2023
  
  13efd5f2
- upload: separate progress tracking · c4bdfffd
  Michael Yang authored Nov 29, 2023
  
  c4bdfffd
29 Nov, 2023 5 commits
- new hasher · 26c63418
  Michael Yang authored Nov 29, 2023
  
  26c63418
- revert checksum calculation to calculate-as-you-go · 2799784a
  Michael Yang authored Nov 21, 2023
  
  2799784a
- validate model tags on copy (#1323) · 96122b72
  Bruce MacDonald authored Nov 29, 2023
  
  96122b72
- fix: disable ':' in tag names (#1280) · c2e3b891
  Timothy Jaeryang Baek authored Nov 29, 2023
  
  Co-authored-by: rootedbox
  
  c2e3b891
- Allow setting parameters in the REPL (#1294) · cde31cb2
  Patrick Devine authored Nov 29, 2023
  
  cde31cb2