Commits · 25906d72d1482bc9dc2e4300a42c8db4823ee1a3 · OpenDAS / ollama

07 Aug, 2024 1 commit

manifest: Fix crash on startup when trying to clean up unused files (#5840) · 1829fb61

Jesse Gross authored Aug 05, 2024

Currently if the config field is missing in the manifest file (or
corrupted), Ollama will crash when it tries to read it. This can
happen at startup or when pulling new models.

This data is mostly just used for showing model information so we
can be tolerant of it not being present - it is not required to
run the models. Besides avoiding crashing, this also gives us the
ability to restructure the config in the future by pulling it
into the main manifest file.

1829fb61

02 Aug, 2024 1 commit
- lint · b732beba
  Michael Yang authored Aug 01, 2024
  
  b732beba
01 Aug, 2024 5 commits
- Refactor and format code. · 8a9f946c
  Vyacheslav Moskalev authored Aug 02, 2024
  
  8a9f946c
- Refactor code. Remove extra variable. · 3b521054
  Vyacheslav Moskalev authored Aug 01, 2024
  
  3b521054
- Better types and naming closer to style. · b0c21658
  Vyacheslav Moskalev authored Aug 01, 2024
  
  b0c21658
- Change the order of context and prompt. · 49a54831
  Vyacheslav Moskalev authored Aug 01, 2024
  
  49a54831
- Fix extra context concatenation in generate handler (#5980). · 6bc5c137
  Vyacheslav Moskalev authored Aug 01, 2024
  
  6bc5c137
30 Jul, 2024 1 commit

Add Metrics to `api\embed` response (#5709) · 1b44d873

royjhan authored Jul 30, 2024

* add prompt tokens to embed response

* rm slog

* metrics

* types

* prompt n

* clean up

* reset submodule

* update tests

* test name

* list metrics

1b44d873

26 Jul, 2024 1 commit
- include modelfile messages · 15af5584
  Michael Yang authored Jun 19, 2024
  
  15af5584
22 Jul, 2024 5 commits
- fix dupe err message (#5857) · db0968f3
  Josh authored Jul 22, 2024
  
  db0968f3
- bool · 55cd3ddc
  Michael Yang authored Jul 03, 2024
  
  55cd3ddc
- origins · d1a5227c
  Michael Yang authored Jul 03, 2024
  
  d1a5227c
- rfc: dynamic environ lookup · 35b89b2e
  Michael Yang authored Jul 03, 2024
  
  35b89b2e
- server: collect nested tool call objects when parsing (#5824) · b3e5491e
  Jeffrey Morgan authored Jul 22, 2024
  
  b3e5491e
19 Jul, 2024 1 commit
- server: validate template (#5734) · e8b954c6
  Josh authored Jul 19, 2024
```
add template validation to modelfile
```
  e8b954c6
18 Jul, 2024 2 commits
- server: check for empty tools array too (#5779) · 70b1010f
  Jeffrey Morgan authored Jul 18, 2024
  
  70b1010f
- server: only parse tool calls if tools are provided (#5771) · 319fb1ce
  Jeffrey Morgan authored Jul 18, 2024
```
* server: only parse tool calls if tools are provided

* still set `resp.Message.Content`
```
  319fb1ce
16 Jul, 2024 4 commits

remove ToolCall from GenerateResponse · c279f963
Michael Yang authored Jul 16, 2024

c279f963

add suffix support to generate endpoint · d290e875

Michael Yang authored Jun 20, 2024

this change is triggered by the presence of "suffix", particularly
useful for code completion tasks

d290e875

OpenAI: /v1/embeddings compatibility (#5285) · 987dbab0

royjhan authored Jul 16, 2024



* OpenAI v1 models

* Empty List Testing

* Add back envconfig

* v1/models docs

* Remove Docs

* OpenAI batch embed compatibility

* merge conflicts

* integrate with api/embed

* ep

* merge conflicts

* request tests

* rm resp test

* merge conflict

* merge conflict

* test fixes

* test fn renaming

* input validation for empty string

---------
Co-authored-by: jmorganca <jmorganca@gmail.com>

987dbab0

server: omit model system prompt if empty (#5717) · 4cb5d7de
Jeffrey Morgan authored Jul 16, 2024

4cb5d7de

15 Jul, 2024 2 commits

tools · d02bbebb
Michael Yang authored Jun 20, 2024

d02bbebb

Introduce `/api/embed` endpoint supporting batch embedding (#5127) · b9f5e16c

royjhan authored Jul 15, 2024

* Initial Batch Embedding

* Revert "Initial Batch Embedding"

This reverts commit c22d54895a280b54c727279d85a5fc94defb5a29.

* Initial Draft

* mock up notes

* api/embed draft

* add server function

* check normalization

* clean up

* normalization

* playing around with truncate stuff

* Truncation

* Truncation

* move normalization to go

* Integration Test Template

* Truncation Integration Tests

* Clean up

* use float32

* move normalize

* move normalize test

* refactoring

* integration float32

* input handling and handler testing

* Refactoring of legacy and new

* clear comments

* merge conflicts

* touches

* embedding type 64

* merge conflicts

* fix hanging on single string

* refactoring

* test values

* set context length

* clean up

* testing clean up

* testing clean up

* remove function closure

* Revert "remove function closure"

This reverts commit 55d48c6ed17abe42e7a122e69d603ef0c1506787.

* remove function closure

* remove redundant error check

* clean up

* more clean up

* clean up

b9f5e16c

14 Jul, 2024 1 commit
- remove template (#5655) · 057d3186
  Patrick Devine authored Jul 13, 2024
  
  057d3186
13 Jul, 2024 2 commits
- server: prepend system message in chat handler · f7ee0123
  jmorganca authored Jul 13, 2024
  
  f7ee0123
- server: fix `context`, `load_duration` and `total_duration` fields (#5676) · 1ed0aa8f
  Jeffrey Morgan authored Jul 13, 2024
```
* server: fix `contet`, `load_duration` and `total_duration` fields

* Update server/routes.go
```
  1ed0aa8f
05 Jul, 2024 3 commits
- fix model reloading · ac7a842e
  Michael Yang authored Jul 03, 2024
```
ensure runtime model changes (template, system prompt, messages,
options) are captured on model updates without needing to reload the
server
```
  ac7a842e
- comments · 2c3fe1fd
  Michael Yang authored Jun 20, 2024
  
  2c3fe1fd
- update message processing · 269ed6e6
  Michael Yang authored Jun 17, 2024
  
  269ed6e6
03 Jul, 2024 1 commit

Only set default keep_alive on initial model load · 955f2a4e

Daniel Hiltgen authored Jul 02, 2024

This change fixes the handling of keep_alive so that if client
request omits the setting, we only set this on initial load.  Once
the model is loaded, if new requests leave this unset, we'll keep
whatever keep_alive was there.

955f2a4e

02 Jul, 2024 3 commits

fix generate template · 65a5040e
Michael Yang authored Jul 02, 2024

65a5040e

OpenAI: v1/completions compatibility (#5209) · d626b99b

royjhan authored Jul 02, 2024



* OpenAI v1 models

* Refactor Writers

* Add Test

Co-Authored-By: Attila Kerekes

* Credit Co-Author
Co-Authored-By: Attila Kerekes <439392+keriati@users.noreply.github.com>

* Empty List Testing

* Use Namespace for Ownedby

* Update Test

* Add back envconfig

* v1/models docs

* Use ModelName Parser

* Test Names

* Remove Docs

* Clean Up

* Test name
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Add Middleware for Chat and List

* Completions Endpoint

* Testing Cleanup

* Test with Fatal

* Add functionality to chat test

* Rename function

* float types

* type cleanup

* cleaning

* more cleaning

* Extra test cases

* merge conflicts

* merge conflicts

* merge conflicts

* merge conflicts

* cleaning

* cleaning

---------
Co-authored-by: Attila Kerekes <439392+keriati@users.noreply.github.com>
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

d626b99b

OpenAI: /v1/models and /v1/models/{model} compatibility (#5007) · 996bb1b8

royjhan authored Jul 02, 2024



* OpenAI v1 models

* Refactor Writers

* Add Test

Co-Authored-By: Attila Kerekes

* Credit Co-Author
Co-Authored-By: Attila Kerekes <439392+keriati@users.noreply.github.com>

* Empty List Testing

* Use Namespace for Ownedby

* Update Test

* Add back envconfig

* v1/models docs

* Use ModelName Parser

* Test Names

* Remove Docs

* Clean Up

* Test name
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Add Middleware for Chat and List

* Testing Cleanup

* Test with Fatal

* Add functionality to chat test

* OpenAI: /v1/models/{model} compatibility (#5028)

* Retrieve Model

* OpenAI Delete Model

* Retrieve Middleware

* Remove Delete from Branch

* Update Test

* Middleware Test File

* Function name

* Cleanup

* Test Update

* Test Update

---------
Co-authored-by: Attila Kerekes <439392+keriati@users.noreply.github.com>
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

996bb1b8

01 Jul, 2024 2 commits
- add capabilities · a30915bd
  Michael Yang authored Jun 11, 2024
  
  a30915bd
- rename templates to template · 58e3fff3
  Michael Yang authored Jun 10, 2024
  
  58e3fff3
25 Jun, 2024 1 commit

llm: speed up gguf decoding by a lot (#5246) · cb42e607

Blake Mizerany authored Jun 24, 2024

Previously, some costly things were causing the loading of GGUF files
and their metadata and tensor information to be VERY slow:

  * Too many allocations when decoding strings
  * Hitting disk for each read of each key and value, resulting in a
    not-okay amount of syscalls/disk I/O.

The show API is now down to 33ms from 800ms+ for llama3 on a macbook pro
m3.

This commit also prevents collecting large arrays of values when
decoding GGUFs (if desired). When such keys are encountered, their
values are null, and are encoded as such in JSON.

Also, this fixes a broken test that was not encoding valid GGUF.

cb42e607

21 Jun, 2024 1 commit

Sort the ps output · 642cee13

Daniel Hiltgen authored Jun 21, 2024

Provide consistent ordering for the ps command - longest duration listed first

642cee13

19 Jun, 2024 1 commit

Extend api/show and ollama show to return more model info (#4881) · fedf7163

royjhan authored Jun 19, 2024



* API Show Extended

* Initial Draft of Information
Co-Authored-By: Patrick Devine <pdevine@sonic.net>

* Clean Up

* Descriptive arg error messages and other fixes

* Second Draft of Show with Projectors Included

* Remove Chat Template

* Touches

* Prevent wrapping from files

* Verbose functionality

* Docs

* Address Feedback

* Lint

* Resolve Conflicts

* Function Name

* Tests for api/show model info

* Show Test File

* Add Projector Test

* Clean routes

* Projector Check

* Move Show Test

* Touches

* Doc update

---------
Co-authored-by: Patrick Devine <pdevine@sonic.net>

fedf7163

16 Jun, 2024 1 commit
- Add ModifiedAt Field to /api/show (#5033) · 89c79bec
  royjhan authored Jun 15, 2024
```
* Add Mod Time to Show

* Error Handling
```
  89c79bec
06 Jun, 2024 1 commit
- API app/browser access (#4879) · 1a29e9a8
  royjhan authored Jun 06, 2024
```
* API app/browser access

* Add tauri (resolves #2291, #4791, #3799, #4388)
```
  1a29e9a8