Commits · e172f095ba4af2c98d7744ce4ffcf4cd3a8e123c · OpenDAS / ollama

01 Apr, 2025 1 commit

api: return model capabilities from the show endpoint (#10066) · e172f095

Bruce MacDonald authored Apr 01, 2025

With support for multimodal models becoming more varied and common it is important for clients to be able to easily see what capabilities a model has. Retuning these from the show endpoint will allow clients to easily see what a model can do.

e172f095

13 Mar, 2025 1 commit

add verbose mode to the show command (#9640) · 4bed7392

Patrick Devine authored Mar 13, 2025

Add metadata and tensor information to the show command to be able to
see more information about a model. This outputs the same data as
shown on the model details page on ollama.com

4bed7392

05 Mar, 2025 1 commit

server/internal/registry: take over pulls from server package (#9485) · e2252d0f

Blake Mizerany authored Mar 05, 2025

This commit replaces the old pull implementation in the server package
with the new, faster, more robust pull implementation in the registry
package.

The new endpoint, and now the remove endpoint too, are behind the
feature gate "client2" enabled only by setting the OLLAMA_EXPERIMENT
environment variable include "client2".

Currently, the progress indication is wired to perform the same as the
previous implementation to avoid making changes to the CLI, and because
the status reports happen at the start of the download, and the end of
the write to disk, the progress indication is not as smooth as it could
be. This is a known issue and will be addressed in a future change.

This implementation may be ~0.5-1.0% slower in rare cases, depending on
network and disk speed, but is generally MUCH faster and more robust
than the its predecessor in all other cases.

e2252d0f

24 Feb, 2025 1 commit
- config: allow setting context length through env var (#8938) · 314573bf
  Parth Sareen authored Feb 24, 2025
```
* envconfig: allow setting context length through env var
```
  314573bf
08 Jan, 2025 1 commit
- llama: update vendored code to commit 46e3556 (#8308) · 1deafd82
  Jeffrey Morgan authored Jan 08, 2025
  
  1deafd82
03 Jan, 2025 1 commit

api: remove unused create fields · 29a8975c

Bruce MacDonald authored Jan 03, 2025

These fields are deprecated, but specifying them will not do anything. Removing them as the other deprecated fields will still work, but these do not, so they dont match our existing pattern.

29a8975c

01 Jan, 2025 1 commit
- Update the /api/create endpoint to use JSON (#7935) · 86a622cb
  Patrick Devine authored Dec 31, 2024
```
Replaces `POST /api/create` to use JSON instead of a Modelfile.

This is a breaking change.
```
  86a622cb
11 Dec, 2024 1 commit
- llama: update vendored code to commit 40c6d79f (#7875) · 527cc978
  Jeffrey Morgan authored Dec 10, 2024
  
  527cc978
05 Dec, 2024 2 commits
- api: add generate endpoint for structured outputs (#7939) · c6c52627
  Parth Sareen authored Dec 04, 2024
  
  c6c52627
- api: structured outputs - chat endpoint (#7900) · 630e7dc6
  Parth Sareen authored Dec 04, 2024
```
Adds structured outputs to chat endpoint
---------
Co-authored-by: Michael Yang <mxyng@pm.me>
Co-authored-by: Hieu Nguyen <hieunguyen1053@outlook.com>
```
  630e7dc6
30 Nov, 2024 1 commit
- Enable index tracking for tools - openai api support (#7888) · 5f805118
  Parth Sareen authored Nov 29, 2024
  
  5f805118
12 Nov, 2024 1 commit
- api: fix typos in Go Doc comments (#7620) · d48c1c5a
  Evan authored Nov 11, 2024
  
  d48c1c5a
06 Nov, 2024 1 commit

runner.go: Remove unused arguments · a9094176

Jesse Gross authored Oct 30, 2024

Now that server.cpp is gone, we don't need to keep passing arguments
that were only ignored and only kept for compatibility.

a9094176

28 Aug, 2024 1 commit
- update deprecated warnings · 8e6da3cb
  Michael Yang authored Aug 27, 2024
  
  8e6da3cb
06 Aug, 2024 1 commit
- Fixed invalid option provided not displaying the invalid option name problem. (#6202) · d4a7216c
  Chua Chee Seng authored Aug 07, 2024
  
  d4a7216c
05 Aug, 2024 1 commit

Implement linux NUMA detection · f457d634

Daniel Hiltgen authored Aug 05, 2024

If the system has multiple numa nodes, enable numa support in llama.cpp
If we detect numactl in the path, use that, else use the basic "distribute" mode.

f457d634

30 Jul, 2024 1 commit

Add Metrics to `api\embed` response (#5709) · 1b44d873

royjhan authored Jul 30, 2024

* add prompt tokens to embed response

* rm slog

* metrics

* types

* prompt n

* clean up

* reset submodule

* update tests

* test name

* list metrics

1b44d873

29 Jul, 2024 1 commit
- api: add stringifier for `Tool` (#5891) · 46e6327e
  Jeffrey Morgan authored Jul 29, 2024
  
  46e6327e
27 Jul, 2024 1 commit
- feat: add support for min_p (resolve #1142) (#1825) · f3d7a481
  Tibor Schmidt authored Jul 27, 2024
  
  f3d7a481
18 Jul, 2024 1 commit
- always provide content even if empty (#5778) · 84e5721f
  Jeffrey Morgan authored Jul 18, 2024
  
  84e5721f
17 Jul, 2024 1 commit
- marshal json automatically for some template values (#5758) · b2554455
  Michael Yang authored Jul 17, 2024
  
  b2554455
16 Jul, 2024 4 commits
- remove ToolCall from GenerateResponse · c279f963
  Michael Yang authored Jul 16, 2024
  
  c279f963
- add suffix support to generate endpoint · d290e875
  Michael Yang authored Jun 20, 2024
```
this change is triggered by the presence of "suffix", particularly
useful for code completion tasks
```
  d290e875
- remove unneeded tool calls · 5a83f79a
  Michael Yang authored Jul 16, 2024
  
  5a83f79a
- server: return empty slice on empty `/api/embed` request (#5713) · 7ac6d462
  Jeffrey Morgan authored Jul 15, 2024
```
* server: return empty slice on empty `/api/embed` request

* fix tests
```
  7ac6d462
15 Jul, 2024 3 commits

tools · d02bbebb
Michael Yang authored Jun 20, 2024

d02bbebb
server: lowercase roles for compatibility with clients (#5695) · 9e35d9bb
Jeffrey Morgan authored Jul 15, 2024

9e35d9bb

Introduce `/api/embed` endpoint supporting batch embedding (#5127) · b9f5e16c

royjhan authored Jul 15, 2024

* Initial Batch Embedding

* Revert "Initial Batch Embedding"

This reverts commit c22d54895a280b54c727279d85a5fc94defb5a29.

* Initial Draft

* mock up notes

* api/embed draft

* add server function

* check normalization

* clean up

* normalization

* playing around with truncate stuff

* Truncation

* Truncation

* move normalization to go

* Integration Test Template

* Truncation Integration Tests

* Clean up

* use float32

* move normalize

* move normalize test

* refactoring

* integration float32

* input handling and handler testing

* Refactoring of legacy and new

* clear comments

* merge conflicts

* touches

* embedding type 64

* merge conflicts

* fix hanging on single string

* refactoring

* test values

* set context length

* clean up

* testing clean up

* testing clean up

* remove function closure

* Revert "remove function closure"

This reverts commit 55d48c6ed17abe42e7a122e69d603ef0c1506787.

* remove function closure

* remove redundant error check

* clean up

* more clean up

* clean up

b9f5e16c

14 Jul, 2024 1 commit
- remove template (#5655) · 057d3186
  Patrick Devine authored Jul 13, 2024
  
  057d3186
02 Jul, 2024 1 commit

OpenAI: /v1/models and /v1/models/{model} compatibility (#5007) · 996bb1b8

royjhan authored Jul 02, 2024



* OpenAI v1 models

* Refactor Writers

* Add Test

Co-Authored-By: Attila Kerekes

* Credit Co-Author
Co-Authored-By: Attila Kerekes <439392+keriati@users.noreply.github.com>

* Empty List Testing

* Use Namespace for Ownedby

* Update Test

* Add back envconfig

* v1/models docs

* Use ModelName Parser

* Test Names

* Remove Docs

* Clean Up

* Test name
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Add Middleware for Chat and List

* Testing Cleanup

* Test with Fatal

* Add functionality to chat test

* OpenAI: /v1/models/{model} compatibility (#5028)

* Retrieve Model

* OpenAI Delete Model

* Retrieve Middleware

* Remove Delete from Branch

* Update Test

* Middleware Test File

* Function name

* Cleanup

* Test Update

* Test Update

---------
Co-authored-by: Attila Kerekes <439392+keriati@users.noreply.github.com>
Co-authored-by: Jeffrey Morgan <jmorgan...

996bb1b8

01 Jul, 2024 1 commit
- Switch use_mmap to a pointer type · 97c9e117
  Daniel Hiltgen authored Jun 28, 2024
```
This uses nil as undefined for a cleaner implementation.
```
  97c9e117
21 Jun, 2024 1 commit

Fix use_mmap parsing for modelfiles · 7e774922

Daniel Hiltgen authored Jun 21, 2024

Add the new tristate parsing logic for the code path for modelfiles,
as well as a unit test.

7e774922

19 Jun, 2024 1 commit

Extend api/show and ollama show to return more model info (#4881) · fedf7163

royjhan authored Jun 19, 2024



* API Show Extended

* Initial Draft of Information
Co-Authored-By: Patrick Devine <pdevine@sonic.net>

* Clean Up

* Descriptive arg error messages and other fixes

* Second Draft of Show with Projectors Included

* Remove Chat Template

* Touches

* Prevent wrapping from files

* Verbose functionality

* Docs

* Address Feedback

* Lint

* Resolve Conflicts

* Function Name

* Tests for api/show model info

* Show Test File

* Add Projector Test

* Clean routes

* Projector Check

* Move Show Test

* Touches

* Doc update

---------
Co-authored-by: Patrick Devine <pdevine@sonic.net>

fedf7163

17 Jun, 2024 1 commit

Adjust mmap logic for cuda windows for faster model load · 17179679

Daniel Hiltgen authored Jun 17, 2024

On Windows, recent llama.cpp changes make mmap slower in most
cases, so default to off.  This also implements a tri-state for
use_mmap so we can detect the difference between a user provided
value of true/false, or unspecified.

17179679

16 Jun, 2024 1 commit
- Add ModifiedAt Field to /api/show (#5033) · 89c79bec
  royjhan authored Jun 15, 2024
```
* Add Mod Time to Show

* Error Handling
```
  89c79bec
12 Jun, 2024 1 commit
- move OLLAMA_HOST to envconfig (#5009) · c69bc19e
  Patrick Devine authored Jun 12, 2024
  
  c69bc19e
06 Jun, 2024 1 commit
- Separate ListResponse and ModelResponse for api/tags vs api/ps (#4842) · 4bf1da49
  royjhan authored Jun 06, 2024
```
* Remove false time fields

* Struct Separation for List and Process

* Remove Marshaler
```
  4bf1da49
04 Jun, 2024 1 commit
- some gocritic · c895a7d1
  Michael Yang authored May 21, 2024
  
  c895a7d1
14 May, 2024 1 commit
- Ollama `ps` command for showing currently loaded models (#4327) · 68459888
  Patrick Devine authored May 13, 2024
  
  68459888
10 May, 2024 1 commit

Use `--quantize` flag and `quantize` api parameter (#4321) · 6602e793

Jeffrey Morgan authored May 10, 2024



* rename `--quantization` to `--quantize`

* backwards

* Update api/types.go
Co-authored-by: Michael Yang <mxyng@pm.me>

---------
Co-authored-by: Michael Yang <mxyng@pm.me>

6602e793