1. 06 Jan, 2025 1 commit
  2. 04 Jan, 2025 1 commit
  3. 03 Jan, 2025 2 commits
  4. 01 Jan, 2025 1 commit
  5. 29 Dec, 2024 4 commits
  6. 28 Dec, 2024 1 commit
  7. 27 Dec, 2024 2 commits
  8. 25 Dec, 2024 2 commits
  9. 23 Dec, 2024 3 commits
  10. 22 Dec, 2024 1 commit
  11. 20 Dec, 2024 2 commits
  12. 19 Dec, 2024 1 commit
  13. 18 Dec, 2024 1 commit
  14. 17 Dec, 2024 6 commits
      llama: Ensure KV cache is fully defragmented. · 08a832b4
      Jesse Gross authored
      Sometimes the KV cache requires defragmentation even without
      triggering the threshold heuristic. In this case, decoding
      will not be able to find a KV cache slot. This is particularly
      difficult for the caller to handle if it happens in between
      ubatches. To avoid this, we should immediately trigger a defrag.
      
      In addition, a heavily fragmented cache can require more than
      max_moves to defragment. Currently, we stop when we hit the limit,
      but this can leave a cache that still does not have adequate space
      even after defragmentation is triggered. Instead, we should do
      multiple batches of processing until everything is complete.
      
      Fixes #7949
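      
      The fix can be pictured as a small retry loop around slot allocation:
      defragment on demand when allocation fails, and run as many bounded
      passes as it takes. A rough Go sketch, using hypothetical names
      (kvCache, findSlot, defragPass, maxMoves) rather than the actual
      runner API:
      
      ```go
      // Sketch only: if no KV cache slot is available, defragment immediately
      // instead of failing, and keep running bounded passes until the cache is
      // compact enough.
      package kvcache
      
      import "errors"
      
      var errNoSlot = errors.New("no KV cache slot available")
      
      type kvCache struct{ /* cell bookkeeping elided */ }
      
      // findSlot tries to reserve contiguous cells for n tokens; it returns
      // errNoSlot when the free space is too fragmented.
      func (c *kvCache) findSlot(n int) error { return errNoSlot }
      
      // defragPass moves at most maxMoves cells toward the front of the cache
      // and reports whether defragmentation is complete.
      func (c *kvCache) defragPass(maxMoves int) (done bool) { return true }
      
      // reserve finds space for n tokens, defragmenting on demand rather than
      // only when the fragmentation-threshold heuristic fires.
      func (c *kvCache) reserve(n, maxMoves int) error {
          if err := c.findSlot(n); err == nil {
              return nil
          }
          // A single pass capped at maxMoves may not be enough for a heavily
          // fragmented cache, so loop until a pass reports completion.
          for !c.defragPass(maxMoves) {
          }
          return c.findSlot(n)
      }
      ```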
      llm: do not error on "null" format (#8139) · 2ddc32d5
      Blake Mizerany authored
      This fixes another regression introduced by the previous commit, which
      fixed other known bugs.
      llm: do not silently fail for supplied, but invalid formats (#8130) · 87f0a49f
      Blake Mizerany authored
      The changes in #8002 fixed bugs that mangled JSON Schemas, along with
      a bug where the server would silently fail when clients requested
      invalid formats. Unfortunately, they also introduced a bug where the
      server would reject requests with an empty format, which should be
      allowed.
      
      The change in #8127 updated the code to allow the empty format, but also
      reintroduced the regression where the server would silently fail when
      the format was set, but invalid.
      
      This commit fixes both regressions. The server does not reject the empty
      format, but it does reject invalid formats. It also adds tests to help
      us catch regressions in the future.
      
      Also, the updated code provides a more detailed error message when a
      client sends a non-empty but invalid format, echoing the invalid format
      in the response.
      
      This commit also takes the opportunity to remove superfluous linter
      checks.
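      
      Taken together, the two format commits above converge on a simple rule:
      an absent, empty, or "null" format is accepted, while a supplied but
      malformed format is rejected with an error that echoes the offending
      value. A hedged Go sketch of that rule (checkFormat and the message
      text are hypothetical, not the server's actual code):
      
      ```go
      package llmformat
      
      import (
          "encoding/json"
          "fmt"
      )
      
      // checkFormat accepts an absent, empty, or "null" format, but rejects a
      // supplied format that is not valid JSON, echoing the bad value so the
      // client can see exactly what was refused.
      func checkFormat(format json.RawMessage) error {
          s := string(format)
          if len(format) == 0 || s == "null" || s == `""` {
              return nil // no format requested
          }
          if !json.Valid(format) {
              return fmt.Errorf("invalid format: %q", format)
          }
          return nil
      }
      ```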
      darwin: restore multiple runners for x86 (#8125) · 8f805dd7
      Daniel Hiltgen authored
      In 0.5.2 we simplified packaging to build AVX-only for macOS x86. It looks like
      there may still be some non-AVX systems out there, so this puts back the prior
      logic of building the primary binary without AVX, plus two runners for AVX and AVX2.
      These will be packaged in the app bundle only, so the stand-alone binary will now
      ship without AVX support on macOS. On ARM, these runners will also be reported
      as available in the log, but they're dormant and will never be used at runtime.
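      
      Conceptually, the runtime just picks the most capable bundled runner the
      host CPU supports; on ARM the x86 feature flags never report true, which
      is why the extra runners appear in the log but stay dormant. An
      illustrative Go sketch using golang.org/x/sys/cpu (the runner names are
      placeholders, not the actual packaging code):
      
      ```go
      package runners
      
      import "golang.org/x/sys/cpu"
      
      // pickRunner returns the most capable bundled CPU runner the host supports.
      // On ARM builds the x86 feature flags are false, so the AVX runners are
      // listed but never selected.
      func pickRunner() string {
          switch {
          case cpu.X86.HasAVX2:
              return "cpu_avx2"
          case cpu.X86.HasAVX:
              return "cpu_avx"
          default:
              return "cpu" // the no-AVX primary binary
          }
      }
      ```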
  15. 16 Dec, 2024 2 commits
  16. 15 Dec, 2024 1 commit
  17. 14 Dec, 2024 2 commits
  18. 13 Dec, 2024 2 commits
  19. 12 Dec, 2024 2 commits
  20. 11 Dec, 2024 3 commits
      server: more support for mixed-case model names (#8017) · b1fd7fef
      Blake Mizerany authored
      Fixes #7944
      ci: fix linux version (#8054) · 36d111e7
      Daniel Hiltgen authored
      Pass through the version override so the makefiles use it
      llama: preserve field order in user-defined JSON schemas (#8002) · 9039c821
      Blake Mizerany authored
      Previously we decoded and re-encoded JSON schemas during validation,
      which served no purpose since json.RawMessage already validates JSON
      syntax. Worse, the re-encoding lost field ordering from the original
      schema, which affects inference quality during step-by-step reasoning.
      
      While fixing this ordering issue by using json.RawMessage directly,
      testing revealed that schema_to_grammar (from llama.cpp) also fails to
      preserve field order during grammar generation. This appears to be the
      root cause of inference degradation.
      
      This change prevents us from mangling the user's original schema order,
      but we still need to address the ordering issue in schema_to_grammar.
      That will be a separate change.
      
      Updates #7978
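      
      The ordering problem is easy to demonstrate: decoding a schema into a Go
      map and re-marshaling it sorts the keys, while keeping the original bytes
      as json.RawMessage still validates them but passes them through in the
      author's order. A small self-contained example (not the server's actual
      request types):
      
      ```go
      package main
      
      import (
          "encoding/json"
          "fmt"
      )
      
      func main() {
          schema := []byte(`{"steps":{"type":"array"},"answer":{"type":"string"}}`)
      
          // Decode/re-encode: map keys come back alphabetically, "answer" first.
          var m map[string]any
          _ = json.Unmarshal(schema, &m)
          reencoded, _ := json.Marshal(m)
          fmt.Println(string(reencoded))
      
          // json.RawMessage: the input is still validated by Unmarshal, but the
          // bytes pass through untouched, so "steps" stays before "answer".
          var raw json.RawMessage
          _ = json.Unmarshal(schema, &raw)
          fmt.Println(string(raw))
      }
      ```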