- 11 Dec, 2024 9 commits
-
-
Daniel Hiltgen authored
Pass through the version override so the makefiles use it
-
Blake Mizerany authored
Previously we decoded and re-encoded JSON schemas during validation, which served no purpose since json.RawMessage already validates JSON syntax. Worse, the re-encoding lost field ordering from the original schema, which affects inference quality during step-by-step reasoning.

While fixing this ordering issue by using json.RawMessage directly, testing revealed that schema_to_grammar (from llama.cpp) also fails to preserve field order during grammar generation. This appears to be the root cause of inference degradation.

This change prevents us from mangling the user's original schema order, but we still need to address the ordering issue in schema_to_grammar. That will be a separate change.

Updates #7978
-
Daniel Hiltgen authored
upload-artifacts strips off leading common paths, so when the ./build/ artifacts were removed, the ./dist/windows-amd64 prefix became common and was stripped as well, causing the later download-artifacts step to place the files in the wrong location
-
Daniel Hiltgen authored
The new build embeds the arm runner in the main binary, so there is no longer a lib/ollama directory
-
Daniel Hiltgen authored
Remove the no-longer-relevant build log directory
-
Jeffrey Morgan authored
-
Blake Mizerany authored
-
湛露先生 authored
Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>
-
Phil Wornath authored
-
- 10 Dec, 2024 8 commits
-
-
Tao Zuhong authored
-
frob authored
-
Dr. Daniel Bender authored
-
Daniel Hiltgen authored
The final implementation of #7499 removed dynamic vector requirements in favor of a simpler filename-based model; this was leftover logic that is no longer needed.
-
Stefan Weil authored
-
Daniel Hiltgen authored
The "F" was missing.
-
Daniel Hiltgen authored
* llama: wire up builtin runner

  This adds a new entrypoint into the ollama CLI to run the cgo-built runner. On Mac arm64, this will have GPU support, but on all other platforms it will be the lowest common denominator CPU build. After we fully transition to the new Go runners, more tech debt can be removed and we can stop building the "default" runner via make and rely on the builtin always.

* build: Make target improvements

  Add a few new targets and help for building locally. This also adjusts the runner lookup to favor local builds, then runners relative to the executable, and finally payloads.

* Support customized CPU flags for runners

  This implements a simplified custom CPU flags pattern for the runners. When built without overrides, the runner name contains the vector flag we check for (AVX) to ensure we don't try to run on unsupported systems and crash. If the user builds a customized set, we omit the naming scheme and don't check for compatibility. This avoids checking requirements at runtime, so that logic has been removed as well. This can be used to build GPU runners with no vector flags, or CPU/GPU runners with additional flags (e.g. AVX512) enabled.

* Use relative paths

  If the user checks out the repo in a path that contains spaces, make gets really confused, so use relative paths for everything in-repo to avoid breakage.

* Remove payloads from main binary

* install: clean up prior libraries

  This removes support for v0.3.6 and older versions (before the tar bundle) and ensures we clean up prior libraries before extracting the bundle(s). Without this change, runners and dependent libraries could leak when we update and lead to subtle runtime errors.
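The name-based compatibility check described above can be sketched as follows. This is a hypothetical helper (pickRunner and the runner names are assumptions for illustration, not ollama's actual code): a runner whose name embeds a vector flag like "avx" is skipped on hosts that lack it, so no runtime requirements check is needed.

```go
package main

import (
	"fmt"
	"strings"
)

// pickRunner returns the first runner the host can use. Runner names that
// embed a vector flag ("avx") are skipped when the host lacks AVX support;
// names without the scheme (e.g. a custom build) are accepted as-is.
func pickRunner(runners []string, hostHasAVX bool) string {
	for _, r := range runners {
		if strings.Contains(r, "avx") && !hostHasAVX {
			continue // name advertises AVX, host can't run it
		}
		return r
	}
	return ""
}

func main() {
	runners := []string{"cpu_avx2", "cpu_avx", "cpu"}
	fmt.Println(pickRunner(runners, false)) // falls through to plain "cpu"
	fmt.Println(pickRunner(runners, true))  // first match: "cpu_avx2"
}
```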
-
frob authored
Co-authored-by: Richard Lyons <frob@cloudstaff.com>
-
- 09 Dec, 2024 1 commit
-
-
Jesse Gross authored
New lines can be an important part of a user's prompt, and trimming them can alter the results. We previously only trimmed prompts with images, but refactoring brought this behavior to all prompts, where it became more noticeable. The /generate endpoint adds less whitespace and therefore doesn't need it trimmed out; this brings the same behavior to /chat. Thanks to @gabe-l-hart for spotting the issue! Fixes #7795
-
- 08 Dec, 2024 2 commits
-
-
Yannick Gloster authored
-
湛露先生 authored
-
- 06 Dec, 2024 3 commits
-
-
Parth Sareen authored
-
Michael authored
readme: add llama3.3 to readme
-
Parth Sareen authored
-
- 05 Dec, 2024 3 commits
-
-
Jeffrey Morgan authored
-
Parth Sareen authored
-
Parth Sareen authored
Adds structured outputs to chat endpoint

Co-authored-by: Michael Yang <mxyng@pm.me>
Co-authored-by: Hieu Nguyen <hieunguyen1053@outlook.com>
-
- 04 Dec, 2024 3 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Sam authored
-
- 03 Dec, 2024 2 commits
- 02 Dec, 2024 2 commits
-
-
Tigran authored
-
David Mayboroda authored
-
- 30 Nov, 2024 3 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Parth Sareen authored
-
- 29 Nov, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 28 Nov, 2024 1 commit
-
-
TheCookingSenpai authored
-
- 27 Nov, 2024 2 commits
-
-
Parth Sareen authored
-
ItzCrazyKns authored
Closes #7627
-