- 10 Dec, 2024 3 commits
-
-
Daniel Hiltgen authored
The "F" was missing.
-
Daniel Hiltgen authored
* llama: wire up builtin runner

  This adds a new entrypoint into the ollama CLI to run the cgo-built runner. On Mac arm64, this will have GPU support, but on all other platforms it will be the lowest-common-denominator CPU build. After we fully transition to the new Go runners, more tech debt can be removed and we can stop building the "default" runner via make and rely on the builtin always.

* build: Make target improvements

  Add a few new targets and help for building locally. This also adjusts the runner lookup to favor local builds, then runners relative to the executable, and finally payloads.

* Support customized CPU flags for runners

  This implements a simplified custom CPU flags pattern for the runners. When built without overrides, the runner name contains the vector flag we check for (AVX) to ensure we don't try to run on unsupported systems and crash. If the user builds a customized set, we omit the naming scheme and don't check for compatibility. This avoids checking requirements at runtime, so that logic has been removed as well. This can be used to build GPU runners with no vector flags, or CPU/GPU runners with additional flags (e.g. AVX512) enabled.

* Use relative paths

  If the user checks out the repo in a path that contains spaces, make gets really confused, so use relative paths for everything in-repo to avoid breakage.

* Remove payloads from main binary

* install: clean up prior libraries

  This removes support for v0.3.6 and older versions (before the tar bundle) and ensures we clean up prior libraries before extracting the bundle(s). Without this change, runners and dependent libraries could leak across updates and lead to subtle runtime errors.
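The lookup order and the naming-scheme compatibility check described above can be sketched as follows. This is an illustrative sketch only, assuming hypothetical names (`pickRunner`, `compatible`, the `"local"`/`"exe"`/`"payload"` keys), not Ollama's actual code.

```go
package main

import (
	"fmt"
	"strings"
)

// pickRunner sketches the lookup order from the commit: prefer a local
// build, then a runner relative to the executable, then payloads.
func pickRunner(candidates map[string]string) string {
	for _, source := range []string{"local", "exe", "payload"} {
		if path, ok := candidates[source]; ok {
			return path
		}
	}
	return ""
}

// compatible sketches the naming-scheme check: a default build embeds the
// vector flag (e.g. "avx") in the runner name and is only usable when the
// host supports it; a customized build omits the flag and skips the check.
func compatible(runnerName string, hostHasAVX bool) bool {
	if strings.Contains(runnerName, "avx") {
		return hostHasAVX
	}
	return true // customized build: no runtime requirement check
}

func main() {
	fmt.Println(pickRunner(map[string]string{"exe": "/opt/ollama/runner"}))
	fmt.Println(compatible("cpu_avx", false))
}
```

The point of the naming scheme is that the compatibility decision is made at build time and encoded in the file name, so no CPU-feature detection is needed at runtime for custom builds.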
-
frob authored
Co-authored-by: Richard Lyons <frob@cloudstaff.com>
-
- 09 Dec, 2024 1 commit
-
-
Jesse Gross authored
Newlines can be an important part of a user's prompt, and trimming them can alter the results. We previously only trimmed prompts with images, but refactoring brought this behavior to all prompts, where it became more noticeable. The /generate endpoint adds less whitespace and therefore doesn't need to trim it out; this brings the same behavior to /chat. Thanks to @gabe-l-hart for spotting the issue! Fixes #7795
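Why trailing newlines matter can be shown with a minimal sketch. This is an assumption-laden illustration (the `buildPrompt` helper is hypothetical), not Ollama's actual prompt handling:

```go
package main

import (
	"fmt"
	"strings"
)

// A trailing newline can be meaningful to a model (e.g. it may signal that
// a list or code block has ended), so trimming it unconditionally changes
// what the model actually sees.
func buildPrompt(userText string, trim bool) string {
	if trim {
		return strings.TrimSpace(userText)
	}
	return userText
}

func main() {
	prompt := "Continue this list:\n1. apples\n2. bananas\n"
	fmt.Printf("%q\n", buildPrompt(prompt, true))  // trailing newline lost
	fmt.Printf("%q\n", buildPrompt(prompt, false)) // trailing newline preserved
}
```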
-
- 08 Dec, 2024 2 commits
-
-
Yannick Gloster authored
-
湛露先生 authored
-
- 06 Dec, 2024 3 commits
-
-
Parth Sareen authored
-
Michael authored
readme: add llama3.3 to readme
-
Parth Sareen authored
-
- 05 Dec, 2024 3 commits
-
-
Jeffrey Morgan authored
-
Parth Sareen authored
-
Parth Sareen authored
Adds structured outputs to chat endpoint

Co-authored-by: Michael Yang <mxyng@pm.me>
Co-authored-by: Hieu Nguyen <hieunguyen1053@outlook.com>
-
- 04 Dec, 2024 3 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Sam authored
-
- 03 Dec, 2024 2 commits
- 02 Dec, 2024 2 commits
-
-
Tigran authored
-
David Mayboroda authored
-
- 30 Nov, 2024 3 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Parth Sareen authored
-
- 29 Nov, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 28 Nov, 2024 1 commit
-
-
TheCookingSenpai authored
-
- 27 Nov, 2024 3 commits
-
-
Parth Sareen authored
-
ItzCrazyKns authored
Closes #7627
-
Bruce MacDonald authored
writeError takes a code argument that is no longer used. Remove it for clarity.
-
- 26 Nov, 2024 4 commits
-
-
Jesse Gross authored
When processing a prompt, we look for image tags of the form [img-0], which are inserted by the Ollama server process. However, this can cause errors if the original prompt contains these tags; typically an image-not-found error is returned. This changes the tag-searching behavior to be similar to the 0.3.x series, which largely avoids these problems. However, they can still happen when input text containing these tags is used with image models. The correct solution is to escape the tags, but that is part of a larger issue with special sequences in general, so this is an incremental fix that should avoid the problem in the majority of cases.
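The tag format described above can be matched with a simple regular expression. This is a simplified sketch of the [img-N] placeholder scan, not the exact matcher Ollama uses:

```go
package main

import (
	"fmt"
	"regexp"
)

// imgTag matches [img-N] placeholders, which the server inserts into the
// prompt to mark image positions. If user text already contains this
// pattern, a naive scan will treat it as an image reference, which is the
// failure mode the commit describes.
var imgTag = regexp.MustCompile(`\[img-(\d+)\]`)

func findImageTags(prompt string) []string {
	return imgTag.FindAllString(prompt, -1)
}

func main() {
	fmt.Println(findImageTags("describe [img-0] and [img-1]"))
}
```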
-
Jesse Gross authored
This also makes it easier to truncate long inputs in the same way as shifting, though it does not actually implement truncation. This type of truncation trades off quality against time to first token.
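The trade-off mentioned above can be sketched in a few lines. This is a hypothetical illustration of shift-style truncation (the `truncate` helper is not Ollama's code, and per the commit, truncation is not actually implemented yet):

```go
package main

import "fmt"

// truncate keeps only the most recent tokens that fit in the context
// window, mirroring how shifting discards the oldest context. Dropping the
// oldest tokens loses early context (quality) but avoids processing the
// full input (better time to first token).
func truncate(tokens []int, ctxLen int) []int {
	if len(tokens) <= ctxLen {
		return tokens
	}
	return tokens[len(tokens)-ctxLen:]
}

func main() {
	fmt.Println(truncate([]int{1, 2, 3, 4, 5}, 3))
}
```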
-
jake83741 authored
-
frob authored
-
- 25 Nov, 2024 4 commits
-
-
Blake Mizerany authored
This changes makeRequest to update the http client Transport if and only if testMakeRequestDialContext is set. This avoids overriding the default Transport when testMakeRequestDialContext is nil, which broke existing behavior, including proxies, timeouts, and other defaults. Fixes #7829 Fixes #7788
-
Shikhar Bakhda authored
-
Bruce MacDonald authored
After a user pushes their model, it is not clear what to do next. Add a link to the output of `ollama push` that tells the user where their model can now be found.
-
Simon Schampijer authored
- better formatting of input prompt
- use invoke instead of predict
-
- 24 Nov, 2024 4 commits
-
-
reid41 authored
-
frob authored
-
Adarsh Mishra authored
-
Patcher authored
-
- 23 Nov, 2024 1 commit
-
-
Meng Zhuo authored
-