- 27 Nov, 2024 2 commits
-
-
ItzCrazyKns authored
Closes #7627
-
Bruce MacDonald authored
The writeError function takes a code argument that is no longer used. Remove it for clarity.
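A minimal before/after sketch of dropping the unused parameter; writeError is the function named in the commit, but the body and surrounding plumbing here are assumptions, not the server's actual code:

```go
package main

import (
	"fmt"
	"net/http"
)

// Before: func writeError(w http.ResponseWriter, code int, err error)
// After: the unused code parameter is dropped, so the signature only
// keeps what the function actually needs.
func writeError(w http.ResponseWriter, err error) {
	w.WriteHeader(http.StatusInternalServerError)
	fmt.Fprintf(w, `{"error": %q}`, err.Error())
}

func main() {
	http.HandleFunc("/", func(w http.ResponseWriter, r *http.Request) {
		writeError(w, fmt.Errorf("something went wrong"))
	})
}
```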
-
- 26 Nov, 2024 4 commits
-
-
Jesse Gross authored
When processing a prompt, we look for image tags of the form [img-0], which are inserted by the Ollama server process. However, this can cause errors if the original prompt already contains these tags - typically an image not found error is returned. This changes the tag searching behavior to be similar to the 0.3.x series, which largely avoids these problems. However, they can still happen when input text containing these tags is used with image models. The correct solution is to escape the tags, but that is a larger issue with special sequences in general, so this is an incremental fix that should avoid the problem in the majority of cases.
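A hypothetical sketch of the kind of marker scanning involved; the [img-N] pattern comes from the commit, while everything else is illustrative:

```go
package main

import (
	"fmt"
	"regexp"
)

// The server inserts markers like [img-0] into the prompt, and the
// runner later scans for those markers to interleave image data with
// text. User-supplied text containing the same pattern can still
// collide with this, which is the failure mode described above.
var imgTag = regexp.MustCompile(`\[img-(\d+)\]`)

func main() {
	prompt := "Describe this picture: [img-0] and compare it to [img-1]."
	for _, m := range imgTag.FindAllStringSubmatch(prompt, -1) {
		fmt.Printf("found image reference %s (index %s)\n", m[0], m[1])
	}
}
```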
-
Jesse Gross authored
This also makes it easier to truncate long inputs the same way as shifting, but does not actually implement it. This type of truncation has a trade-off between quality and time to first token.
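Since the commit explicitly does not implement truncation, the following is only a sketch of one possible policy under an assumed context limit; the function name and approach are illustrative:

```go
package main

import "fmt"

// Keep the most recent tokens, the same region that cache shifting
// preserves. Dropping the oldest tokens trades prompt fidelity
// (quality) for a shorter prefill (faster time to first token).
func truncate(tokens []int, limit int) []int {
	if len(tokens) <= limit {
		return tokens
	}
	return tokens[len(tokens)-limit:]
}

func main() {
	fmt.Println(truncate([]int{1, 2, 3, 4, 5}, 3)) // [3 4 5]
}
```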
-
jake83741 authored
-
frob authored
-
- 25 Nov, 2024 4 commits
-
-
Blake Mizerany authored
This changes makeRequest to update the http client Transport if and only if testMakeRequestDialContext is set. This avoids overriding the default Transport when testMakeRequestDialContext is nil, which broke existing behavior, including proxies, timeouts, and other settings. Fixes #7829 Fixes #7788
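A sketch of the conditional described above: testMakeRequestDialContext mirrors the variable named in the commit, while the surrounding function is an assumption. The point is that the client's Transport is only replaced when the test hook is present, so the default Transport (and its proxy and timeout behavior) stays intact otherwise:

```go
package main

import (
	"context"
	"net"
	"net/http"
)

// Test-only dial hook; nil in production.
var testMakeRequestDialContext func(ctx context.Context, network, addr string) (net.Conn, error)

func newClient() *http.Client {
	client := &http.Client{}
	if testMakeRequestDialContext != nil {
		// Only override the Transport when a test dial hook is set.
		client.Transport = &http.Transport{DialContext: testMakeRequestDialContext}
	}
	return client
}

func main() { _ = newClient() }
```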
-
Shikhar Bakhda authored
-
Bruce MacDonald authored
After a user pushes their model, it is not clear what to do next. Add a link to the output of `ollama push` that tells the user where their model can now be found.
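Illustrative only: the URL format and variable name below are assumptions, but they show the shape of the hint added after a successful push:

```go
package main

import "fmt"

func main() {
	name := "username/mymodel" // hypothetical pushed model name
	fmt.Printf("\nYou can find your model at:\n\n\thttps://ollama.com/%s\n", name)
}
```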
-
Simon Schampijer authored
- Better formatting of the input prompt
- Use invoke instead of predict
-
- 24 Nov, 2024 4 commits
-
-
reid41 authored
-
frob authored
-
Adarsh Mishra authored
-
Patcher authored
-
- 23 Nov, 2024 5 commits
-
-
Meng Zhuo authored
-
josc146 authored
-
oza6ut0ne authored
-
Rodrigo Ribeiro Gomes authored
-
Jesse Gross authored
If there are no available slots for new sequences, then a request will not be added to the processing queue but will continue on to wait for a response that never comes. Besides never giving a response to the request, this prevents the model from being unloaded due to the outstanding request. To prevent this, there are semaphores that prevent more requests from being processed than there are slots - one in the Ollama server and one in the runner.
- The Ollama server one works, but it is not designed to protect the runner's internal data structures, and the runner can return a final response before clearing its data structures.
- The internal runner semaphore has similar behavior: it can release the semaphore when it issues a response. This is wrong - it should only release the semaphore after it has cleared the data structure.
In addition, we should return an error if a slot is not found rather than deadlocking in the event we ever get to this spot. Fixes #7779
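A sketch of the corrected slot accounting, using a buffered channel as a counting semaphore; the type and method names are illustrative, not the runner's actual structure:

```go
package main

import (
	"errors"
	"fmt"
)

type runner struct {
	slots chan struct{} // capacity == number of sequence slots
}

func (r *runner) process(req string) error {
	r.slots <- struct{}{} // acquire a slot before queueing the request
	defer func() {
		// Release only after internal data structures are cleared,
		// not when the final response is emitted.
		r.cleanup(req)
		<-r.slots
	}()

	slot, ok := r.findSlot()
	if !ok {
		// Return an error instead of deadlocking if no slot is found.
		return errors.New("no available slot for sequence")
	}
	fmt.Println("processing in slot", slot)
	return nil
}

func (r *runner) findSlot() (int, bool) { return 0, true } // placeholder
func (r *runner) cleanup(string)        {}                 // placeholder

func main() {
	r := &runner{slots: make(chan struct{}, 2)}
	_ = r.process("hello")
}
```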
-
- 22 Nov, 2024 8 commits
-
-
Bruce MacDonald authored
In the past, the ollama.com server would return a JWT that contained information about the user being authenticated. This was used to return different error messages to the user. This is no longer possible since the token used to authenticate no longer contains information about the user. Remove this code that no longer works. Follow-up changes will improve the error messages returned here, but it is good to clean up first.
-
Daniel Hiltgen authored
This had fallen out of sync with the envconfig behavior, where max queue default was not zero.
-
Daniel Hiltgen authored
Users get confused by "Failed to acquire semaphore" error="context canceled" messages in the logs, which are actually clients giving up. While there could be a legitimate hang bug in the system, sometimes this is just short client timeouts on an overloaded system, so this change should help users better understand what is going on.
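A sketch of the friendlier logging described above, assuming the failure surfaces as context.Canceled when the client disconnects; the exact message and surrounding code are illustrative:

```go
package main

import (
	"context"
	"errors"
	"log/slog"
)

// When semaphore acquisition fails because the client gave up, say so
// plainly instead of surfacing a bare "context canceled".
func logAcquireFailure(err error) {
	if errors.Is(err, context.Canceled) {
		slog.Info("aborting request: client closed the connection before a slot was available")
		return
	}
	slog.Error("failed to acquire semaphore", "error", err)
}

func main() {
	logAcquireFailure(context.Canceled)
}
```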
-
Daniel Hiltgen authored
This avoids emitting the progress indicators to stderr and the interactive prompts to the output file or pipe. Running `ollama run model > out.txt` now exits immediately, and `echo hello | ollama run model > out.txt` produces zero stderr output and a typical response in out.txt.
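A minimal sketch of this kind of stream separation, assuming terminal detection via golang.org/x/term; the control flow is illustrative, not the CLI's actual code:

```go
package main

import (
	"fmt"
	"os"

	"golang.org/x/term"
)

func main() {
	interactiveOut := term.IsTerminal(int(os.Stdout.Fd()))
	showProgress := term.IsTerminal(int(os.Stderr.Fd()))

	if showProgress {
		fmt.Fprintln(os.Stderr, "pulling model...") // progress goes to stderr
	}
	if !interactiveOut {
		// "ollama run model > out.txt": write the response and exit
		// immediately instead of entering the interactive prompt.
		fmt.Println("model response goes to stdout")
		return
	}
	fmt.Print(">>> ") // interactive prompt path
}
```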
-
Leon Sander authored
-
Mikel Olasagasti Uranga authored
Update uuid.New().String() to uuid.NewString().
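For reference, the one-line simplification this commit applies: uuid.NewString() in github.com/google/uuid is equivalent to uuid.New().String().

```go
package main

import (
	"fmt"

	"github.com/google/uuid"
)

func main() {
	before := uuid.New().String() // old form
	after := uuid.NewString()     // new, equivalent form
	fmt.Println(before, after)
}
```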
-
Dustin authored
-
Edwin.JH.Lee authored
-
- 21 Nov, 2024 13 commits
-
-
Elias authored
OrionChat is a free web-based chat interface that simplifies interactions with multiple AI model providers. It provides a unified platform for chatting with and exploring multiple large language models (LLMs).
-
湛露先生 authored
Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>
-
Jeffrey Morgan authored
-
R0CKSTAR authored
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
-
Paul Robello authored
-
毛巳煜 authored
-
xuyangbocn authored
-
emrgnt-cmplxty authored
-
Cyril Blaecke authored
-
Christian Tzolov authored
-
Philippe Charrière authored
Parakeet is a Golang SDK for Ollama. Co-authored-by: Parth Sareen <parth.sareen@ollama.com>
-
Marcin Szczygliński authored
-
Michael authored
-