Commits · c6bcdc4223c50071b59a19c42cc54ec9932f696f · OpenDAS / ollama

13 May, 2025 1 commit

Revert "remove cuda v11 (#10569)" (#10692) · c6bcdc42

Daniel Hiltgen authored May 13, 2025

Bring back v11 until we can better warn users that their driver
is too old.

This reverts commit fa393554.

c6bcdc42

12 May, 2025 1 commit

Follow up to #10363 (#10647) · 9d6df908

Daniel Hiltgen authored May 12, 2025

The quantization PR didn't block all unsupported file types,
which this PR fixes.  It also updates the API docs to reflect
the now reduced set of supported types.

9d6df908

08 May, 2025 1 commit
- api: remove unused sampling parameters (#10581) · fa9973cd
  Jeffrey Morgan authored May 08, 2025
  
  fa9973cd
07 May, 2025 1 commit

remove cuda v11 (#10569) · fa393554

Daniel Hiltgen authored May 06, 2025

This reduces the size of our Windows installer payloads by ~256M by dropping
support for nvidia drivers older than Feb 2023. Hardware support is unchanged.

Linux default bundle sizes are reduced by ~600M to 1G.

fa393554

05 May, 2025 1 commit

api: remove unused or unsupported api options (#10574) · 3b2d2c83

Jeffrey Morgan authored May 05, 2025

Some options listed in api/types.go are not supported in
newer models, or have been deprecated in the past. This is
the first of a series of PRs to clean up the API options

3b2d2c83

29 Apr, 2025 1 commit
- config: update default context length to 4096 · 44b466ee
  Devon Rifkin authored Apr 28, 2025
  
  44b466ee
28 Apr, 2025 1 commit
- Revert "increase default context length to 4096 (#10364)" · dd93e1af
  Devon Rifkin authored Apr 28, 2025
```
This reverts commit 424f6486.
```
  dd93e1af
22 Apr, 2025 1 commit

increase default context length to 4096 (#10364) · 424f6486

Devon Rifkin authored Apr 22, 2025

* increase default context length to 4096

We lower the default numParallel from 4 to 2 and use these "savings" to
double the default context length from 2048 to 4096.

We're memory neutral in cases when we previously would've used
numParallel == 4, but we add the following mitigation to handle some
cases where we would have previously fallen back to 1x2048 due to low
VRAM: we decide between 2048 and 4096 using a runtime check, choosing
2048 if we're on a one GPU system with total VRAM of <= 4 GB. We
purposefully don't check the available VRAM because we don't want the
context window size to change unexpectedly based on the available VRAM.

We plan on making the default even larger, but this is a relatively
low-risk change we can make to quickly double it.

* fix tests

add an explicit context length so they don't get truncated. The code
that converts -1 from being a signal for doing a runtime check isn't
running as part of these tests.

* tweak small gpu message

* clarify context length default

also make it actually show up in `ollama serve --help`

424f6486

15 Apr, 2025 2 commits

docs: change more template blocks to have syntax highlighting · 637fd212

Devon Rifkin authored Apr 15, 2025

In #8215 syntax highlighting was added to most of the blocks, but there were a couple that were still being rendered as plaintext

637fd212

docs: update some response code blocks to json5 · 378d3210

Devon Rifkin authored Apr 14, 2025

This is to prevent rendering bright red comments indicating invalid JSON when the comments are just supposed to be explanatory

378d3210

08 Apr, 2025 1 commit

cleanup: remove OLLAMA_TMPDIR and references to temporary executables (#10182) · ccc8c677

frob authored Apr 09, 2025



* cleanup: remove OLLAMA_TMPDIR
* cleanup: ollama doesn't use temporary executables anymore

---------
Co-authored-by: Richard Lyons <frob@cloudstaff.com>

ccc8c677

01 Apr, 2025 1 commit

api: return model capabilities from the show endpoint (#10066) · e172f095

Bruce MacDonald authored Apr 01, 2025

With support for multimodal models becoming more varied and common it is important for clients to be able to easily see what capabilities a model has. Retuning these from the show endpoint will allow clients to easily see what a model can do.

e172f095

27 Mar, 2025 1 commit
- docs: make context length faq readable (#10006) · b816ff86
  Parth Sareen authored Mar 26, 2025
  
  b816ff86
25 Mar, 2025 1 commit
- docs: add flags to example linux log output command (#9852) · 5e0b904e
  copeland3300 authored Mar 25, 2025
  
  5e0b904e
21 Mar, 2025 2 commits
- benchmark: performance of running ollama server (#8643) · fb6252d7
  Bruce MacDonald authored Mar 21, 2025
  
  fb6252d7
- docs: update final response for /api/chat stream (#9919) · d14ce75b
  Parth Sareen authored Mar 21, 2025
  
  d14ce75b
13 Mar, 2025 1 commit
- docs: Add OLLAMA_ORIGINS for browser extension support (#9643) · 74b44fdf
  Bradley Erickson authored Mar 13, 2025
  
  74b44fdf
10 Mar, 2025 1 commit
- docs: Add OLLAMA_CONTEXT_LENGTH to FAQ. (#9545) · d8a5d96b
  frob authored Mar 10, 2025
  
  d8a5d96b
07 Mar, 2025 1 commit

Better WantedBy declaration · 25248f4b

‮rekcäH nitraM‮ authored Mar 07, 2025

The problem with default.target is that it always points to the target that is currently started. So if you boot into single user mode or the rescue mode still Ollama tries to start.

I noticed this because either tried (and failed) to start all the time during a system update, where Ollama definitely is not wanted.

25248f4b

05 Mar, 2025 1 commit

Win: doc new rocm zip file (#9367) · cae5d4d4

Daniel Hiltgen authored Mar 05, 2025

To stay under the 2G github artifact limit, we're splitting ROCm
out like we do on linux.

cae5d4d4

04 Mar, 2025 1 commit

server/.../backoff,syncs: don't break builds without synctest (#9484) · 55ab9f37

Blake Mizerany authored Mar 03, 2025

Previously, developers without the synctest experiment enabled would see
build failures when running tests in some server/internal/internal
packages using the synctest package. This change makes the transition to
use of the package less painful but guards the use of the synctest
package with build tags.

synctest is enabled in CI. If a new change will break a synctest
package, it will break in CI, even if it does not break locally.

The developer docs have been updated to help with any confusion about
why package tests pass locally but fail in CI.

55ab9f37

27 Feb, 2025 1 commit

Windows ARM build (#9120) · 688925ac

Daniel Hiltgen authored Feb 27, 2025

* Windows ARM build

Skip cmake, and note it's unused in the developer docs.

* Win: only check for ninja when we need it

On windows ARM, the cim lookup fails, but we don't need ninja anyway.

688925ac

25 Feb, 2025 2 commits
- docs: rocm install link (#9346) · 88885567
  Chuanhui Liu authored Feb 25, 2025
  
  88885567
- Move cgroups fix out of AMD section. (#9072) · 4df98f3e
  frob authored Feb 25, 2025
```
Co-authored-by: Richard Lyons <frob@cloudstaff.com>
```
  4df98f3e
22 Feb, 2025 1 commit
- docs: add additional ROCm docs for building (#9066) · 7cfd4aee
  Jeffrey Morgan authored Feb 22, 2025
  
  7cfd4aee
15 Feb, 2025 1 commit
- docs: fix incorrect shortcut key in windows.md (#9098) · 0667badd
  James-William-Kincaid-III authored Feb 15, 2025
  
  0667badd
13 Feb, 2025 1 commit
- docs: add H200 as supported device. (#9076) · 3a4449e2
  frob authored Feb 13, 2025
```
Co-authored-by: Richard Lyons <frob@cloudstaff.com>
```
  3a4449e2
08 Feb, 2025 1 commit
- docs: link directly to latest release page for tdm-gcc (#8939) · b86c0a15
  Jeffrey Morgan authored Feb 08, 2025
  
  b86c0a15
07 Feb, 2025 2 commits
- docs: improve syntax highlighting in code blocks (#8854) · b901a712
  Azis Alvriyanto authored Feb 08, 2025
  
  b901a712
- docs: include port in faq.md OLLAMA_HOST examples (#8905) · a400df48
  Leisure Linux authored Feb 07, 2025
  
  a400df48
06 Feb, 2025 1 commit
- docs: add step for removing libraries in linux.md (#8897) · 78140197
  Abhinav Pant authored Feb 07, 2025
  
  78140197
05 Feb, 2025 1 commit
- docs: add section in development.md on library detection (#8855) · f00d359a
  Jeffrey Morgan authored Feb 05, 2025
  
  f00d359a
03 Feb, 2025 1 commit
- docs: use OLLAMA_VERSION=0.5.7 for install version override (#8802) · bfdeffc3
  Melroy van den Berg authored Feb 03, 2025
  
  bfdeffc3
02 Feb, 2025 1 commit
- docs: add missing json and shell code blocks in api.md (#8766) · ad22ace4
  Davide Bertoni authored Feb 02, 2025
  
  ad22ace4
29 Jan, 2025 2 commits

docs: update api.md with streaming with tools is enabled (#8676) · 711648c9
Parth Sareen authored Jan 29, 2025

711648c9

next build (#8539) · dcfb7a10

Michael Yang authored Jan 29, 2025



* add build to .dockerignore

* test: only build one arch

* add build to .gitignore

* fix ccache path

* filter amdgpu targets

* only filter if autodetecting

* Don't clobber gpu list for default runner

This ensures the GPU specific environment variables are set properly

* explicitly set CXX compiler for HIP

* Update build_windows.ps1

This isn't complete, but is close.  Dependencies are missing, and it only builds the "default" preset.

* build: add ollama subdir

* add .git to .dockerignore

* docs: update development.md

* update build_darwin.sh

* remove unused scripts

* llm: add cwd and build/lib/ollama to library paths

* default DYLD_LIBRARY_PATH to LD_LIBRARY_PATH in runner on macOS

* add additional cmake output vars for msvc

* interim edits to make server detection logic work with dll directories like lib/ollama/cuda_v12

* remove unncessary filepath.Dir, cleanup

* add hardware-specific directory to path

* use absolute server path

* build: linux arm

* cmake install targets

* remove unused files

* ml: visit each library path once

* build: skip cpu variants on arm

* build: install cpu targets

* build: fix workflow

* shorter names

* fix rocblas install

* docs: clean up development.md

* consistent build dir removal in development.md

* silence -Wimplicit-function-declaration build warnings in ggml-cpu

* update readme

* update development readme

* llm: update library lookup logic now that there is one runner (#8587)

* tweak development.md

* update docs

* add windows cuda/rocm tests

---------
Co-authored-by: jmorganca <jmorganca@gmail.com>
Co-authored-by: Daniel Hiltgen <daniel@ollama.com>

dcfb7a10

23 Jan, 2025 1 commit
- docs: remove reference to the deleted examples folder (#8524) · ca2f9843
  Daniel Jalkut authored Jan 23, 2025
  
  ca2f9843
21 Jan, 2025 1 commit
- docs: remove tfs_z option from documentation (#8515) · 294b6f5a
  frob authored Jan 21, 2025
  
  294b6f5a
20 Jan, 2025 1 commit
- docs: update suspend header in gpu.md (#8487) · 7bb356c6
  EndoTheDev authored Jan 20, 2025
  
  7bb356c6
15 Jan, 2025 1 commit
- docs: fix path to examples (#8438) · a041b4df
  Gloryjaw authored Jan 16, 2025
  
  a041b4df