Commits · d8def1ff9432ef60d1067e5e6dde0d700dd95021 · OpenDAS / ollama

07 Jul, 2024 4 commits
- llm: allow gemma 2 to context shift (#5534) · d8def1ff
  Jeffrey Morgan authored Jul 07, 2024
  
  d8def1ff
- Update llama.cpp submodule to `a8db2a9c` (#5530) · 571dc619
  Jeffrey Morgan authored Jul 07, 2024
  
  571dc619
- llm: print caching notices in debug only (#5533) · 0e09c380
  Jeffrey Morgan authored Jul 07, 2024
  
  0e09c380
- sched: don't error if paging to disk on Windows and macOS (#5523) · 0ee87615
  Jeffrey Morgan authored Jul 06, 2024
  
  0ee87615
06 Jul, 2024 10 commits
- gpu: report system free memory instead of 0 (#5521) · f8241bfb
  Jeffrey Morgan authored Jul 06, 2024
  
  f8241bfb
- llm: add `-DBUILD_SHARED_LIBS=off` to common cpu cmake flags (#5520) · 4607c706
  Jeffrey Morgan authored Jul 06, 2024
  
  4607c706
- release: move mingw library cleanup to correct job · c12f1c5b
  jmorganca authored Jul 06, 2024
  
  c12f1c5b
- release: remove unwanted mingw dll.a files · a08f20d9
  jmorganca authored Jul 06, 2024
  
  a08f20d9
- Revert "llm: only statically link libstdc++" · 6cea0360
  jmorganca authored Jul 06, 2024
```
This reverts commit 5796bfc4.
```
  6cea0360
- llm: only statically link libstdc++ · 5796bfc4
  jmorganca authored Jul 06, 2024
  
  5796bfc4
- llm: statically link pthread and stdc++ dependencies in windows build · f1a379aa
  jmorganca authored Jul 06, 2024
  
  f1a379aa
- llm: add `GGML_STATIC` flag to windows static lib · 9ae14699
  jmorganca authored Jul 06, 2024
  
  9ae14699
- llm: add `COMMON_DARWIN_DEFS` to arm static build (#5513) · e0348d3f
  Jeffrey Morgan authored Jul 05, 2024
  
  e0348d3f
- llm: fix missing dylibs by restoring old build behavior on Linux and macOS (#5511) · 2cc854f8
  Jeffrey Morgan authored Jul 05, 2024
```
* Revert "fix cmake build (#5505)"

This reverts commit 4fd5f352.

* llm: fix missing dylibs by restoring old build behavior

* crlf -> lf
```
  2cc854f8
05 Jul, 2024 12 commits
- llm: put back old include dir (#5507) · 5304b765
  Jeffrey Morgan authored Jul 05, 2024
```
* llm: put back old include dir

* llm: update link paths for old submodule commits
```
  5304b765
- fix cmake build (#5505) · 4fd5f352
  Jeffrey Morgan authored Jul 05, 2024
  
  4fd5f352
- Merge pull request #5502 from dhiltgen/ci_fixes · 842f85f7
  Daniel Hiltgen authored Jul 05, 2024
```
Always go build in CI generate steps
```
  842f85f7
- Always go build in CI generate steps · 9d30f9f8
  Daniel Hiltgen authored Jul 05, 2024
```
With the recent cgo changes, bugs can sneak through
if we don't make sure to `go build` all the permutations
```
  9d30f9f8
- types/model: remove knowledge of digest (#5500) · 631cfd9e
  Blake Mizerany authored Jul 05, 2024
```
This was leading to ambiguity and confusion in ollama.com, and is not
used anywhere in ollama at the moment. Once manifests are addressable by
digest, we can add this back in, and in a way that is more tailored to
the concept of addressing a manifest by digest.
```
  631cfd9e
- fix typo in cgo directives in `llm.go` (#5501) · 78fb33dd
  Jeffrey Morgan authored Jul 05, 2024
  
  78fb33dd
- update llama.cpp submodule to `d7fd29f` (#5475) · 8f8e736b
  Jeffrey Morgan authored Jul 05, 2024
  
  8f8e736b
- Use slot with cached prompt instead of least recently used (#5492) · d89454de
  Jeffrey Morgan authored Jul 05, 2024
```
* Use common prefix to select slot

* actually report `longest`
```
  d89454de
- Merge pull request #5469 from dhiltgen/prevent_system_oom · af28b945
  Daniel Hiltgen authored Jul 05, 2024
```
Prevent loading models larger than total memory
```
  af28b945
- Fix assert on small embedding inputs (#5491) · e9188e97
  Jeffrey Morgan authored Jul 05, 2024
```
* Fix assert on small embedding inputs

* Update llm/patches/09-pooling.diff
```
  e9188e97
- Merge pull request #4412 from dhiltgen/win_docs · 78eddfc0
  Daniel Hiltgen authored Jul 05, 2024
```
Document older win10 terminal problems
```
  78eddfc0
- Merge pull request #5466 from dhiltgen/fix_clip_unicode · 02c24d3d
  Daniel Hiltgen authored Jul 05, 2024
```
Fix clip model loading with unicode paths
```
  02c24d3d
04 Jul, 2024 2 commits
- Document older win10 terminal problems · 52abc8ac
  Daniel Hiltgen authored May 13, 2024
```
We haven't found a workaround, so for now recommend updating.
```
  52abc8ac
- fix error detection by limiting model loading error parsing (#5472) · 4d71c559
  Jeffrey Morgan authored Jul 03, 2024
  
  4d71c559
03 Jul, 2024 11 commits

fix: use `envconfig.ModelsDir` directly (#4821) · 0d16eb31

Anatoli Babenia authored Jul 04, 2024



* Co-authored-by: Anatoli Babenia <anatoli@rainforce.org>
Co-authored-by: Maas Lalani <maas@lalani.dev>

0d16eb31

Merge pull request #5447 from dhiltgen/fix_keepalive · 8072e205
Daniel Hiltgen authored Jul 03, 2024
```
Only set default keep_alive on initial model load
```
8072e205

Only set default keep_alive on initial model load · 955f2a4e

Daniel Hiltgen authored Jul 02, 2024

This change fixes the handling of keep_alive so that if client
request omits the setting, we only set this on initial load.  Once
the model is loaded, if new requests leave this unset, we'll keep
whatever keep_alive was there.

955f2a4e

Prevent loading models larger than total memory · 3c75113e

Daniel Hiltgen authored Jul 03, 2024

Users may not realize the siny new model they're trying to load
fits on their disk, but can't load into system+GPU memory.  Today
we crash, but with this fix, we'll give them a better error message
before even trying to load it.

3c75113e

Merge pull request #5243 from dhiltgen/modelfile_use_mmap · ccd77858
Daniel Hiltgen authored Jul 03, 2024
```
Fix use_mmap for modefiles
```
ccd77858

Return Correct Prompt Eval Count Regardless of Cache Prompt (#5371) · 3b5a4a77

royjhan authored Jul 03, 2024

* openai compatibility

* Revert "openai compatibility"

This reverts commit d3f98a811e00fc497d889c8c45b0cfec5b64690c.

* remove erroneous subtraction of prompt cache

3b5a4a77

Merge pull request #5467 from dhiltgen/bogus_cpu_mac_error · daed0634
Daniel Hiltgen authored Jul 03, 2024
```
Fix corner cases on tmp cleaner on mac
```
daed0634
Merge pull request #5465 from dhiltgen/better_cuda_logging · 0d4dd707
Daniel Hiltgen authored Jul 03, 2024
```
Better nvidia GPU discovery logging
```
0d4dd707

Fix corner cases on tmp cleaner on mac · 0e982bc1

Daniel Hiltgen authored Jul 03, 2024

When ollama is running a long time, tmp cleaners can remove the
runners. This tightens up a few corner cases on arm macs where
we failed with "server cpu not listed in available servers map[]"

0e982bc1

Fix clip model loading with unicode paths · 6298f498

Daniel Hiltgen authored Jul 03, 2024

On windows, if the model dir contained unicode characters
clip models would fail to load.  This fixes the file name
handling in clip.cpp to support utf16 on windows.

6298f498

Better nvidia GPU discovery logging · ef757da2

Daniel Hiltgen authored Jul 03, 2024

Refine the way we log GPU discovery to improve the non-debug
output, and report more actionable log messages when possible
to help users troubleshoot on their own.

ef757da2

02 Jul, 2024 1 commit
- Merge pull request #5448 from ollama/mxyng/fix-generate · e5352297
  Michael Yang authored Jul 02, 2024
```
use model template by default
```
  e5352297