Commits · ad0c19dde403ba67aa27247775e33c33c30ee235 · OpenDAS / ollama

07 Aug, 2024 5 commits

Use llama3.1 in tools example (#5985) · ad0c19dd
Kyle Kelley authored Aug 07, 2024
```
* Use llama3.1 in tools example

* Update api.md
```
ad0c19dd
Merge pull request #6145 from ollama/jessegross/bug5840 · 69eb06c4
Jesse Gross authored Aug 07, 2024
```
Fix crash on startup when trying to clean up unused files (#5840)
```
69eb06c4

manifest: Fix crash on startup when trying to clean up unused files (#5840) · 1829fb61

Jesse Gross authored Aug 05, 2024

Currently if the config field is missing in the manifest file (or
corrupted), Ollama will crash when it tries to read it. This can
happen at startup or when pulling new models.

This data is mostly just used for showing model information so we
can be tolerant of it not being present - it is not required to
run the models. Besides avoiding crashing, this also gives us the
ability to restructure the config in the future by pulling it
into the main manifest file.

1829fb61

manifest: Don't prune layers if we can't open a manifest file · 685a5353

Jesse Gross authored Aug 01, 2024

If there is an error when opening a manifest file (corrupted, permission denied, etc.)
then the referenced layers will not be included in the list of active
layers. This causes them to be deleted when pruning happens at startup
or a model is pulled.

In such a situation, we should prefer to preserve data in the hopes that
it can be recovered rather than being agressive about deletion.

685a5353

llm: reserve required number of slots for embeddings (#6219) · de4fc297
Jeffrey Morgan authored Aug 06, 2024

de4fc297

06 Aug, 2024 4 commits
- update llama.cpp submodule to `1e6f6554` (#6208) · e04c7012
  Jeffrey Morgan authored Aug 06, 2024
  
  e04c7012
- Fixed invalid option provided not displaying the invalid option name problem. (#6202) · d4a7216c
  Chua Chee Seng authored Aug 07, 2024
  
  d4a7216c
- Merge pull request #6207 from dhiltgen/sparse_win · a4fdd03c
  Daniel Hiltgen authored Aug 06, 2024
```
Ensure sparse files on windows during download
```
  a4fdd03c
- Ensure sparse files on windows during download · fc85f50a
  Daniel Hiltgen authored Aug 06, 2024
```
The file.Truncate call on windows will write the whole file
unless you set the sparse flag, leading to heavy I/O at the
beginning of download.  This should improve our
I/O behavior on windows and put less stress on the users disk.
```
  fc85f50a
05 Aug, 2024 9 commits
- sort batch results (#6189) · 86b907f8
  royjhan authored Aug 05, 2024
  
  86b907f8
- Merge pull request #6190 from ollama/mxyng/fix-integration · 10d49bce
  Michael Yang authored Aug 05, 2024
```
fix concurrency test
```
  10d49bce
- fix concurrency test · 7ed36741
  Michael Yang authored Aug 05, 2024
  
  7ed36741
- Merge pull request #6186 from dhiltgen/numa · 50ee8b5f
  Daniel Hiltgen authored Aug 05, 2024
```
Implement linux NUMA detection
```
  50ee8b5f
- Merge pull request #6146 from ollama/mxyng/testing · 03bdac05
  Michael Yang authored Aug 05, 2024
```
use testing tempdirs
```
  03bdac05
- Implement linux NUMA detection · f457d634
  Daniel Hiltgen authored Aug 05, 2024
```
If the system has multiple numa nodes, enable numa support in llama.cpp
If we detect numactl in the path, use that, else use the basic "distribute" mode.
```
  f457d634
- Merge pull request #6167 from ollama/mxyng/line-feed · 39f2bc6b
  Michael Yang authored Aug 05, 2024
```
line feed
```
  39f2bc6b
- Disable paging for journalctl (#6154) · b73b0940
  frob authored Aug 05, 2024
```
Users using `journalctl` to get logs for issue logging sometimes don't realize that paging is causing information to be missed.
```
  b73b0940
- line feed · 6a073447
  Michael Yang authored Aug 04, 2024
  
  6a073447
04 Aug, 2024 1 commit
- Add Gemma 2 2b (#6151) · 8b920f35
  sryu1 authored Aug 05, 2024
  
  8b920f35
03 Aug, 2024 1 commit
- Reference ollama integration with Harbor (#6147) · 4221e398
  Ivan Charapanau authored Aug 03, 2024
  
  4221e398
02 Aug, 2024 5 commits
- use testing tempdirs · a091fadf
  Michael Yang authored Aug 02, 2024
  
  a091fadf
- Merge pull request #6128 from ollama/mxyng/lint · 77ccbf04
  Michael Yang authored Aug 02, 2024
```
enable gofmt/gofumpt/goimports/tenv
```
  77ccbf04
- Update OpenAI Compatibility Docs with /v1/completions (#5311) · 4addf6b5
  royjhan authored Aug 02, 2024
```
* Update docs

* token bug corrected

* Update docs/openai.md

* Update docs/openai.md

* add suffix

* merge conflicts

* merge conflicts
```
  4addf6b5
- Update docs (#5310) · 85c7f111
  royjhan authored Aug 02, 2024
  
  85c7f111
- lint · b732beba
  Michael Yang authored Aug 01, 2024
  
  b732beba
01 Aug, 2024 13 commits
- Fix models/{model} URL (#6132) · ce1fb444
  Kim Hallberg authored Aug 02, 2024
  
  ce1fb444
- Update OpenAI Compatibility Docs with /v1/embeddings (#5470) · 558a54b0
  royjhan authored Aug 01, 2024
```
* docs without usage

* no usage

* rm metric note
```
  558a54b0
- Add to docs (#5309) · ed52833b
  royjhan authored Aug 01, 2024
  
  ed52833b
- OpenAI: Add Usage to `v1/embeddings` (#5886) · 6f133a0b
  royjhan authored Aug 01, 2024
```
* add prompt tokens to embed response

* rm slog

* metrics

* types

* prompt n

* clean up

* reset submodule

* add tokens to v1/embeddings

* separate usage
```
  6f133a0b
- Update OpenAI Compatibility Docs with /v1/models (#5151) · f561eecf
  royjhan authored Aug 01, 2024
```
* OpenAI Docs

* Update docs/openai.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Remove newline

---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
```
  f561eecf
- Merge pull request #6115 from slouffka/fix-context · ff7c9060
  Michael Yang authored Aug 01, 2024
```
Fix context in /api/generate grows too much (#5980).
```
  ff7c9060
- Merge pull request #4756 from ollama/mxyng/convert2 · 0ff42e84
  Michael Yang authored Aug 01, 2024
```
refactor convert
```
  0ff42e84
- Refactor and format code. · 8a9f946c
  Vyacheslav Moskalev authored Aug 02, 2024
  
  8a9f946c
- Refactor code. Remove extra variable. · 3b521054
  Vyacheslav Moskalev authored Aug 01, 2024
  
  3b521054
- Better types and naming closer to style. · b0c21658
  Vyacheslav Moskalev authored Aug 01, 2024
  
  b0c21658
- Change the order of context and prompt. · 49a54831
  Vyacheslav Moskalev authored Aug 01, 2024
  
  49a54831
- Fix extra context concatenation in generate handler (#5980). · 6bc5c137
  Vyacheslav Moskalev authored Aug 01, 2024
  
  6bc5c137
- Merge pull request #6109 from ollama/mxyng/fix-modelfile · 3e614260
  Michael Yang authored Jul 31, 2024
```
fix modelfile message quotes
```
  3e614260
31 Jul, 2024 2 commits
- fix modelfile message quotes · d87b4a48
  Michael Yang authored Jul 31, 2024
  
  d87b4a48
- Merge pull request #6106 from ollama/mxyng/default-sliding-window-attention · 4c14855a
  Michael Yang authored Jul 31, 2024
```
patches: phi3 optional sliding window attention
```
  4c14855a