Commits · 5d6657835669064fa9658e6712b01887a072c606 · OpenDAS / ollama

31 Jul, 2024 2 commits
- Update README.md · 5d665783
  Jeffrey Morgan authored Jul 30, 2024
```
Better example for multi-modal input
```
  5d665783
- patch gemma support · afa8d6e9
  jmorganca authored Jul 30, 2024
  
  afa8d6e9
30 Jul, 2024 4 commits

Add Metrics to `api\embed` response (#5709) · 1b44d873

royjhan authored Jul 30, 2024

* add prompt tokens to embed response

* rm slog

* metrics

* types

* prompt n

* clean up

* reset submodule

* update tests

* test name

* list metrics

1b44d873

Merge pull request #5859 from dhiltgen/homogeneous_gpus · cef2c605
Daniel Hiltgen authored Jul 30, 2024
```
Prevent partial loading on mixed GPU brands
```
cef2c605

Prevent partial loading on mixed GPU brands · 34542099

Daniel Hiltgen authored Jul 22, 2024

In mult-brand GPU setups, if we couldn't fully load the model we
would fall through the scheduler and mistakenly try to load across
a mix of brands.  This makes sure we find the set of GPU(s) that
best fit for the partial load.

34542099

Update and Fix example models (#6065) · 0be8baad
Kim Hallberg authored Jul 30, 2024
```
* Update example models

* Remove unused README.md
```
0be8baad

29 Jul, 2024 11 commits
- Merge pull request #5895 from dhiltgen/sched_faq · 1a83581a
  Daniel Hiltgen authored Jul 29, 2024
```
Better explain multi-gpu behavior
```
  1a83581a
- Merge pull request #5927 from dhiltgen/high_cpu_count · 37926eb9
  Daniel Hiltgen authored Jul 29, 2024
```
Ensure amd gpu nodes are numerically sorted
```
  37926eb9
- Merge pull request #5934 from dhiltgen/missing_cuda_repo · 3d4634fd
  Daniel Hiltgen authored Jul 29, 2024
```
Report better error on cuda unsupported os/arch
```
  3d4634fd
- return tool calls finish reason for openai (#5995) · 365431d4
  royjhan authored Jul 29, 2024
```
* hot fix

* backend stream support

* clean up

* finish reason

* move to openai
```
  365431d4
- Merge pull request #5932 from dhiltgen/win_font · 161e12ce
  Daniel Hiltgen authored Jul 29, 2024
```
Explain font problems on windows 10
```
  161e12ce
- api: add stringifier for `Tool` (#5891) · 46e6327e
  Jeffrey Morgan authored Jul 29, 2024
  
  46e6327e
- update llama.cpp submodule to `6eeaeba1` (#6039) · 68ee42f9
  Jeffrey Morgan authored Jul 29, 2024
  
  68ee42f9
- docs: update README.md (#6059) · f26aef9a
  Ikko Eltociear Ashimine authored Jul 30, 2024
```
HuggingFace -> Hugging Face
```
  f26aef9a
- Merge pull request #5992 from ollama/mxyng/save · 38d9036b
  Michael Yang authored Jul 29, 2024
```
fix: model save
```
  38d9036b
- Fix typo in image docs (#6041) · 6f26e932
  Veit Heller authored Jul 29, 2024
  
  6f26e932
- upate to `llama3.1` elsewhere in repo (#6032) · 0e4d6536
  Jeffrey Morgan authored Jul 28, 2024
  
  0e4d6536
28 Jul, 2024 1 commit
- update readme to llama3.1 (#5933) · 2c016106
  Michael authored Jul 28, 2024
  
  2c016106
27 Jul, 2024 1 commit
- feat: add support for min_p (resolve #1142) (#1825) · f3d7a481
  Tibor Schmidt authored Jul 27, 2024
  
  f3d7a481
26 Jul, 2024 9 commits
- llm: keep patch for llama 3 rope factors (#5987) · f2a96c7d
  Jeffrey Morgan authored Jul 26, 2024
  
  f2a96c7d
- Merge pull request #5705 from dhiltgen/win_errormode · e8a66680
  Daniel Hiltgen authored Jul 26, 2024
```
Enable windows error dialog for subprocess
```
  e8a66680
- Merge pull request #5999 from ollama/mxyng/fix-push · 079b2c3b
  Michael Yang authored Jul 26, 2024
```
fix nil deref in auth.go
```
  079b2c3b
- server: fix race conditions during download (#5994) · 750c1c55
  Blake Mizerany authored Jul 26, 2024
```
This fixes various data races scattered throughout the download/pull
client where the client was accessing the download state concurrently.

This commit is mostly a hot-fix and will be replaced by a new client one
day soon.

Also, remove the unnecessary opts argument from downloadChunk.
```
  750c1c55
- fix nil deref in auth.go · a622c47b
  Michael Yang authored Jul 26, 2024
  
  a622c47b
- Merge pull request #5512 from ollama/mxyng/detect-stop · ec4c35fe
  Michael Yang authored Jul 26, 2024
```
autodetect stop parameters from template
```
  ec4c35fe
- fix: model save · 3d9de805
  Michael Yang authored Jul 26, 2024
```
stop parameter is saved as a slice which is incompatible with modelfile
parsing
```
  3d9de805
- Update api.md (#5968) · f5e39392
  Jeffrey Morgan authored Jul 25, 2024
  
  f5e39392
- Update openai.md · ae27d9dc
  Jeffrey Morgan authored Jul 25, 2024
  
  ae27d9dc
25 Jul, 2024 7 commits
- Merge pull request #5552 from ollama/mxyng/messages-docs · 37096790
  Michael Yang authored Jul 25, 2024
```
docs
```
  37096790
- Update docs/template.md · 997c9038
  Michael Yang authored Jul 25, 2024
```
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
```
  997c9038
- server: reuse original download URL for images (#5962) · c8af3c2d
  Blake Mizerany authored Jul 25, 2024
```
This changes the registry client to reuse the original download URL
it gets on the first redirect response for all subsequent requests,
preventing thundering herd issues when hot new LLMs are released.
```
  c8af3c2d
- Update openai.md · 455e6117
  Jeffrey Morgan authored Jul 25, 2024
  
  455e6117
- openai tools doc (#5617) · 4de1370a
  royjhan authored Jul 25, 2024
  
  4de1370a
- Revert "llm(llama): pass rope factors (#5924)" (#5963) · bbf8f102
  Jeffrey Morgan authored Jul 25, 2024
```
This reverts commit bb46bbcf.
```
  bbf8f102
- Report better error on cuda unsupported os/arch · ce3c93b0
  Daniel Hiltgen authored Jul 24, 2024
```
If we detect an NVIDIA GPU, but nvidia doesn't support the os/arch,
this will report a better error for the user and point them to docs
to self-install the drivers if possible.
```
  ce3c93b0
24 Jul, 2024 4 commits
- Explain font problems on windows 10 · 6c2129d5
  Daniel Hiltgen authored Jul 24, 2024
  
  6c2129d5
- Ensure amd gpu nodes are numerically sorted · 7c2a157c
  Daniel Hiltgen authored Jul 24, 2024
```
For systems that enumerate over 10 CPUs the default lexicographical
sort order interleaves CPUs and GPUs.
```
  7c2a157c
- llm(llama): pass rope factors (#5924) · bb46bbcf
  Michael Yang authored Jul 24, 2024
  
  bb46bbcf
- Fix Embed Test Flakes (#5893) · ac33aa7d
  royjhan authored Jul 24, 2024
```
* float cmp

* increase tolerance
```
  ac33aa7d
23 Jul, 2024 1 commit
- Better explain multi-gpu behavior · 830fdd27
  Daniel Hiltgen authored Jul 23, 2024
  
  830fdd27