- 02 Sep, 2024 1 commit
  - SnoopyTlion authored
- 01 Sep, 2024 2 commits
  - Vimal Kumar authored
  - rayfiyo authored
- 31 Aug, 2024 1 commit
  - Daniel Hiltgen authored: This caused missing internal files
- 30 Aug, 2024 3 commits
  - Michael Yang authored: remove any unneeded build artifacts
  - Michael Yang authored: passthrough OLLAMA_HOST path to client
  - Michael Yang authored: update templates to use messages
- 29 Aug, 2024 3 commits
  - Michael Yang authored
  - Bryan Honof authored
  - Patrick Devine authored
- 28 Aug, 2024 8 commits
  - Michael Yang authored: fix(test): do not clobber models directory
  - Michael Yang authored: fix: validate modelpath
  - Michael Yang authored
  - Patrick Devine authored
  - Michael Yang authored: detect chat template from configs that contain lists
  - Michael Yang authored
  - Michael Yang authored
  - Patrick Devine authored
- 27 Aug, 2024 12 commits
  - Daniel Hiltgen authored
  - Michael Yang authored
  - Michael Yang authored
  - Patrick Devine authored
  - Patrick Devine authored
  - Daniel Hiltgen authored
  - Sean Khatiri authored
  - Patrick Devine authored
  - Michael Yang authored
  - Michael Yang authored
  - Patrick Devine authored
  - Jeffrey Morgan authored
- 25 Aug, 2024 1 commit
  - Daniel Hiltgen authored: The numa flag may be having a performance impact on multi-socket systems with GPU loads
- 23 Aug, 2024 7 commits
  - Daniel Hiltgen authored: The recent cuda variant changes uncovered a bug in ByLibrary, which failed to group by common variant for GPU types.
  - Michael Yang authored: update faq
  - Michael Yang authored
  - Michael Yang authored
  - Patrick Devine authored
  - Daniel Hiltgen authored: During rebasing, the ordering was inverted, breaking the cuda version selection logic: the driver version was incorrectly evaluated as zero, causing a downgrade to v11.
  - Daniel Hiltgen authored: Define changed recently and this slipped through the cracks with the old name.
- 22 Aug, 2024 1 commit
  - Daniel Hiltgen authored:
    * Fix embeddings memory corruption: the patch was leading to a buffer overrun corruption. Once removed, though, parallelism in server.cpp led to hitting an assert due to slot/seq IDs being >= token count. To work around this, only use slot 0 for embeddings.
    * Fix embed integration test assumption: the token eval count has changed with recent llama.cpp bumps (0.3.5+).
- 21 Aug, 2024 1 commit
  - Michael Yang authored: convert: update llama conversion for llama3.1