Commits · 3bc28736cd3eec7c7fcc4981ebfef5c36e4bdd7d · OpenDAS / ollama

22 Jan, 2024 1 commit

Daniel Hiltgen authored Jan 22, 2024

This wires up logging in llama.cpp to always go to stderr, and also
turns up logging if OLLAMA_DEBUG is set.

730dcfcc
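
The behavior this commit describes (output always on stderr, more of it when `OLLAMA_DEBUG` is set) can be illustrated with a minimal Go sketch. It is not the code from the commit: the `level`, `logger`, and `levelFromEnv` names are hypothetical, and treating any non-empty value of `OLLAMA_DEBUG` as "set" is an assumption.

```go
package main

import (
	"fmt"
	"io"
	"os"
)

// level is a hypothetical verbosity scale used only in this sketch.
type level int

const (
	levelInfo level = iota
	levelDebug
)

// levelFromEnv returns levelDebug when OLLAMA_DEBUG is present and non-empty.
// Assumption: any non-empty value enables debug output.
// The lookup function is injected (os.LookupEnv in real use) to keep this testable.
func levelFromEnv(lookup func(string) (string, bool)) level {
	if v, ok := lookup("OLLAMA_DEBUG"); ok && v != "" {
		return levelDebug
	}
	return levelInfo
}

// logger writes every accepted message to a single writer (stderr in main).
type logger struct {
	out io.Writer
	max level
}

// logf drops messages that are more verbose than the configured maximum.
func (l logger) logf(lv level, format string, args ...any) {
	if lv > l.max {
		return
	}
	fmt.Fprintf(l.out, format+"\n", args...)
}

func main() {
	lg := logger{out: os.Stderr, max: levelFromEnv(os.LookupEnv)}
	lg.logf(levelInfo, "server starting")
	lg.logf(levelDebug, "verbose detail, emitted only when OLLAMA_DEBUG is set")
}
```

Injecting the lookup function rather than calling `os.LookupEnv` directly is a design choice for the sketch only; it lets the level selection be checked without touching the process environment.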

21 Jan, 2024 1 commit

Probe GPUs before backend init · ec376453

Daniel Hiltgen authored Jan 21, 2024

Detect potential error scenarios so we can fall back to CPU mode without
hitting asserts.

ec376453

17 Jan, 2024 1 commit
- Add multiple CPU variants for Intel Mac · 1b249748
  Daniel Hiltgen authored Jan 12, 2024
  This also refines the build process for the ext_server build.
  1b249748
14 Jan, 2024 1 commit
- Disable `mmap` with lora layers (#1985) · 557110d0
  Jeffrey Morgan authored Jan 13, 2024
  
  557110d0
11 Jan, 2024 1 commit

Support multiple variants for a given llm lib type · 8da7bef0

Daniel Hiltgen authored Jan 05, 2024

In some cases we may want multiple variants for a given GPU type or CPU.
This adds logic to have an optional Variant which we can use to select
an optimal library, but also allows us to try multiple variants in case
some fail to load.

This can be useful for scenarios such as ROCm v5 vs v6 incompatibility
or potentially CPU features.

8da7bef0

10 Jan, 2024 1 commit

Update submodule to `6efb8eb30e7025b168f3fda3ff83b9b386428ad6` (#1885) · 2c6e8f52

Jeffrey Morgan authored Jan 10, 2024

* update submodule to `6efb8eb30e7025b168f3fda3ff83b9b386428ad6`
* unblock condition variable in `update_slots` when closing server

2c6e8f52

07 Jan, 2024 1 commit
- add `-DCMAKE_SYSTEM_NAME=Darwin` cmake flag (#1832) · dbdd50b2
  Jeffrey Morgan authored Jan 07, 2024
  
  dbdd50b2
04 Jan, 2024 1 commit
- Code shuffle to clean up the llm dir · 77d96da9
  Daniel Hiltgen authored Jan 04, 2024
  
  77d96da9