Commits · 1b991d0ba961936ec8bb50c5b8dabdcd2f9aff25 · OpenDAS / ollama

19 Dec, 2023 13 commits

Refine build to support CPU only · 1b991d0b

Daniel Hiltgen authored Dec 13, 2023

If someone checks out the ollama repo and doesn't install the CUDA
library, this will ensure they can build a CPU only version

1b991d0b

Add automated test for multimodal · 51082535
Daniel Hiltgen authored Dec 13, 2023
```
A simple test case that verifies llava:7b can read text in an image
```
51082535
Bump llama.cpp to b1662 and set n_parallel=1 · 9adca7f7
Daniel Hiltgen authored Dec 14, 2023

9adca7f7

Build linux using ubuntu 20.04 · 89bbaafa

Daniel Hiltgen authored Dec 18, 2023

This changes the container-based linux build to use an older Ubuntu
distro to improve our compatibility matrix for older user machines

89bbaafa

Adapted rocm support to cgo based llama.cpp · 35934b2e
Daniel Hiltgen authored Nov 29, 2023

35934b2e

Use build tags to generate accelerated binaries for CUDA and ROCm on Linux. · f8ef4439

65a authored Oct 16, 2023

The build tags rocm or cuda must be specified to both go generate and go build.
ROCm builds should have both ROCM_PATH set (and the ROCM SDK present) as well
as CLBlast installed (for GGML) and CLBlast_DIR set in the environment to the
CLBlast cmake directory (likely /usr/lib/cmake/CLBlast). Build tags are also
used to switch VRAM detection between cuda and rocm implementations, using
added "accelerator_foo.go" files which contain architecture specific functions
and variables. accelerator_none is used when no tags are set, and a helper
function addRunner will ignore it if it is the chosen accelerator. Fix go
generate commands, thanks @deadmeu for testing.

f8ef4439

Add cgo implementation for llama.cpp · d4cd6957

Daniel Hiltgen authored Nov 13, 2023

Run the server.cpp directly inside the Go runtime via cgo
while retaining the LLM Go abstractions.

d4cd6957

Update images.go · 5e7fd690
Bruce MacDonald authored Dec 11, 2023

5e7fd690

deprecate ggml · 811b1f03

Bruce MacDonald authored Nov 24, 2023



- remove ggml runner
- automatically pull gguf models when ggml detected
- tell users to update to gguf in the case automatic pull fails
Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>

811b1f03

Merge pull request #1595 from pgibler/main · ed195f35
Matt Williams authored Dec 18, 2023
```
Added cmdh to community section in README
```
ed195f35
Merge pull request #1592 from jmorganca/mattw/examplepruning · e0d0072e
Matt Williams authored Dec 18, 2023
```
Lets get rid of these old modelfile examples
```
e0d0072e
Added cmdh to community section in README · 620a2ffc
pgibler authored Dec 18, 2023

620a2ffc
Lets get rid of these old modelfile examples · d287013f
Matt Williams authored Dec 18, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
d287013f

18 Dec, 2023 5 commits
- update runner submodule · 6b5bdfa6
  Jeffrey Morgan authored Dec 18, 2023
  
  6b5bdfa6
- update runner submodule to fix hipblas build · c063ee4a
  Jeffrey Morgan authored Dec 18, 2023
  
  c063ee4a
- send empty messages on last chat response (#1530) · d99fa6ce
  Bruce MacDonald authored Dec 18, 2023
  
  d99fa6ce
- add magic header for unit tests (#1558) · 3948c6ea
  Patrick Devine authored Dec 18, 2023
  
  3948c6ea
- update runner submodule · b85982eb
  Jeffrey Morgan authored Dec 18, 2023
  
  b85982eb
15 Dec, 2023 6 commits
- add API create/copy handlers (#1541) · 86b0dd4b
  Patrick Devine authored Dec 15, 2023
  
  86b0dd4b
- README with Enchanted iOS App (#1529) · f7287384
  Augustinas Malinauskas authored Dec 15, 2023
```
* feat(docs): README with Enchanted iOS app

* Update README.md

---------
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
```
  f7287384
- Added Bionic GPT as a front end. (#1463) · 115048a0
  Ian Purton authored Dec 15, 2023
```
* Added Bionic GPT as a front end.

* Update README.md

---------
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
```
  115048a0
- use exp slices for go 1.20 compatibility (#1544) · 1b417a78
  Bruce MacDonald authored Dec 15, 2023
  
  1b417a78
- add API tests for list handler (#1535) · 0174665d
  Patrick Devine authored Dec 14, 2023
  
  0174665d
- Add unit test of API routes (#1528) · 630518f0
  Patrick Devine authored Dec 14, 2023
  
  630518f0
14 Dec, 2023 2 commits

remove sample_count from docs (#1527) · 6e16098a
Bruce MacDonald authored Dec 14, 2023
```
this info has not been returned from these endpoints in some time
```
6e16098a

restore model load duration on generate response (#1524) · 6ee8c801

Bruce MacDonald authored Dec 14, 2023

* restore model load duration on generate response

- set model load duration on generate and chat done response
- calculate createAt time when response created

* remove checkpoints predict opts

* Update routes.go

6ee8c801

13 Dec, 2023 5 commits
- Update runner to support mixtral and mixture of experts (MoE) (#1475) · 31f0551d
  Jeffrey Morgan authored Dec 13, 2023
  
  31f0551d
- fix tests · 4a1abfe4
  Jeffrey Morgan authored Dec 13, 2023
  
  4a1abfe4
- add multimodal to `README.md` · bbd41494
  Jeffrey Morgan authored Dec 13, 2023
  
  bbd41494
- Docs for multimodal support (#1485) · fedba24a
  Jeffrey Morgan authored Dec 13, 2023
```
* add multimodal docs

* add chat api docs

* consistency between `/api/generate` and `/api/chat`

* simplify docs
```
  fedba24a
- Added message format for chat api (#1488) · e3b090db
  pepperoni21 authored Dec 13, 2023
  
  e3b090db
12 Dec, 2023 6 commits
- add image support to the chat api (#1490) · d9e60f63
  Patrick Devine authored Dec 12, 2023
  
  d9e60f63
- Merge pull request #1469 from jmorganca/mxyng/model-types · 4251b342
  Michael Yang authored Dec 12, 2023
```
remove per-model types
```
  4251b342
- Fix issues with `/set template` and `/set system` (#1486) · 0a9d3480
  Jeffrey Morgan authored Dec 12, 2023
  
  0a9d3480
- exponential back-off (#1484) · 3144e2a4
  Bruce MacDonald authored Dec 12, 2023
  
  3144e2a4
- retry on concurrent request failure (#1483) · c0960e29
  Bruce MacDonald authored Dec 12, 2023
```
- remove parallel
```
  c0960e29
- Fix Readme "Database -> MindsDB" link (#1479) · 5314fc9b
  ruecat authored Dec 12, 2023
  
  5314fc9b
11 Dec, 2023 3 commits
- Update README.md (#1412) · a36b5fef
  Jorge Torres authored Dec 11, 2023
  
  a36b5fef
- Multimodal support (#1216) · 910e9401
  Patrick Devine authored Dec 11, 2023
```
---------
Co-authored-by: Matt Apperson <mattapperson@Matts-MacBook-Pro.local>
```
  910e9401
- remove per-model types · 56ffc302
  Michael Yang authored Dec 08, 2023
```
mostly replaced by decoding tensors except ggml models which only
support llama
```
  56ffc302