Commits · fa24e73b8253a554ec840395a5d1dfdb91d3598b · OpenDAS / ollama

22 Dec, 2023 2 commits
- Remove CPU build, fixup linux build script · fa24e73b
  Daniel Hiltgen authored Dec 21, 2023
  
  fa24e73b
- Fix CPU performance on hyperthreaded systems · 325d7498
  Daniel Hiltgen authored Dec 21, 2023
```
The default thread count logic was broken and resulted in 2x the number
of threads as it should on a hyperthreading CPU
resulting in thrashing and poor performance.
```
  325d7498
21 Dec, 2023 1 commit

Daniel Hiltgen authored Dec 20, 2023

The windows native setup still needs some more work, but this gets it building
again and if you set the PATH properly, you can run the resulting exe on a cuda system.

d9cd3d96

20 Dec, 2023 1 commit

Revamp the dynamic library shim · 7555ea44

Daniel Hiltgen authored Dec 20, 2023

This switches the default llama.cpp to be CPU based, and builds the GPU variants
as dynamically loaded libraries which we can select at runtime.

This also bumps the ROCm library to version 6 given 5.7 builds don't work
on the latest ROCm library that just shipped.

7555ea44

19 Dec, 2023 18 commits
- Additional nvidial-ml path to check · 1d1eb168
  Daniel Hiltgen authored Dec 19, 2023
  
  1d1eb168
- Fix darwin intel build · 6558f94e
  Daniel Hiltgen authored Dec 19, 2023
  
  6558f94e
- Carry ggml-metal.metal as payload · 54dbfa4c
  Daniel Hiltgen authored Dec 18, 2023
  
  54dbfa4c
- Add WSL2 path to nvidia-ml.so library · 5646826a
  Daniel Hiltgen authored Dec 15, 2023
  
  5646826a
- Refine handling of shim presence · 3269535a
  Daniel Hiltgen authored Dec 15, 2023
```
This allows the CPU only builds to work on systems with Radeon cards
```
  3269535a
- Refine build to support CPU only · 1b991d0b
  Daniel Hiltgen authored Dec 13, 2023
```
If someone checks out the ollama repo and doesn't install the CUDA
library, this will ensure they can build a CPU only version
```
  1b991d0b
- Add automated test for multimodal · 51082535
  Daniel Hiltgen authored Dec 13, 2023
```
A simple test case that verifies llava:7b can read text in an image
```
  51082535
- Bump llama.cpp to b1662 and set n_parallel=1 · 9adca7f7
  Daniel Hiltgen authored Dec 14, 2023
  
  9adca7f7
- Build linux using ubuntu 20.04 · 89bbaafa
  Daniel Hiltgen authored Dec 18, 2023
```
This changes the container-based linux build to use an older Ubuntu
distro to improve our compatibility matrix for older user machines
```
  89bbaafa
- Adapted rocm support to cgo based llama.cpp · 35934b2e
  Daniel Hiltgen authored Nov 29, 2023
  
  35934b2e
- Use build tags to generate accelerated binaries for CUDA and ROCm on Linux. · f8ef4439
  65a authored Oct 16, 2023
```
The build tags rocm or cuda must be specified to both go generate and go build.
ROCm builds should have both ROCM_PATH set (and the ROCM SDK present) as well
as CLBlast installed (for GGML) and CLBlast_DIR set in the environment to the
CLBlast cmake directory (likely /usr/lib/cmake/CLBlast). Build tags are also
used to switch VRAM detection between cuda and rocm implementations, using
added "accelerator_foo.go" files which contain architecture specific functions
and variables. accelerator_none is used when no tags are set, and a helper
function addRunner will ignore it if it is the chosen accelerator. Fix go
generate commands, thanks @deadmeu for testing.
```
  f8ef4439
- Add cgo implementation for llama.cpp · d4cd6957
  Daniel Hiltgen authored Nov 13, 2023
```
Run the server.cpp directly inside the Go runtime via cgo
while retaining the LLM Go abstractions.
```
  d4cd6957
- Update images.go · 5e7fd690
  Bruce MacDonald authored Dec 11, 2023
  
  5e7fd690
- deprecate ggml · 811b1f03
  Bruce MacDonald authored Nov 24, 2023
```
- remove ggml runner
- automatically pull gguf models when ggml detected
- tell users to update to gguf in the case automatic pull fails
Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>
```
  811b1f03
- Merge pull request #1595 from pgibler/main · ed195f35
  Matt Williams authored Dec 18, 2023
```
Added cmdh to community section in README
```
  ed195f35
- Merge pull request #1592 from jmorganca/mattw/examplepruning · e0d0072e
  Matt Williams authored Dec 18, 2023
```
Lets get rid of these old modelfile examples
```
  e0d0072e
- Added cmdh to community section in README · 620a2ffc
  pgibler authored Dec 18, 2023
  
  620a2ffc
- Lets get rid of these old modelfile examples · d287013f
  Matt Williams authored Dec 18, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
  d287013f
18 Dec, 2023 5 commits
- update runner submodule · 6b5bdfa6
  Jeffrey Morgan authored Dec 18, 2023
  
  6b5bdfa6
- update runner submodule to fix hipblas build · c063ee4a
  Jeffrey Morgan authored Dec 18, 2023
  
  c063ee4a
- send empty messages on last chat response (#1530) · d99fa6ce
  Bruce MacDonald authored Dec 18, 2023
  
  d99fa6ce
- add magic header for unit tests (#1558) · 3948c6ea
  Patrick Devine authored Dec 18, 2023
  
  3948c6ea
- update runner submodule · b85982eb
  Jeffrey Morgan authored Dec 18, 2023
  
  b85982eb
15 Dec, 2023 6 commits
- add API create/copy handlers (#1541) · 86b0dd4b
  Patrick Devine authored Dec 15, 2023
  
  86b0dd4b
- README with Enchanted iOS App (#1529) · f7287384
  Augustinas Malinauskas authored Dec 15, 2023
```
* feat(docs): README with Enchanted iOS app

* Update README.md

---------
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
```
  f7287384
- Added Bionic GPT as a front end. (#1463) · 115048a0
  Ian Purton authored Dec 15, 2023
```
* Added Bionic GPT as a front end.

* Update README.md

---------
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
```
  115048a0
- use exp slices for go 1.20 compatibility (#1544) · 1b417a78
  Bruce MacDonald authored Dec 15, 2023
  
  1b417a78
- add API tests for list handler (#1535) · 0174665d
  Patrick Devine authored Dec 14, 2023
  
  0174665d
- Add unit test of API routes (#1528) · 630518f0
  Patrick Devine authored Dec 14, 2023
  
  630518f0
14 Dec, 2023 2 commits

remove sample_count from docs (#1527) · 6e16098a
Bruce MacDonald authored Dec 14, 2023
```
this info has not been returned from these endpoints in some time
```
6e16098a

restore model load duration on generate response (#1524) · 6ee8c801

Bruce MacDonald authored Dec 14, 2023

* restore model load duration on generate response

- set model load duration on generate and chat done response
- calculate createAt time when response created

* remove checkpoints predict opts

* Update routes.go

6ee8c801

13 Dec, 2023 5 commits
- Update runner to support mixtral and mixture of experts (MoE) (#1475) · 31f0551d
  Jeffrey Morgan authored Dec 13, 2023
  
  31f0551d
- fix tests · 4a1abfe4
  Jeffrey Morgan authored Dec 13, 2023
  
  4a1abfe4
- add multimodal to `README.md` · bbd41494
  Jeffrey Morgan authored Dec 13, 2023
  
  bbd41494
- Docs for multimodal support (#1485) · fedba24a
  Jeffrey Morgan authored Dec 13, 2023
```
* add multimodal docs

* add chat api docs

* consistency between `/api/generate` and `/api/chat`

* simplify docs
```
  fedba24a
- Added message format for chat api (#1488) · e3b090db
  pepperoni21 authored Dec 13, 2023
  
  e3b090db