- 22 Dec, 2023 4 commits
-
Daniel Hiltgen authored
Add cgo implementation for llama.cpp
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
The default thread count logic was broken: on a hyperthreaded CPU it produced twice as many threads as it should, resulting in thrashing and poor performance.
-
- 21 Dec, 2023 3 commits
-
Bruce MacDonald authored
-
Daniel Hiltgen authored
The Windows native setup still needs more work, but this gets it building again; if you set the PATH properly, you can run the resulting exe on a CUDA system.
-
Patrick Devine authored
-
- 20 Dec, 2023 2 commits
-
Daniel Hiltgen authored
This switches the default llama.cpp build to be CPU-based, and builds the GPU variants as dynamically loaded libraries that we can select at runtime. It also bumps the ROCm library to version 6, since 5.7 builds don't work against the latest ROCm release that just shipped.
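The runtime-selection idea can be sketched as follows. All names and fields here are assumptions for illustration; the real implementation loads shared libraries, whereas this sketch only shows the fallback logic of preferring an available GPU variant and defaulting to CPU.

```go
package main

import "fmt"

// runner describes one llama.cpp variant. In the real system each
// variant is a dynamically loaded library; here it is just a name
// plus the GPU stack it requires (empty means CPU, always usable).
type runner struct {
	name     string
	requires string
}

// pick returns the first runner whose requirement is satisfied.
// GPU variants are listed first so they win when available, with
// the CPU build acting as the universal fallback.
func pick(available map[string]bool, runners []runner) runner {
	for _, r := range runners {
		if r.requires == "" || available[r.requires] {
			return r
		}
	}
	return runners[len(runners)-1]
}

func main() {
	runners := []runner{
		{name: "cuda", requires: "cuda"},
		{name: "rocm_v6", requires: "rocm"},
		{name: "cpu"}, // default: CPU-based llama.cpp
	}
	// Only ROCm is detected on this hypothetical machine.
	fmt.Println(pick(map[string]bool{"rocm": true}, runners).name)
}
```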
-
Jeffrey Morgan authored
-
- 19 Dec, 2023 23 commits
-
Daniel Hiltgen authored
-
Michael Yang authored
fix(test): use real version string for comparison
-
Michael Yang authored
-
Daniel Hiltgen authored
-
Erick Ghaumez authored
* Add Langchain Dart
* Update README.md

Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
This allows CPU-only builds to work on systems with Radeon cards.
-
Daniel Hiltgen authored
If someone checks out the ollama repo and doesn't install the CUDA library, this will ensure they can still build a CPU-only version.
-
Daniel Hiltgen authored
A simple test case that verifies llava:7b can read text in an image
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
This changes the container-based Linux build to use an older Ubuntu distro to improve our compatibility matrix for older user machines.
-
Daniel Hiltgen authored
-
65a authored
The build tags rocm or cuda must be specified to both go generate and go build. ROCm builds should have ROCM_PATH set (and the ROCm SDK present), as well as CLBlast installed (for GGML) and CLBlast_DIR set in the environment to the CLBlast cmake directory (likely /usr/lib/cmake/CLBlast).

Build tags are also used to switch VRAM detection between the cuda and rocm implementations, using added "accelerator_foo.go" files which contain architecture-specific functions and variables. accelerator_none is used when no tags are set, and a helper function addRunner will ignore it if it is the chosen accelerator.

Fix go generate commands; thanks @deadmeu for testing.
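The per-accelerator file scheme can be sketched with Go build constraints. The file name, constant, and function below are assumptions for illustration; the point is that each accelerator_*.go file carries a build tag and supplies the same symbols, so exactly one variant is compiled in.

```go
// accelerator_none.go: a hypothetical sketch of the pattern described.
// This file is compiled only when neither the rocm nor the cuda build
// tag is set; accelerator_rocm.go and accelerator_cuda.go (not shown)
// would carry the opposite constraints and real VRAM probes.

//go:build !rocm && !cuda

package main

import "fmt"

// acceleratorName identifies the active backend; "none" means CPU only.
const acceleratorName = "none"

// detectVRAM reports available GPU memory in bytes. The CPU-only
// variant has no GPU to probe, so it returns an error.
func detectVRAM() (uint64, error) {
	return 0, fmt.Errorf("no accelerator compiled in")
}

func main() {
	vram, err := detectVRAM()
	fmt.Println(acceleratorName, vram, err != nil)
}
```

Building with `go build -tags rocm` would swap in the ROCm file instead, with no runtime branching needed.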
-
Daniel Hiltgen authored
Run server.cpp directly inside the Go runtime via cgo, while retaining the LLM Go abstractions.
-
Bruce MacDonald authored
-
Bruce MacDonald authored
- remove ggml runner
- automatically pull gguf models when ggml detected
- tell users to update to gguf in case the automatic pull fails

Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>
-
Matt Williams authored
Added cmdh to community section in README
-
Matt Williams authored
Let's get rid of these old modelfile examples.
-
pgibler authored
-
Matt Williams authored
Signed-off-by: Matt Williams <m@technovangelist.com>
-
- 18 Dec, 2023 5 commits
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Bruce MacDonald authored
-
Patrick Devine authored
-
Jeffrey Morgan authored
-
- 15 Dec, 2023 3 commits
-
Patrick Devine authored
-
Augustinas Malinauskas authored
* feat(docs): README with Enchanted iOS app
* Update README.md

Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
-
Ian Purton authored
* Added Bionic GPT as a front end.
* Update README.md

Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
-