Commits · dfda91c2eeb0a067c2309187d86b6325226853cd · OpenDAS / ollama

04 Jan, 2024 3 commits
- Init submodule with new path · fac9060d
  Daniel Hiltgen authored Jan 04, 2024
  
- remove old llama.cpp submodule path · a554616f
  Daniel Hiltgen authored Jan 04, 2024
  
- Code shuffle to clean up the llm dir · 77d96da9
  Daniel Hiltgen authored Jan 04, 2024
  
19 Dec, 2023 1 commit

Bruce MacDonald authored Nov 24, 2023

- remove ggml runner
- automatically pull gguf models when ggml detected
- tell users to update to gguf in the case automatic pull fails
Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>

811b1f03

21 Sep, 2023 1 commit
- silence warm up log · 058d0cd0
  Michael Yang authored Sep 21, 2023
  
07 Sep, 2023 1 commit
- GGUF support (#441) · 09dd2aef
  Bruce MacDonald authored Sep 07, 2023
  
30 Aug, 2023 2 commits
- update docs for subprocess · a82eb275
  Jeffrey Morgan authored Aug 30, 2023
- subprocess llama.cpp server (#401) · 42998d79
  Bruce MacDonald authored Aug 30, 2023

  * remove c code
  * pack llama.cpp
  * use request context for llama_cpp
  * let llama_cpp decide the number of threads to use
  * stop llama runner when app stops
  * remove sample count and duration metrics
  * use go generate to get libraries
  * tmp dir for running llm