- 09 Jul, 2024 7 commits
-
-
royjhan authored
* stop token parsing fix * add stop test
-
royjhan authored
-
Daniel Hiltgen authored
Statically link c++ and thread lib on windows
-
Daniel Hiltgen authored
This makes sure we statically link the c++ and thread library on windows to avoid unnecessary runtime dependencies on non-standard DLLs
-
Michael Yang authored
update named templates
-
Michael Yang authored
update message processing
-
Jeffrey Morgan authored
* server: fix unneeded model reloads when setting `OLLAMA_NUM_PARALLEL` * remove whitespace change * undo some changes
-
- 07 Jul, 2024 5 commits
-
-
Jeffrey Morgan authored
llm: remove ambiguous comment when putting upper limit on predictions to avoid infinite generation (#5535)
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 06 Jul, 2024 10 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
jmorganca authored
-
jmorganca authored
-
jmorganca authored
-
jmorganca authored
-
jmorganca authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
* Revert "fix cmake build (#5505)" This reverts commit 4fd5f352. * llm: fix missing dylibs by restoring old build behavior * crlf -> lf
-
- 05 Jul, 2024 17 commits
-
-
Jeffrey Morgan authored
* llm: put back old include dir * llm: update link paths for old submodule commits
-
Michael Yang authored
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
Always go build in CI generate steps
-
Daniel Hiltgen authored
With the recent cgo changes, bugs can sneak through if we don't make sure to `go build` all the permutations
-
Blake Mizerany authored
This was leading to ambiguity and confusion in ollama.com, and is not used anywhere in ollama at the moment. Once manifests are addressable by digest, we can add this back in, and in a way that is more tailored to the concept of addressing a manifest by digest.
-
Michael Yang authored
-
Michael Yang authored
ensure runtime model changes (template, system prompt, messages, options) are captured on model updates without needing to reload the server
-
Michael Yang authored
-
Michael Yang authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
* Use common prefix to select slot * actually report `longest`
-
Daniel Hiltgen authored
Prevent loading models larger than total memory
-
Jeffrey Morgan authored
* Fix assert on small embedding inputs * Update llm/patches/09-pooling.diff
-
Daniel Hiltgen authored
Document older win10 terminal problems
-
Daniel Hiltgen authored
Fix clip model loading with unicode paths
-
- 04 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
We haven't found a workaround, so for now recommend updating.
-