- 06 Jan, 2026 1 commit
-
-
Devon Rifkin authored
* preserve tool definition and call JSON ordering This is another iteration of <https://github.com/ollama/ollama/pull/12518>, but this time we've simplified things by relaxing the competing requirements of being compatible AND order-preserving with templates (vs. renderers). We maintain backwards compatibility at the cost of not guaranteeing order for templates. We plan on moving more and more models to renderers, which have been updated to use these new data types, and additionally we could add an opt-in way of templates getting an order-preserved list (e.g., via sibling template vars) * orderedmap_test: remove testify
-
- 18 Dec, 2025 1 commit
-
-
Jeffrey Morgan authored
-
- 04 Nov, 2025 1 commit
-
-
Daniel Hiltgen authored
* app: add code for macOS and Windows apps under 'app' * app: add readme * app: windows and linux only for now * ci: fix ui CI validation --------- Co-authored-by:jmorganca <jmorganca@gmail.com>
-
- 20 Jun, 2025 1 commit
-
-
Michael Yang authored
* Reapply "feat: incremental gguf parser (#10822)" (#11114) This reverts commit a6e64fbd. * fix older ggufs
-
- 18 Jun, 2025 1 commit
-
-
Jeffrey Morgan authored
This reverts commit 6b04cad7.
-
- 12 Jun, 2025 1 commit
-
-
Michael Yang authored
* incremental gguf parser * gguf: update test to not rely on gguf on disc * re-use existing create gguf * read capabilities from gguf kv * kv exists * update tests * s/doneFunc/successFunc/g * new buffered reader --------- Co-authored-by:Bruce MacDonald <brucewmacdonald@gmail.com>
-
- 29 Apr, 2025 1 commit
-
-
batuhankadioglu authored
-
- 05 Mar, 2025 1 commit
-
-
Blake Mizerany authored
This commit replaces the old pull implementation in the server package with the new, faster, more robust pull implementation in the registry package. The new endpoint, and now the remove endpoint too, are behind the feature gate "client2" enabled only by setting the OLLAMA_EXPERIMENT environment variable include "client2". Currently, the progress indication is wired to perform the same as the previous implementation to avoid making changes to the CLI, and because the status reports happen at the start of the download, and the end of the write to disk, the progress indication is not as smooth as it could be. This is a known issue and will be addressed in a future change. This implementation may be ~0.5-1.0% slower in rare cases, depending on network and disk speed, but is generally MUCH faster and more robust than the its predecessor in all other cases.
-
- 27 Feb, 2025 1 commit
-
-
Blake Mizerany authored
This commit introduces a new API implementation for handling interactions with the registry and the local model cache. The new API is located in server/internal/registry. The package name is "registry" and should be considered temporary; it is hidden and not bleeding outside of the server package. As the commits roll in, we'll start consuming more of the API and then let reverse osmosis take effect, at which point it will surface closer to the root level packages as much as needed.
-
- 29 Jan, 2025 1 commit
-
-
Michael Yang authored
* add build to .dockerignore * test: only build one arch * add build to .gitignore * fix ccache path * filter amdgpu targets * only filter if autodetecting * Don't clobber gpu list for default runner This ensures the GPU specific environment variables are set properly * explicitly set CXX compiler for HIP * Update build_windows.ps1 This isn't complete, but is close. Dependencies are missing, and it only builds the "default" preset. * build: add ollama subdir * add .git to .dockerignore * docs: update development.md * update build_darwin.sh * remove unused scripts * llm: add cwd and build/lib/ollama to library paths * default DYLD_LIBRARY_PATH to LD_LIBRARY_PATH in runner on macOS * add additional cmake output vars for msvc * interim edits to make server detection logic work with dll directories like lib/ollama/cuda_v12 * remove unncessary filepath.Dir, cleanup * add hardware-specific directory to path * use absolute server path * build: linux arm * cmake install targets * remove unused files * ml: visit each library path once * build: skip cpu variants on arm * build: install cpu targets * build: fix workflow * shorter names * fix rocblas install * docs: clean up development.md * consistent build dir removal in development.md * silence -Wimplicit-function-declaration build warnings in ggml-cpu * update readme * update development readme * llm: update library lookup logic now that there is one runner (#8587) * tweak development.md * update docs * add windows cuda/rocm tests --------- Co-authored-by:
jmorganca <jmorganca@gmail.com> Co-authored-by:
Daniel Hiltgen <daniel@ollama.com>
-
- 21 Dec, 2024 1 commit
-
-
Michael Yang authored
gods v2 uses go generics rather than interfaces which simplifies the code considerably
-
- 20 Dec, 2024 1 commit
-
-
Squishedmac authored
-
- 23 Nov, 2024 1 commit
-
-
Meng Zhuo authored
-
- 22 Nov, 2024 1 commit
-
-
Mikel Olasagasti Uranga authored
update uuid.New().String() to uuid.NewString()
-
- 14 Nov, 2024 1 commit
-
-
Bruce MacDonald authored
- golang.org/x/sync v0.3.0 -> v0.9.0 - golang.org/x/image v0.14.0 -> v0.22.0 - golang.org/x/text v0.15.0 -> v0.20.0
-
- 18 Oct, 2024 1 commit
-
-
Patrick Devine authored
Co-authored-by:
jmorganca <jmorganca@gmail.com> Co-authored-by:
Michael Yang <mxyng@pm.me> Co-authored-by:
Jesse Gross <jesse@ollama.com>
-
- 06 Jun, 2024 1 commit
-
-
Michael Yang authored
-
- 21 May, 2024 1 commit
-
-
Michael Yang authored
-
- 11 May, 2024 1 commit
-
-
Patrick Devine authored
-
- 15 Apr, 2024 1 commit
-
-
Patrick Devine authored
-
- 07 Mar, 2024 1 commit
-
-
Patrick Devine authored
-
- 24 Feb, 2024 1 commit
-
-
Michael Yang authored
-
- 15 Feb, 2024 2 commits
-
-
vinjn authored
-
Daniel Hiltgen authored
This focuses on Windows first, but coudl be used for Mac and possibly linux in the future.
-
- 19 Dec, 2023 1 commit
-
-
Daniel Hiltgen authored
Run the server.cpp directly inside the Go runtime via cgo while retaining the LLM Go abstractions.
-
- 05 Dec, 2023 1 commit
-
-
Michael Yang authored
-
- 14 Nov, 2023 1 commit
-
-
Michael Yang authored
-
- 01 Nov, 2023 1 commit
-
-
Michael Yang authored
-
- 25 Oct, 2023 2 commits
-
-
Patrick Devine authored
-
Ajay Kemparaj authored
-
- 16 Oct, 2023 1 commit
-
-
Bruce MacDonald authored
-
- 06 Oct, 2023 1 commit
-
-
Michael Yang authored
-
- 22 Sep, 2023 2 commits
-
-
Patrick Devine authored
-
Patrick Devine authored
-
- 05 Sep, 2023 1 commit
-
-
Michael Yang authored
-
- 30 Aug, 2023 1 commit
-
-
Bruce MacDonald authored
* remove c code * pack llama.cpp * use request context for llama_cpp * let llama_cpp decide the number of threads to use * stop llama runner when app stops * remove sample count and duration metrics * use go generate to get libraries * tmp dir for running llm
-
- 10 Aug, 2023 1 commit
-
-
Michael Yang authored
-
- 08 Aug, 2023 1 commit
-
-
Bruce MacDonald authored
-
- 01 Aug, 2023 2 commits
-
-
Bruce MacDonald authored
- read runner options from map to see what was specified explicitly and overwrite zero values
-
Bruce MacDonald authored
-