- 18 Jun, 2025 1 commit
-
-
Jeffrey Morgan authored
This reverts commit 6b04cad7.
-
- 12 Jun, 2025 1 commit
-
-
Michael Yang authored
* incremental gguf parser
* gguf: update test to not rely on gguf on disc
* re-use existing create gguf
* read capabilities from gguf kv
* kv exists
* update tests
* s/doneFunc/successFunc/g
* new buffered reader

---------

Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
-
- 29 Apr, 2025 1 commit
-
-
batuhankadioglu authored
-
- 07 Mar, 2025 1 commit
-
-
Parth Sareen authored
This change brings in various interface cleanups and greatly improves the performance of the sampler. Tested with llama3.2 on a local machine: performance improves from ~70 tokens/s to 135 tokens/s with topK(40) enabled. Without topK, performance is ~110 tokens/s.
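For readers unfamiliar with the topK setting mentioned above, here is a minimal sketch of top-k filtering in Go. It is not the sampler code from this commit: the `token` type and `topK` function are hypothetical names, and the full sort is chosen for clarity rather than speed.

```go
package main

import (
	"fmt"
	"sort"
)

// token pairs a vocabulary ID with its logit. Both names are illustrative
// assumptions and are not taken from the repository.
type token struct {
	id    int
	logit float32
}

// topK returns the k highest-logit tokens in descending order of logit.
// A non-positive k, or a k larger than the input, keeps every token.
// The input slice is left unmodified.
func topK(tokens []token, k int) []token {
	sorted := make([]token, len(tokens))
	copy(sorted, tokens)
	sort.Slice(sorted, func(i, j int) bool {
		return sorted[i].logit > sorted[j].logit
	})
	if k <= 0 || k > len(sorted) {
		return sorted
	}
	return sorted[:k]
}

func main() {
	tokens := []token{{0, 0.1}, {1, 2.0}, {2, -1.0}, {3, 1.5}}
	for _, t := range topK(tokens, 2) {
		fmt.Println(t.id, t.logit)
	}
}
```

Sorting the whole vocabulary costs O(n log n) per step. A heap or partial selection would likely be cheaper when k is small, though whether the commit uses such a technique is not stated here.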
-
- 05 Mar, 2025 1 commit
-
-
Blake Mizerany authored
This commit replaces the old pull implementation in the server package with the new, faster, more robust pull implementation in the registry package. The new endpoint, and now the remove endpoint too, are behind the feature gate "client2", which is enabled only by setting the OLLAMA_EXPERIMENT environment variable to include "client2". Currently, the progress indication is wired to perform the same as the previous implementation to avoid making changes to the CLI. Because the status reports happen at the start of the download and at the end of the write to disk, the progress indication is not as smooth as it could be. This is a known issue and will be addressed in a future change. This implementation may be ~0.5-1.0% slower in rare cases, depending on network and disk speed, but is generally MUCH faster and more robust than its predecessor in all other cases.
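The feature gate described above could be checked roughly as follows. This is a sketch only: the helper name `experimentEnabled` is hypothetical, and treating OLLAMA_EXPERIMENT as a comma-separated list is an assumption, since the commit message says only that the variable must "include" the name.

```go
package main

import (
	"fmt"
	"os"
	"strings"
)

// experimentEnabled reports whether name appears as a whole entry in list.
// ASSUMPTION: the list is comma-separated; the real format may differ.
func experimentEnabled(list, name string) bool {
	for _, entry := range strings.Split(list, ",") {
		if strings.TrimSpace(entry) == name {
			return true
		}
	}
	return false
}

func main() {
	// Read the gate once at startup and branch on the result.
	useClient2 := experimentEnabled(os.Getenv("OLLAMA_EXPERIMENT"), "client2")
	fmt.Println("client2 enabled:", useClient2)
}
```

Matching whole entries rather than substrings avoids accidentally enabling the gate for a differently named experiment that merely contains "client2".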
-
- 27 Feb, 2025 2 commits
-
-
Jesse Gross authored
Otherwise on Linux I get: go: download go1.24 for linux/amd64: toolchain not available
-
Blake Mizerany authored
This commit introduces a new API implementation for handling interactions with the registry and the local model cache. The new API is located in server/internal/registry. The package name is "registry" and should be considered temporary; it is hidden and not bleeding outside of the server package. As the commits roll in, we'll start consuming more of the API and then let reverse osmosis take effect, at which point it will surface closer to the root level packages as much as needed.
-
- 24 Feb, 2025 1 commit
-
-
Blake Mizerany authored
-
- 29 Jan, 2025 1 commit
-
-
Michael Yang authored
* add build to .dockerignore
* test: only build one arch
* add build to .gitignore
* fix ccache path
* filter amdgpu targets
* only filter if autodetecting
* Don't clobber gpu list for default runner
  This ensures the GPU specific environment variables are set properly
* explicitly set CXX compiler for HIP
* Update build_windows.ps1
  This isn't complete, but is close. Dependencies are missing, and it only builds the "default" preset.
* build: add ollama subdir
* add .git to .dockerignore
* docs: update development.md
* update build_darwin.sh
* remove unused scripts
* llm: add cwd and build/lib/ollama to library paths
* default DYLD_LIBRARY_PATH to LD_LIBRARY_PATH in runner on macOS
* add additional cmake output vars for msvc
* interim edits to make server detection logic work with dll directories like lib/ollama/cuda_v12
* remove unnecessary filepath.Dir, cleanup
* add hardware-specific directory to path
* use absolute server path
* build: linux arm
* cmake install targets
* remove unused files
* ml: visit each library path once
* build: skip cpu variants on arm
* build: install cpu targets
* build: fix workflow
* shorter names
* fix rocblas install
* docs: clean up development.md
* consistent build dir removal in development.md
* silence -Wimplicit-function-declaration build warnings in ggml-cpu
* update readme
* update development readme
* llm: update library lookup logic now that there is one runner (#8587)
* tweak development.md
* update docs
* add windows cuda/rocm tests

---------

Co-authored-by: jmorganca <jmorganca@gmail.com>
Co-authored-by: Daniel Hiltgen <daniel@ollama.com>
-
- 21 Dec, 2024 1 commit
-
-
Michael Yang authored
gods v2 uses go generics rather than interfaces which simplifies the code considerably
-
- 20 Dec, 2024 1 commit
-
-
Squishedmac authored
-
- 11 Dec, 2024 1 commit
-
-
Blake Mizerany authored
-
- 23 Nov, 2024 1 commit
-
-
Meng Zhuo authored
-
- 22 Nov, 2024 1 commit
-
-
Mikel Olasagasti Uranga authored
update uuid.New().String() to uuid.NewString()
-
- 14 Nov, 2024 1 commit
-
-
Bruce MacDonald authored
- golang.org/x/sync v0.3.0 -> v0.9.0
- golang.org/x/image v0.14.0 -> v0.22.0
- golang.org/x/text v0.15.0 -> v0.20.0
-
- 27 Oct, 2024 1 commit
-
-
Daniel Hiltgen authored
-
- 18 Oct, 2024 1 commit
-
-
Patrick Devine authored
Co-authored-by: jmorganca <jmorganca@gmail.com>
Co-authored-by: Michael Yang <mxyng@pm.me>
Co-authored-by: Jesse Gross <jesse@ollama.com>
-
- 13 Aug, 2024 1 commit
-
-
Daniel Hiltgen authored
Go version 1.22.6 is triggering AV false positives, so go back to 1.22.5
-
- 05 Jul, 2024 1 commit
-
-
Michael Yang authored
-
- 06 Jun, 2024 1 commit
-
-
Michael Yang authored
-
- 21 May, 2024 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
- 20 May, 2024 1 commit
-
-
jmorganca authored
-
- 11 May, 2024 1 commit
-
-
Patrick Devine authored
-
- 15 Apr, 2024 1 commit
-
-
Patrick Devine authored
-
- 29 Mar, 2024 1 commit
-
-
Patrick Devine authored
Co-authored-by: Michael Yang <mxyng@pm.me>
-
- 26 Mar, 2024 1 commit
-
-
Patrick Devine authored
-
- 07 Mar, 2024 1 commit
-
-
Patrick Devine authored
-
- 24 Feb, 2024 1 commit
-
-
Michael Yang authored
-
- 15 Feb, 2024 2 commits
-
-
vinjn authored
-
Daniel Hiltgen authored
This focuses on Windows first, but could be used for Mac and possibly Linux in the future.
-
- 18 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
-
- 11 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
This switches darwin to dynamic loading, and refactors the code now that no static linking of the library is used on any platform
-
- 19 Dec, 2023 1 commit
-
-
Daniel Hiltgen authored
Run the server.cpp directly inside the Go runtime via cgo while retaining the LLM Go abstractions.
-
- 15 Dec, 2023 1 commit
-
-
Patrick Devine authored
-
- 05 Dec, 2023 1 commit
-
-
Michael Yang authored
-
- 14 Nov, 2023 1 commit
-
-
Michael Yang authored
-
- 01 Nov, 2023 1 commit
-
-
Michael Yang authored
-
- 25 Oct, 2023 2 commits
-
-
Patrick Devine authored
-
Ajay Kemparaj authored
-