- 11 Jul, 2024 1 commit
-
-
Michael Yang authored
-
- 10 Jul, 2024 8 commits
-
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
Wire up windows AMD driver reporting
-
Daniel Hiltgen authored
Bundle missing CRT libraries
-
Daniel Hiltgen authored
Detect CUDA OS overhead
-
Daniel Hiltgen authored
Bump ROCm on windows to 6.1.2
-
Daniel Hiltgen authored
Remove duplicate merge glitch
-
Daniel Hiltgen authored
This also adjusts our algorithm to favor our bundled ROCm. I've confirmed VRAM reporting still doesn't work properly so we can't yet enable concurrency by default.
-
Daniel Hiltgen authored
-
- 09 Jul, 2024 10 commits
-
-
Daniel Hiltgen authored
Workaround broken ROCm p2p copy
-
royjhan authored
* stop token parsing fix * add stop test
-
royjhan authored
-
Daniel Hiltgen authored
This adds logic to detect skew between the driver and management library which can be attributed to OS overhead and records that so we can adjust subsequent management library free VRAM updates and avoid OOM scenarios.
-
Daniel Hiltgen authored
Statically link c++ and thread lib on windows
-
Daniel Hiltgen authored
This makes sure we statically link the c++ and thread library on windows to avoid unnecessary runtime dependencies on non-standard DLLs
-
Michael Yang authored
update named templates
-
Michael Yang authored
update message processing
-
Jeffrey Morgan authored
* server: fix unneeded model reloads when setting `OLLAMA_NUM_PARALLEL` * remove whitespace change * undo some changes
-
Daniel Hiltgen authored
Some users are experienging runner startup errors due to not having these msvc redist libraries on their host
-
- 08 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
Enable the build flag for llama.cpp to use CPU copy for multi-GPU scenarios.
-
- 07 Jul, 2024 5 commits
-
-
Jeffrey Morgan authored
llm: remove ambiguous comment when putting upper limit on predictions to avoid infinite generation (#5535)
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 06 Jul, 2024 10 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
jmorganca authored
-
jmorganca authored
-
jmorganca authored
-
jmorganca authored
-
jmorganca authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
* Revert "fix cmake build (#5505)" This reverts commit 4fd5f352. * llm: fix missing dylibs by restoring old build behavior * crlf -> lf
-
- 05 Jul, 2024 5 commits
-
-
Jeffrey Morgan authored
* llm: put back old include dir * llm: update link paths for old submodule commits
-
Michael Yang authored
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
Always go build in CI generate steps
-
Daniel Hiltgen authored
With the recent cgo changes, bugs can sneak through if we don't make sure to `go build` all the permutations
-