"torchvision/vscode:/vscode.git/clone" did not exist on "6b41eb0b3bb8ca40dc3b4c025de33421992c9cd2"
- 26 Jul, 2024 6 commits
-
-
Michael Yang authored
fix nil deref in auth.go
-
Blake Mizerany authored
This fixes various data races scattered throughout the download/pull client where the client was accessing the download state concurrently. This commit is mostly a hot-fix and will be replaced by a new client one day soon. Also, remove the unnecessary opts argument from downloadChunk.
-
Michael Yang authored
-
Michael Yang authored
autodetect stop parameters from template
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 25 Jul, 2024 6 commits
-
-
Michael Yang authored
docs
-
Michael Yang authored
Co-authored-by:Jeffrey Morgan <jmorganca@gmail.com>
-
Blake Mizerany authored
This changes the registry client to reuse the original download URL it gets on the first redirect response for all subsequent requests, preventing thundering herd issues when hot new LLMs are released.
-
Jeffrey Morgan authored
-
royjhan authored
-
Jeffrey Morgan authored
This reverts commit bb46bbcf.
-
- 24 Jul, 2024 2 commits
-
-
Michael Yang authored
-
royjhan authored
* float cmp * increase tolerance
-
- 23 Jul, 2024 1 commit
-
-
Ajay Chintala authored
-
- 22 Jul, 2024 14 commits
-
-
Daniel Hiltgen authored
Bump Go patch version
-
Daniel Hiltgen authored
bump go version to 1.22.5 to fix security vulnerabilities in docker
-
Daniel Hiltgen authored
-
Josh authored
-
Michael Yang authored
-
Michael Yang authored
-
royjhan authored
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
Refine error reporting for subprocess crash
-
Daniel Hiltgen authored
Remove no longer supported max vram var
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
The OLLAMA_MAX_VRAM env var was a temporary workaround for OOM scenarios. With Concurrency this was no longer wired up, and the simplistic value doesn't map to multi-GPU setups. Users can still set `num_gpu` to limit memory usage to avoid OOM if we get our predictions wrong.
-
Daniel Hiltgen authored
On windows, the exit status winds up being the search term many users search for and end up piling in on issues that are unrelated. This refines the reporting so that if we have a more detailed message we'll suppress the exit status portion of the message.
-
- 21 Jul, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 20 Jul, 2024 7 commits
-
-
Daniel Hiltgen authored
Adjust windows ROCm discovery
-
Daniel Hiltgen authored
Refine scheduler unit tests for reliability
-
Daniel Hiltgen authored
Fix context exhaustion integration test for small gpus
-
Daniel Hiltgen authored
The v5 hip library returns unsupported GPUs which wont enumerate at inference time in the runner so this makes sure we align discovery. The gfx906 cards are no longer supported so we shouldn't compile with that GPU type as it wont enumerate at runtime.
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 19 Jul, 2024 2 commits