- 26 Apr, 2024 19 commits
-
-
Michael Yang authored
-
Michael Yang authored
use merge base for diff-tree
-
Blake Mizerany authored
Also, remove a superfluous 'go get'
-
Michael Yang authored
-
Daniel Hiltgen authored
Put back non-avx CPU build for windows
-
Blake Mizerany authored
-
Quinten van Buul authored
Fixed a typo
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
Bump llama.cpp to b2737
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
Fix exe name for zip packaging on windows
-
Daniel Hiltgen authored
The zip file encodes the OS and architecture, so keep the short exe name
-
Daniel Hiltgen authored
Refactor windows generate for more modular usage
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
Move cuda/rocm dependency gathering into generate script
-
Daniel Hiltgen authored
This will make it simpler for CI to accumulate artifacts from prior steps
-
Daniel Hiltgen authored
Fix release CI
-
Daniel Hiltgen authored
download-artifact path was being used incorrectly. It is where to extract the zip not the files in the zip to extract. Default is workspace dir which is what we want, so omit it
-
- 25 Apr, 2024 9 commits
-
-
Michael Yang authored
only count output tensors
-
Daniel Hiltgen authored
Improve mac parallel performance
-
Jeffrey Morgan authored
* reload model if `num_gpu` changes * dont reload on -1 * fix tests
-
Jeffrey Morgan authored
* llm: limit generation to 10x context size to avoid run on generations * add comment * simplify condition statement
-
Michael Yang authored
-
Daniel Hiltgen authored
-
jmorganca authored
-
Roy Yang authored
-
Daniel Hiltgen authored
Move ggml loading to when attempting to fit
-
- 24 Apr, 2024 12 commits
-
-
Bryce Reitano authored
-
Bryce Reitano authored
-
Bryce Reitano authored
-
Michael Yang authored
update copy handler to use model.Name
-
Michael Yang authored
-
Michael Yang authored
fix: from blob
-
Michael Yang authored
-
Michael Yang authored
-
Blake Mizerany authored
-
Daniel Hiltgen authored
AMD gfx patch rev is hex
-
Daniel Hiltgen authored
Report errors on server lookup instead of path lookup failure
-
Daniel Hiltgen authored
Correctly handle gfx90a discovery
-