- 28 Apr, 2024 5 commits
-
-
Daniel Hiltgen authored
Fix concurrency for CPU mode
-
Daniel Hiltgen authored
Prior refactoring passes accidentally removed the logic to bypass VRAM checks for CPU loads. This adds that back, along with test coverage. This also moves access to the loaded map in the unit test behind the mutex; the unguarded access was likely the cause of various test flakes.
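The two fixes described above can be illustrated with a minimal sketch. It is not the project's code: the `scheduler` type, the `fits` helper, and the `"cpu"` library string are hypothetical names chosen for illustration. It shows a fit check that skips the VRAM comparison for CPU loads, and a map that is only ever touched while holding a mutex.

```go
package main

import (
	"fmt"
	"sync"
)

// scheduler is a hypothetical stand-in for whatever tracks loaded models;
// the real type in the repository may look quite different.
type scheduler struct {
	mu     sync.Mutex
	loaded map[string]bool
}

// fits reports whether a model may be loaded. CPU loads bypass the VRAM
// check entirely, since they do not consume GPU memory. (Assumed logic.)
func fits(library string, needVRAM, freeVRAM uint64) bool {
	if library == "cpu" {
		return true
	}
	return needVRAM <= freeVRAM
}

// markLoaded records a model as loaded while holding the mutex.
func (s *scheduler) markLoaded(name string) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.loaded[name] = true
}

// isLoaded reads the map under the same mutex, so tests that inspect
// the map do not race with goroutines that write to it.
func (s *scheduler) isLoaded(name string) bool {
	s.mu.Lock()
	defer s.mu.Unlock()
	return s.loaded[name]
}

func main() {
	s := &scheduler{loaded: make(map[string]bool)}

	// Concurrent writers are safe because every access takes the lock.
	var wg sync.WaitGroup
	for i := 0; i < 4; i++ {
		wg.Add(1)
		go func(i int) {
			defer wg.Done()
			s.markLoaded(fmt.Sprintf("model-%d", i))
		}(i)
	}
	wg.Wait()

	fmt.Println(s.isLoaded("model-2"))
	fmt.Println(fits("cpu", 8<<30, 0))      // CPU load: VRAM is ignored
	fmt.Println(fits("cuda", 8<<30, 4<<30)) // GPU load: 8 GiB does not fit in 4 GiB
}
```

Running a test that reads such a map without the lock under `go test -race` is the usual way this kind of flake is detected.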
-
Blake Mizerany authored
-
Jeffrey Morgan authored
-
Blake Mizerany authored
-
- 27 Apr, 2024 2 commits
-
-
Blake Mizerany authored
-
Blake Mizerany authored
The Digest type needs more thought and is not necessary at the moment.
-
- 26 Apr, 2024 30 commits
-
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
Fine grain control over windows generate steps
-
Daniel Hiltgen authored
This will speed up CI which already tries to only build static for unit tests
-
Nataly Merezhuk authored
-
Daniel Hiltgen authored
Fix target in gen_windows.ps1
-
Michael Yang authored
fix gemma, command-r layer weights
-
Daniel Hiltgen authored
-
Michael Yang authored
-
Blake Mizerany authored
These are useful outside this package.
-
Michael Yang authored
check file type before zip
-
Jeffrey Morgan authored
-
Michael Yang authored
-
Michael Yang authored
use merge base for diff-tree
-
Blake Mizerany authored
Also, remove a superfluous 'go get'
-
Michael Yang authored
-
Daniel Hiltgen authored
Put back non-avx CPU build for windows
-
Blake Mizerany authored
-
Quinten van Buul authored
Fixed a typo
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
Bump llama.cpp to b2737
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
Fix exe name for zip packaging on windows
-
Daniel Hiltgen authored
The zip file encodes the OS and architecture, so keep the short exe name
-
Daniel Hiltgen authored
Refactor windows generate for more modular usage
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
Move cuda/rocm dependency gathering into generate script
-
Daniel Hiltgen authored
This will make it simpler for CI to accumulate artifacts from prior steps
-
Daniel Hiltgen authored
Fix release CI
-
Daniel Hiltgen authored
The download-artifact `path` input was being used incorrectly. It specifies where to extract the zip, not which files in the zip to extract. The default is the workspace directory, which is what we want, so omit it.
-
- 25 Apr, 2024 3 commits
-
-
Michael Yang authored
only count output tensors
-
Daniel Hiltgen authored
Improve mac parallel performance
-
Jeffrey Morgan authored
* reload model if `num_gpu` changes
* don't reload on -1
* fix tests
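The reload rule in this commit message can be sketched as a small predicate. This is an illustration under assumptions, not the project's implementation: the function name `needsReload` is hypothetical, and the sketch assumes that a requested `num_gpu` of -1 means "no explicit preference", so it never forces a reload.

```go
package main

import "fmt"

// needsReload is a hypothetical helper: it reports whether a model that was
// loaded with loadedNumGPU layers offloaded must be reloaded to honor a
// request asking for requestedNumGPU.
func needsReload(loadedNumGPU, requestedNumGPU int) bool {
	// Assumption: -1 means the caller has no explicit preference,
	// so the already-loaded model is reused as-is.
	if requestedNumGPU == -1 {
		return false
	}
	return loadedNumGPU != requestedNumGPU
}

func main() {
	fmt.Println(needsReload(20, 33)) // explicit change: reload
	fmt.Println(needsReload(20, -1)) // -1: keep the loaded model
	fmt.Println(needsReload(20, 20)) // unchanged: keep the loaded model
}
```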
-