Commits · 581a4a55532e2a28177bcaffd722a2bf298f967d · OpenDAS / ollama

11 Dec, 2024 2 commits

ci: fix artifact path prefix for missing windows payloads (#8052) · 581a4a55

Daniel Hiltgen authored Dec 11, 2024

upload-artifacts strips off leading common paths so when
the ./build/ artifacts were removed, the ./dist/windows-amd64
prefix became common and was stripped, making the
later download-artifacts place them in the wrong location

581a4a55

ci: build dir changed (#8037) · 6a6328a5
Daniel Hiltgen authored Dec 10, 2024
```
Remove no longer relevant build log dir
```
6a6328a5

10 Dec, 2024 1 commit

build: Make target improvements (#7499) · 4879a234

Daniel Hiltgen authored Dec 10, 2024

* llama: wire up builtin runner

This adds a new entrypoint into the ollama CLI to run the cgo built runner.
On Mac arm64, this will have GPU support, but on all other platforms it will
be the lowest common denominator CPU build.  After we fully transition
to the new Go runners more tech-debt can be removed and we can stop building
the "default" runner via make and rely on the builtin always.

* build: Make target improvements

Add a few new targets and help for building locally.
This also adjusts the runner lookup to favor local builds, then
runners relative to the executable, and finally payloads.

* Support customized CPU flags for runners

This implements a simplified custom CPU flags pattern for the runners.
When built without overrides, the runner name contains the vector flag
we check for (AVX) to ensure we don't try to run on unsupported systems
and crash.  If the user builds a customized set, we omit the naming
scheme and don't check for compatibility.  This avoids checking
requirements at runtime, so that logic has been removed as well.  This
can be used to build GPU runners with no vector flags, or CPU/GPU
runners with additional flags (e.g. AVX512) enabled.

* Use relative paths

If the user checks out the repo in a path that contains spaces, make gets
really confused so use relative paths for everything in-repo to avoid breakage.

* Remove payloads from main binary

* install: clean up prior libraries

This removes support for v0.3.6 and older versions (before the tar bundle)
and ensures we clean up prior libraries before extracting the bundle(s).
Without this change, runners and dependent libraries could leak when we
update and lead to subtle runtime errors.

4879a234

04 Nov, 2024 3 commits
- CI: Switch to v13 macos runner (#7498) · 046054fa
  Daniel Hiltgen authored Nov 04, 2024
  
  046054fa
- CI: matrix strategy fix (#7496) · 95483f34
  Daniel Hiltgen authored Nov 04, 2024
```
Github actions matrix strategy can't access env settings
```
  95483f34
- Sign windows arm64 official binaries (#7493) · 44bd9e59
  Daniel Hiltgen authored Nov 04, 2024
  
  44bd9e59
02 Nov, 2024 1 commit

CI: omit unused tools for faster release builds (#7432) · b8d5036e

Daniel Hiltgen authored Nov 02, 2024

This leverages caching, and some reduced installer scope to try
to speed up builds. It also tidies up some windows build logic
that was only relevant for the older generate/cmake builds.

b8d5036e

30 Oct, 2024 2 commits

Soften windows clang requirement (#7428) · 712e99d4

Daniel Hiltgen authored Oct 30, 2024

This will no longer error if built with regular gcc on windows.  To help
triage issues that may come in related to different compilers, the runner now
reports the compier used by cgo.

712e99d4

Remove submodule and shift to Go server - 0.4.0 (#7157) · b754f5a6

Daniel Hiltgen authored Oct 30, 2024

* Remove llama.cpp submodule and shift new build to top

* CI: install msys and clang gcc on win

Needed for deepseek to work properly on windows

b754f5a6

24 Sep, 2024 1 commit

CI: Fix win arm version defect (#6940) · e9e9bdb8

Daniel Hiltgen authored Sep 24, 2024

write-host in powershell writes directly to the console and will not be picked
up by a pipe.  Echo, or write-output will.

e9e9bdb8

21 Sep, 2024 1 commit

CI: win arm artifact dist dir (#6900) · 2a038c1d

Daniel Hiltgen authored Sep 20, 2024

The upload artifact is missing the dist prefix since all
payloads are in the same directory, so restore the prefix
on download.

2a038c1d

20 Sep, 2024 3 commits

CI: win arm adjustments (#6898) · 616c5eaf
Daniel Hiltgen authored Sep 20, 2024

616c5eaf
CI: adjust step ordering for win arm to match x64 (#6895) · f5ff917b
Daniel Hiltgen authored Sep 20, 2024

f5ff917b

Add Windows arm64 support to official builds (#5712) · d632e23f

Daniel Hiltgen authored Sep 20, 2024

* Unified arm/x86 windows installer

This adjusts the installer payloads to be architecture aware so we can cary
both amd64 and arm64 binaries in the installer, and install only the applicable
architecture at install time.

* Include arm64 in official windows build

* Harden schedule test for slow windows timers

This test seems to be a bit flaky on windows, so give it more time to converge

d632e23f

17 Sep, 2024 1 commit

CI: dist directories no longer present (#6834) · 8f9ab5e1

Daniel Hiltgen authored Sep 16, 2024

The new buildx based build no longer leaves the dist/linux-* directories
around, so we don't have to clean them up before uploading.

8f9ab5e1

16 Sep, 2024 2 commits

CI: clean up naming, fix tagging latest (#6832) · 7717bb6a

Daniel Hiltgen authored Sep 16, 2024

The rocm CI step for RCs was incorrectly tagging them as the latest rocm build.
The multiarch manifest was incorrectly tagged twice (with and without the
prefix "v"). Static windows artifacts weren't being carried between build
jobs. This also fixes the latest tagging script.

7717bb6a

CI: set platform build build_linux script to keep buildx happy (#6829) · 0ec2915e
Daniel Hiltgen authored Sep 16, 2024
```
The runners don't have emulation set up so the default multi-platform build
wont work.
```
0ec2915e

12 Sep, 2024 1 commit

Optimize container images for startup (#6547) · cd5c8f64

Daniel Hiltgen authored Sep 12, 2024

* Optimize container images for startup

This change adjusts how to handle runner payloads to support
container builds where we keep them extracted in the filesystem.
This makes it easier to optimize the cpu/cuda vs cpu/rocm images for
size, and should result in faster startup times for container images.

* Refactor payload logic and add buildx support for faster builds

* Move payloads around

* Review comments

* Converge to buildx based helper scripts

* Use docker buildx action for release

cd5c8f64

20 Aug, 2024 1 commit

Split rocm back out of bundle (#6432) · a017cf2f

Daniel Hiltgen authored Aug 20, 2024

We're over budget for github's maximum release artifact size with rocm + 2 cuda
versions. This splits rocm back out as a discrete artifact, but keeps the layout so it can
be extracted into the same location as the main bundle.

a017cf2f

19 Aug, 2024 6 commits
- CI: remove directories from dist dir before upload step (#6429) · 19e5a890
  Daniel Hiltgen authored Aug 19, 2024
  
  19e5a890
- CI: handle directories during checksum (#6427) · f91c9e37
  Daniel Hiltgen authored Aug 19, 2024
  
  f91c9e37
- Fix overlapping artifact name on CI · d8be22e4
  Daniel Hiltgen authored Aug 19, 2024
  
  d8be22e4
- Review comments · f9e31da9
  Daniel Hiltgen authored Aug 15, 2024
  
  f9e31da9
- Add windows cuda v12 + v11 support · 927d98a6
  Daniel Hiltgen authored Jul 12, 2024
  
  927d98a6
- Refactor linux packaging · 74d45f01
  Daniel Hiltgen authored Jul 08, 2024
```
This adjusts linux to follow a similar model to windows with a discrete archive
(zip/tgz) to cary the primary executable, and dependent libraries. Runners are
still carried as payloads inside the main binary

Darwin retain the payload model where the go binary is fully self contained.
```
  74d45f01
13 Aug, 2024 1 commit
- Go back to a pinned Go version · feedf49c
  Daniel Hiltgen authored Aug 13, 2024
```
Go version 1.22.6 is triggering AV false positives, so go back to 1.22.5
```
  feedf49c
22 Jul, 2024 1 commit
- Bump Go patch version · 5d604eec
  Daniel Hiltgen authored Jul 22, 2024
  
  5d604eec
10 Jul, 2024 1 commit

Bump ROCm on windows to 6.1.2 · 1f50356e

Daniel Hiltgen authored Jul 10, 2024

This also adjusts our algorithm to favor our bundled ROCm.
I've confirmed VRAM reporting still doesn't work properly so we
can't yet enable concurrency by default.

1f50356e

09 Jul, 2024 1 commit

Statically link c++ and thread lib · b51e3b63

Daniel Hiltgen authored Jul 09, 2024

This makes sure we statically link the c++ and thread library on windows
to avoid unnecessary runtime dependencies on non-standard DLLs

b51e3b63

06 Jul, 2024 4 commits
- release: move mingw library cleanup to correct job · c12f1c5b
  jmorganca authored Jul 06, 2024
  
  c12f1c5b
- release: remove unwanted mingw dll.a files · a08f20d9
  jmorganca authored Jul 06, 2024
  
  a08f20d9
- Revert "llm: only statically link libstdc++" · 6cea0360
  jmorganca authored Jul 06, 2024
```
This reverts commit 5796bfc4.
```
  6cea0360
- llm: only statically link libstdc++ · 5796bfc4
  jmorganca authored Jul 06, 2024
  
  5796bfc4
15 Jun, 2024 1 commit

Implement custom github release action · a12283e2

Daniel Hiltgen authored Jun 15, 2024

This implements the release logic we want via gh cli
to support updating releases with rc tags in place and retain
release notes and other community reactions.

a12283e2

24 May, 2024 1 commit
- set codesign timeout to longer (#4605) · afd2b058
  Jeffrey Morgan authored May 23, 2024
  
  afd2b058
26 Apr, 2024 2 commits

Move cuda/rocm dependency gathering into generate script · 8feb97dc
Daniel Hiltgen authored Apr 25, 2024
```
This will make it simpler for CI to accumulate artifacts from prior steps
```
8feb97dc

Fix release CI · 8589d752

Daniel Hiltgen authored Apr 25, 2024

download-artifact path was being used incorrectly.  It is where to
extract the zip not the files in the zip to extract.  Default is
workspace dir which is what we want, so omit it

8589d752

23 Apr, 2024 1 commit

Move nested payloads to installer and zip file on windows · 058f6cd2

Daniel Hiltgen authored Apr 23, 2024

Now that the llm runner is an executable and not just a dll, more users are facing
problems with security policy configurations on windows that prevent users
writing to directories and then executing binaries from the same location.
This change removes payloads from the main executable on windows and shifts them
over to be packaged in the installer and discovered based on the executables location.
This also adds a new zip file for people who want to "roll their own" installation model.

058f6cd2

09 Apr, 2024 2 commits

Revert "build.go: introduce a friendlier way to build Ollama (#3548)" (#3564) · 1524f323
Blake Mizerany authored Apr 09, 2024

1524f323

build.go: introduce a friendlier way to build Ollama (#3548) · fccf3eec

Blake Mizerany authored Apr 09, 2024

This commit introduces a more friendly way to build Ollama dependencies
and the binary without abusing `go generate` and removing the
unnecessary extra steps it brings with it.

This script also provides nicer feedback to the user about what is
happening during the build process.

At the end, it prints a helpful message to the user about what to do
next (e.g. run the new local Ollama).

fccf3eec