subprocess llama.cpp server (#401)
* remove c code * pack llama.cpp * use request context for llama_cpp * let llama_cpp decide the number of threads to use * stop llama runner when app stops * remove sample count and duration metrics * use go generate to get libraries * tmp dir for running llm
Showing
.gitmodules
0 → 100644
| ... | @@ -38,9 +38,9 @@ require ( | ... | @@ -38,9 +38,9 @@ require ( |
| github.com/twitchyliquid64/golang-asm v0.15.1 // indirect | github.com/twitchyliquid64/golang-asm v0.15.1 // indirect | ||
| github.com/ugorji/go/codec v1.2.11 // indirect | github.com/ugorji/go/codec v1.2.11 // indirect | ||
| golang.org/x/arch v0.3.0 // indirect | golang.org/x/arch v0.3.0 // indirect | ||
| golang.org/x/crypto v0.10.0 // indirect | golang.org/x/crypto v0.10.0 | ||
| golang.org/x/net v0.10.0 // indirect | golang.org/x/net v0.10.0 // indirect | ||
| golang.org/x/sys v0.10.0 // indirect | golang.org/x/sys v0.11.0 // indirect | ||
| golang.org/x/term v0.10.0 | golang.org/x/term v0.10.0 | ||
| golang.org/x/text v0.10.0 // indirect | golang.org/x/text v0.10.0 // indirect | ||
| gonum.org/v1/gonum v0.13.0 | gonum.org/v1/gonum v0.13.0 | ||
| ... | ... |
llm/ggml-alloc.c
deleted
100644 → 0
This diff is collapsed.
llm/ggml-alloc.h
deleted
100644 → 0
llm/ggml-cuda.cu
deleted
100644 → 0
This diff is collapsed.
llm/ggml-cuda.h
deleted
100644 → 0
llm/ggml-metal.h
deleted
100644 → 0
llm/ggml-metal.m
deleted
100644 → 0
This diff is collapsed.
llm/ggml-metal.metal
deleted
100644 → 0
This diff is collapsed.
llm/ggml-mpi.c
deleted
100644 → 0
llm/ggml-mpi.h
deleted
100644 → 0
llm/ggml-opencl.cpp
deleted
100644 → 0
This diff is collapsed.
llm/ggml-opencl.h
deleted
100644 → 0
llm/ggml.c
deleted
100644 → 0
This diff is collapsed.
llm/ggml.h
deleted
100644 → 0
This diff is collapsed.
Please register or sign in to comment