Commit 944bd13d authored by Jeffrey Morgan's avatar Jeffrey Morgan
Browse files

Go

parent 0998d4f0
...@@ -11,7 +11,7 @@ Run large language models with `llama.cpp`. ...@@ -11,7 +11,7 @@ Run large language models with `llama.cpp`.
- Download and run popular large language models - Download and run popular large language models
- Switch between multiple models on the fly - Switch between multiple models on the fly
- Hardware acceleration where available (Metal, CUDA) - Hardware acceleration where available (Metal, CUDA)
- Fast inference server written in C++, powered by [llama.cpp](https://github.com/ggerganov/llama.cpp) - Fast inference server written in Go, powered by [llama.cpp](https://github.com/ggerganov/llama.cpp)
- REST API to use with your application (python, typescript SDKs coming soon) - REST API to use with your application (python, typescript SDKs coming soon)
## Install ## Install
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment