kaito.md 395 Bytes
Newer Older
raojy's avatar
raojy committed
1
2
3
4
5
# KAITO

[KAITO](https://kaito-project.github.io/kaito/docs/) is a Kubernetes operator that supports deploying and serving LLMs with vLLM. It offers managing large models via container images with built-in OpenAI-compatible inference, auto-provisioning GPU nodes and curated model presets.

Please refer to [quick start](https://kaito-project.github.io/kaito/docs/quick-start) for more details.