---
title: llmaz
---
[](){ #deployment-llmaz }

[llmaz](https://github.com/InftyAI/llmaz) is an easy-to-use, advanced inference platform for large language models on Kubernetes, aimed at production use. It uses vLLM as the default model serving backend.

Please refer to the [Quick Start](https://github.com/InftyAI/llmaz?tab=readme-ov-file#quick-start) for more details.