SPDX-FileCopyrightText: Copyright (c) 2024-2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
SPDX-License-Identifier: Apache-2.0
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
-->
# triton-distributed
## Overview
This repository contains the core components of a distributed GenAI inference framework written in Rust.
Features:
- A High level API defined in [component.rs](src/component.rs) to build distributed applications.
- A [Distributed runtime](src/distributed.rs) to manage the distributed execution of the inference graph.
- Uses [NATS](src/transports/nats.rs) for component communication and [etcd](src/transports/etcd.rs) for service discovery, allowing engines to be distributed across multiple nodes while maintaining a unified processing graph.
- Modular design makes it easy to build inference pipelines by composing reusable components.
## Install Rust:
```bash
curl --proto'=https'--tlsv1.2 -sSf https://sh.rustup.rs | sh