located [in the repository's top-level .pre-commit-config.yaml](https://github.com/triton-inference-server/triton-distributed/blob/main/.pre-commit-config.yaml).
located [in the repository's top-level .pre-commit-config.yaml](https://github.com/ai-dynamo/dynamo/blob/main/.pre-commit-config.yaml).
The hooks do some sanity checking like linting and formatting.
The hooks do some sanity checking like linting and formatting.
These checks must pass to merge a change.
These checks must pass to merge a change.
...
@@ -123,7 +123,7 @@ Also you can use vscode extension [GitHub Local Actions](https://marketplace.vis
...
@@ -123,7 +123,7 @@ Also you can use vscode extension [GitHub Local Actions](https://marketplace.vis
# Developer Certificate of Origin
# Developer Certificate of Origin
Triton Distributed is an open source product released under
Dynamo is an open source product released under
the Apache 2.0 license (see either
the Apache 2.0 license (see either
[the Apache site](https://www.apache.org/licenses/LICENSE-2.0) or
[the Apache site](https://www.apache.org/licenses/LICENSE-2.0) or
the [LICENSE file](./LICENSE)). The Apache 2.0 license allows you
the [LICENSE file](./LICENSE)). The Apache 2.0 license allows you
...
@@ -177,7 +177,7 @@ By making a contribution to this project, I certify that:
...
@@ -177,7 +177,7 @@ By making a contribution to this project, I certify that:
this project or the open source license(s) involved.
this project or the open source license(s) involved.
```
```
We require that every contribution to Triton Distributed is signed with
We require that every contribution to Dynamo is signed with
a Developer Certificate of Origin. Additionally, please use your real name.
a Developer Certificate of Origin. Additionally, please use your real name.
We do not accept anonymous contributors nor those utilizing pseudonyms.
We do not accept anonymous contributors nor those utilizing pseudonyms.
@@ -15,9 +15,9 @@ See the License for the specific language governing permissions and
...
@@ -15,9 +15,9 @@ See the License for the specific language governing permissions and
limitations under the License.
limitations under the License.
-->
-->
# TensorRT-LLM Integration with Triton Distributed
# TensorRT-LLM Integration with Dynamo
This example demonstrates how to use Triton Distributed to serve large language models with the tensorrt_llm engine, enabling efficient model serving with both monolithic and disaggregated deployment options.
This example demonstrates how to use Dynamo to serve large language models with the tensorrt_llm engine, enabling efficient model serving with both monolithic and disaggregated deployment options.
@@ -15,9 +15,9 @@ See the License for the specific language governing permissions and
...
@@ -15,9 +15,9 @@ See the License for the specific language governing permissions and
limitations under the License.
limitations under the License.
-->
-->
# vLLM Integration with Triton Distributed
# vLLM Integration with Dynamo
This example demonstrates how to use Triton Distributed to serve large language models with the vLLM engine, enabling efficient model serving with both monolithic and disaggregated deployment options.
This example demonstrates how to use Dynamo to serve large language models with the vLLM engine, enabling efficient model serving with both monolithic and disaggregated deployment options.
## Prerequisites
## Prerequisites
...
@@ -38,7 +38,7 @@ Start required services (etcd and NATS):
...
@@ -38,7 +38,7 @@ Start required services (etcd and NATS):
## Building the Environment
## Building the Environment
The example is designed to run in a containerized environment using Triton Distributed, vLLM, and associated dependencies. To build the container:
The example is designed to run in a containerized environment using Dynamo, vLLM, and associated dependencies. To build the container: