For Kubernetes deployment, YAML manifests are provided in the `deploy/` directory. These define DynamoGraphDeployment resources for various configurations:
-`agg.yaml` - Aggregated serving
-`agg_router.yaml` - Aggregated serving with KV routing
-`disagg.yaml` - Disaggregated serving
-`disagg_router.yaml` - Disaggregated serving with KV routing
#### Prerequisites
-**Dynamo Cloud**: Follow the [Quickstart Guide](../../../docs/guides/dynamo_deploy/quickstart.md) to deploy Dynamo Cloud first.
-**Container Images**: The deployment files currently require access to `nvcr.io/nvidian/nim-llm-dev/trtllm-runtime`. If you don't have access, build and push your own image:
```bash
./container/build.sh --framework tensorrtllm
# Tag and push to your container registry
# Update the image references in the YAML files
```
-**Port Forwarding**: After deployment, forward the frontend service to access the API: