| `replicas` | int | Number of replicas to run | No | `1` |
**API Version:**`nvidia.com/v1alpha1`
**Scope:** Namespaced
#### Example
```yaml
apiVersion:nvidia.com/v1alpha1
kind:DynamoComponentDeployment
metadata:
name:test-41fa991-vllmworker
spec:
dynamoNamespace:dynamo
envs:
-name:DYN_DEPLOYMENT_CONFIG
value:'<longJSONconfig>'
externalServices:
PrefillWorker:
deploymentSelectorKey:dynamo
deploymentSelectorValue:PrefillWorker/dynamo
resources:
limits:
cpu:"10"
gpu:"1"
memory:20Gi
requests:
cpu:"500m"
gpu:"1"
memory:20Gi
serviceName:Frontend
```
## Installation
...
...
@@ -130,7 +75,7 @@ spec:
## GitOps Deployment with FluxCD
This section describes how to use FluxCD for GitOps-based deployment of Dynamo inference graphs. GitOps enables you to manage your Dynamo deployments declaratively using Git as the source of truth. We'll use the [aggregated vLLM example](https://github.com/ai-dynamo/dynamo/blob/main/examples/llm/README.md) to demonstrate the workflow.
This section describes how to use FluxCD for GitOps-based deployment of Dynamo inference graphs. GitOps enables you to manage your Dynamo deployments declaratively using Git as the source of truth. We'll use the [aggregated vLLM example](../../../components/backends/vllm/README.md) to demonstrate the workflow.
### Prerequisites
...
...
@@ -142,30 +87,13 @@ This section describes how to use FluxCD for GitOps-based deployment of Dynamo i
The GitOps workflow for Dynamo deployments consists of three main steps:
1. Build and push a pipeline to the Dynamo API store
1. Build and push the Dynamo Operator
2. Create and commit a DynamoGraphDeployment custom resource for initial deployment
3. Update the pipeline by building a new version and updating the CR for subsequent updates
3. Update the graph by building a new version and updating the CR for subsequent updates
### Step 1: Build and Push Pipeline
First, build and push your pipeline using the Dynamo CLI:
```bash
# Set your project root directory
export PROJECT_ROOT=$(pwd)
# Configure environment variables
export KUBE_NS=dynamo-cloud
export DYNAMO_CLOUD=http://localhost:8080 # If using port-forward
# OR
# export DYNAMO_CLOUD=https://dynamo-cloud.nvidia.com # If using Ingress/VirtualService