dynamo_cloud.md 6.86 KB
Newer Older
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
<!--
SPDX-FileCopyrightText: Copyright (c) 2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
SPDX-License-Identifier: Apache-2.0

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
-->

# 🚀 Dynamo Cloud Kubernetes Platform (Dynamo Deploy)

The Dynamo Cloud platform is a comprehensive solution for deploying and managing Dynamo inference graphs (also referred to as pipelines) in Kubernetes environments. It provides a streamlined experience for deploying, scaling, and monitoring your inference services. You can interface with Dynamo Cloud using the `deploy` subcommand available in the Dynamo CLI (e.g `dynamo deploy`)

22
## 📋 Overview
23
24
25
26
27
28
29
30
31
32
33

The Dynamo cloud platform consists of several key components:

- **Dynamo Operator**: A Kubernetes operator that manages the lifecycle of Dynamo inference graphs from build ➡️ deploy.
- **API Store**: Stores and manages service configurations and metadata related to Dynamo deployments. Needs to be exposed externally.
- **Custom Resources**: Kubernetes custom resources for defining and managing Dynamo services

These components work together to provide a seamless deployment experience, handling everything from containerization to scaling and monitoring.

![Dynamo Deploy](../../images/dynamo-deploy.png)

34
## 🎯 Prerequisites
35
36
37
38
39
40
41
42
43
44

Before getting started with the Dynamo cloud platform, ensure you have:

- A Kubernetes cluster (version 1.24 or later)
- [Earthly](https://earthly.dev/) installed for building components
- Docker installed and running
- Access to a container registry (e.g., Docker Hub, NVIDIA NGC, etc.)
- `kubectl` configured to access your cluster
- Helm installed (version 3.0 or later)

45
46
47
48
> [!TIP]
> Don't have a Kubernetes cluster? Check out our [Minikube setup guide](./minikube.md) to set up a local environment! 🏠

## 🏗️ Building Docker Images for Dynamo Cloud Components
49
50
51

The Dynamo cloud platform components need to be built and pushed to a container registry before deployment. You can build these components individually or all at once.

52
### ⚙️ Setting Up Environment Variables
53
54
55
56

First, set the required environment variables for building and pushing images:

```bash
57
58
# Set your container registry
export DOCKER_SERVER=<CONTAINER_REGISTRY>
59
# Set the image tag (e.g., latest, 0.0.1, etc.)
60
export IMAGE_TAG=<TAG>
61
62
63
```

Where:
64
65
- `<CONTAINER_REGISTRY>`: Your container registry (e.g., `nvcr.io`, `docker.io/<your-username>`, etc.)
- `<TAG>`: The version tag for your images (e.g., `latest`, `0.0.1`, `v1.0.0`)
66
67
68
69
70
71
72

> [!IMPORTANT]
> Make sure you're logged in to your container registry before pushing images:
> ```bash
> docker login <CONTAINER_REGISTRY>
> ```

73
### 🛠️ Building Components
74

75
You can build and push all platform components at once:
76
77

```bash
78
earthly --push +all-docker --DOCKER_SERVER=$DOCKER_SERVER --IMAGE_TAG=$IMAGE_TAG
79
80
```

81
## 🚀 Deploying the Dynamo Cloud Platform
82
83
84

Once you've built and pushed the components, you can deploy the platform to your Kubernetes cluster.

85
### 📋 Prerequisites
86

87
88
Before deploying Dynamo Cloud, ensure your Kubernetes cluster meets the following requirements:

89
#### 1. 🛡️ Istio Installation
90
Dynamo Cloud requires Istio for service mesh capabilities. Verify Istio is installed and running:
91
92

```bash
93
94
# Check if Istio is installed
kubectl get pods -n istio-system
95

96
97
98
# Expected output should show running Istio pods
# istiod-* pods should be in Running state
```
99

100
#### 2. 💾 PVC Support with Default Storage Class
101
Dynamo Cloud requires Persistent Volume Claim (PVC) support with a default storage class. Verify your cluster configuration:
102
103

```bash
104
105
# Check if default storage class exists
kubectl get storageclass
106

107
108
109
110
111
# Expected output should show at least one storage class marked as (default)
# Example:
# NAME                 PROVISIONER             RECLAIMPOLICY   VOLUMEBINDINGMODE      ALLOWVOLUMEEXPANSION   AGE
# standard (default)   kubernetes.io/gce-pd    Delete          Immediate              true                   1d
```
112

113
### 📥 Installation
114
115
116
117
118
119
120
121

1. Set the required environment variables:
```bash
export DOCKER_USERNAME=<your-docker-username>
export DOCKER_PASSWORD=<your-docker-password>
export DOCKER_SERVER=<your-docker-server>
export IMAGE_TAG=<TAG>  # Use the same tag you used when building the images
export NAMESPACE=dynamo-cloud    # change this to whatever you want!
122
123
```

124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
> [!NOTE]
> DOCKER_USERNAME and DOCKER_PASSWORD are optional and only needed if you want to pull docker images from a private registry.
> A docker image pull secret will be created automatically if these variables are set. Its name will be `docker-imagepullsecret` unless overridden by the `DOCKER_SECRET_NAME` environment variable.

The Dynamo Cloud Platform auto-generates docker images for pipelines and pushes them to a container registry.
By default, the platform will use the same container registry as the platform components (specified by `DOCKER_SERVER`).
However, you can specify a different container registry for pipelines by additionally setting the following environment variables:

```bash
export PIPELINES_DOCKER_SERVER=<your-docker-server>
export PIPELINES_DOCKER_USERNAME=<your-docker-username>
export PIPELINES_DOCKER_PASSWORD=<your-docker-password>
```

2. [One-time Action] Create a new kubernetes namespace and set it as your default.
139
140

```bash
141
142
143
cd deploy/dynamo/helm
kubectl create namespace $NAMESPACE
kubectl config set-context --current --namespace=$NAMESPACE
144
145
```

146
147
3. Deploy the helm chart using the deploy script:

148
149
150
151
```bash
./deploy.sh
```

152
153
154
155
156
157
if you wish to be guided through the deployment process, you can run the deploy script with the `--interactive` flag:

```bash
./deploy.sh --interactive
```

158
4. 🌐 **Expose Dynamo Cloud Externally**
159

160
161
162
163
164
165
166
167
168
> [!NOTE]
> The script will automatically display information about the endpoint you can use to access Dynamo Cloud. In our docs, we refer to this externally available endpoint as `DYNAMO_CLOUD`.

The simplest way to expose the `dynamo-store` service within the namespace externally is to use a port-forward:

```bash
kubectl port-forward svc/dynamo-store <local-port>:80 -n $NAMESPACE
export DYNAMO_CLOUD=http://localhost:<local-port>
```
169

170
## 🎯 Next Steps
171
172
173
174
175
176
177

After deploying the Dynamo cloud platform, you can:

1. Deploy your first inference graph using the [Dynamo CLI](operator_deployment.md)
2. Deploy Dynamo LLM pipelines to Kubernetes using the [Dynamo CLI](../../../examples/llm/README.md)!
3. Manage your deployments using the Dynamo CLI

178
For more detailed information about deploying inference graphs, see the [Dynamo Deploy Guide](README.md).