"lib/vscode:/vscode.git/clone" did not exist on "2d906fb4c9b4c237062c87c728032c06762e4386"
dynamo_cloud.md 7.15 KB
Newer Older
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
<!--
SPDX-FileCopyrightText: Copyright (c) 2025 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
SPDX-License-Identifier: Apache-2.0

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
-->

# 🚀 Dynamo Cloud Kubernetes Platform (Dynamo Deploy)

The Dynamo Cloud platform is a comprehensive solution for deploying and managing Dynamo inference graphs (also referred to as pipelines) in Kubernetes environments. It provides a streamlined experience for deploying, scaling, and monitoring your inference services. You can interface with Dynamo Cloud using the `deploy` subcommand available in the Dynamo CLI (e.g `dynamo deploy`)

22
## 📋 Overview
23
24
25
26
27
28
29
30
31
32
33

The Dynamo cloud platform consists of several key components:

- **Dynamo Operator**: A Kubernetes operator that manages the lifecycle of Dynamo inference graphs from build ➡️ deploy.
- **API Store**: Stores and manages service configurations and metadata related to Dynamo deployments. Needs to be exposed externally.
- **Custom Resources**: Kubernetes custom resources for defining and managing Dynamo services

These components work together to provide a seamless deployment experience, handling everything from containerization to scaling and monitoring.

![Dynamo Deploy](../../images/dynamo-deploy.png)

34
## 🎯 Prerequisites
35
36
37
38
39
40
41
42
43
44

Before getting started with the Dynamo cloud platform, ensure you have:

- A Kubernetes cluster (version 1.24 or later)
- [Earthly](https://earthly.dev/) installed for building components
- Docker installed and running
- Access to a container registry (e.g., Docker Hub, NVIDIA NGC, etc.)
- `kubectl` configured to access your cluster
- Helm installed (version 3.0 or later)

45
46
47
48
> [!TIP]
> Don't have a Kubernetes cluster? Check out our [Minikube setup guide](./minikube.md) to set up a local environment! 🏠

## 🏗️ Building Docker Images for Dynamo Cloud Components
49
50
51

The Dynamo cloud platform components need to be built and pushed to a container registry before deployment. You can build these components individually or all at once.

52
### ⚙️ Setting Up Environment Variables
53
54
55
56

First, set the required environment variables for building and pushing images:

```bash
57
58
# Set your container registry
export DOCKER_SERVER=<CONTAINER_REGISTRY>
59
# Set the image tag (e.g., latest, 0.0.1, etc.)
60
export IMAGE_TAG=<TAG>
61
62
63
```

Where:
64
65
- `<CONTAINER_REGISTRY>`: Your container registry (e.g., `nvcr.io`, `docker.io/<your-username>`, etc.)
- `<TAG>`: The version tag for your images (e.g., `latest`, `0.0.1`, `v1.0.0`)
66
67
68
69
70
71
72

> [!IMPORTANT]
> Make sure you're logged in to your container registry before pushing images:
> ```bash
> docker login <CONTAINER_REGISTRY>
> ```

73
### 🛠️ Building Components
74

75
You can build and push all platform components at once:
76
77

```bash
78
earthly --push +all-docker --DOCKER_SERVER=$DOCKER_SERVER --IMAGE_TAG=$IMAGE_TAG
79
80
```

81
## 🚀 Deploying the Dynamo Cloud Platform
82
83
84

Once you've built and pushed the components, you can deploy the platform to your Kubernetes cluster.

85
### 📋 Prerequisites
86

87
88
Before deploying Dynamo Cloud, ensure your Kubernetes cluster meets the following requirements:

89
#### 1. 💾 PVC Support with Default Storage Class
90
Dynamo Cloud requires Persistent Volume Claim (PVC) support with a default storage class. Verify your cluster configuration:
91
92

```bash
93
94
# Check if default storage class exists
kubectl get storageclass
95

96
97
98
99
100
# Expected output should show at least one storage class marked as (default)
# Example:
# NAME                 PROVISIONER             RECLAIMPOLICY   VOLUMEBINDINGMODE      ALLOWVOLUMEEXPANSION   AGE
# standard (default)   kubernetes.io/gce-pd    Delete          Immediate              true                   1d
```
101

102
### 📥 Installation
103
104
105
106
107
108
109
110

1. Set the required environment variables:
```bash
export DOCKER_USERNAME=<your-docker-username>
export DOCKER_PASSWORD=<your-docker-password>
export DOCKER_SERVER=<your-docker-server>
export IMAGE_TAG=<TAG>  # Use the same tag you used when building the images
export NAMESPACE=dynamo-cloud    # change this to whatever you want!
111
112
```

113
114
115
116
117
118
119
120
121
122
123
124
125
126
> [!NOTE]
> DOCKER_USERNAME and DOCKER_PASSWORD are optional and only needed if you want to pull docker images from a private registry.
> A docker image pull secret will be created automatically if these variables are set. Its name will be `docker-imagepullsecret` unless overridden by the `DOCKER_SECRET_NAME` environment variable.

The Dynamo Cloud Platform auto-generates docker images for pipelines and pushes them to a container registry.
By default, the platform will use the same container registry as the platform components (specified by `DOCKER_SERVER`).
However, you can specify a different container registry for pipelines by additionally setting the following environment variables:

```bash
export PIPELINES_DOCKER_SERVER=<your-docker-server>
export PIPELINES_DOCKER_USERNAME=<your-docker-username>
export PIPELINES_DOCKER_PASSWORD=<your-docker-password>
```

127
128
129
130
131
132
133
134
135
136
137
138
139
140
If you wish to expose your Dynamo Cloud Platform externally, you can setup the following environment variables:

```bash
# if using ingress
export INGRESS_ENABLED="true"
export INGRESS_CLASS="nginx" # or whatever ingress class you have configured

# if using istio
export ISTIO_ENABLED="true"
export ISTIO_GATEWAY="istio-system/istio-ingressgateway" # or whatever istio gateway you have configured
```

Running the installation script with `--interactive` will guide you through the process of exposing your Dynamo Cloud Platform externally if you don't want to set these environment variables manually.

141
2. [One-time Action] Create a new kubernetes namespace and set it as your default.
142
143

```bash
144
cd deploy/cloud/helm
145
146
kubectl create namespace $NAMESPACE
kubectl config set-context --current --namespace=$NAMESPACE
147
148
```

149
150
3. Deploy the helm chart using the deploy script:

151
152
153
154
```bash
./deploy.sh
```

155
156
157
158
159
160
if you wish to be guided through the deployment process, you can run the deploy script with the `--interactive` flag:

```bash
./deploy.sh --interactive
```

161
4. 🌐 **Expose Dynamo Cloud Externally**
162

163
164
165
166
167
168
169
170
171
> [!NOTE]
> The script will automatically display information about the endpoint you can use to access Dynamo Cloud. In our docs, we refer to this externally available endpoint as `DYNAMO_CLOUD`.

The simplest way to expose the `dynamo-store` service within the namespace externally is to use a port-forward:

```bash
kubectl port-forward svc/dynamo-store <local-port>:80 -n $NAMESPACE
export DYNAMO_CLOUD=http://localhost:<local-port>
```
172

173
## 🎯 Next Steps
174
175
176
177
178
179
180

After deploying the Dynamo cloud platform, you can:

1. Deploy your first inference graph using the [Dynamo CLI](operator_deployment.md)
2. Deploy Dynamo LLM pipelines to Kubernetes using the [Dynamo CLI](../../../examples/llm/README.md)!
3. Manage your deployments using the Dynamo CLI

181
For more detailed information about deploying inference graphs, see the [Dynamo Deploy Guide](README.md).