@@ -47,7 +47,7 @@ kubectl version --client # Should show v1.24+
...
@@ -47,7 +47,7 @@ kubectl version --client # Should show v1.24+
helm version # Should show v3.0+
helm version # Should show v3.0+
```
```
For detailed installation instructions, see the [Prerequisites section](./installation_guide.md#prerequisites) in the Installation Guide.
For detailed installation instructions, see the [Prerequisites section](/docs/kubernetes/installation_guide.md#prerequisites) in the Installation Guide.
## Pre-deployment Checks
## Pre-deployment Checks
...
@@ -57,7 +57,7 @@ Before deploying the platform, run the pre-deployment checks to ensure the clust
...
@@ -57,7 +57,7 @@ Before deploying the platform, run the pre-deployment checks to ensure the clust
./deploy/pre-deployment/pre-deployment-check.sh
./deploy/pre-deployment/pre-deployment-check.sh
```
```
This validates kubectl connectivity, StorageClass configuration, and GPU availability. See [pre-deployment checks](../../deploy/pre-deployment/README.md) for more details.
This validates kubectl connectivity, StorageClass configuration, and GPU availability. See [pre-deployment checks](/deploy/pre-deployment/README.md) for more details.
## 1. Install Platform First
## 1. Install Platform First
...
@@ -82,7 +82,7 @@ If your cluster has namespace-restricted Dynamo operators, add this flag to step
...
@@ -82,7 +82,7 @@ If your cluster has namespace-restricted Dynamo operators, add this flag to step
For more details or customization options (including multinode deployments), see **[Installation Guide for Dynamo Kubernetes Platform](./installation_guide.md)**.
For more details or customization options (including multinode deployments), see **[Installation Guide for Dynamo Kubernetes Platform](/docs/kubernetes/installation_guide.md)**.
## 2. Choose Your Backend
## 2. Choose Your Backend
...
@@ -90,9 +90,9 @@ Each backend has deployment examples and configuration options:
...
@@ -90,9 +90,9 @@ Each backend has deployment examples and configuration options:
For SLA-based autoscaling, see [SLA Planner Quick Start Guide](../planner/sla_planner_quickstart.md).
For SLA-based autoscaling, see [SLA Planner Quick Start Guide](/docs/planner/sla_planner_quickstart.md).
## Understanding Dynamo's Custom Resources
## Understanding Dynamo's Custom Resources
...
@@ -147,15 +147,15 @@ A lower-level interface that defines your complete inference pipeline:
...
@@ -147,15 +147,15 @@ A lower-level interface that defines your complete inference pipeline:
Use this when you need fine-grained control or have already completed profiling.
Use this when you need fine-grained control or have already completed profiling.
Refer to the [API Reference and Documentation](./api_reference.md) for more details.
Refer to the [API Reference and Documentation](/docs/kubernetes/api_reference.md) for more details.
## 📖 API Reference & Documentation
## 📖 API Reference & Documentation
For detailed technical specifications of Dynamo's Kubernetes resources:
For detailed technical specifications of Dynamo's Kubernetes resources:
-**[API Reference](./api_reference.md)** - Complete CRD field specifications for all Dynamo resources
-**[API Reference](/docs/kubernetes/api_reference.md)** - Complete CRD field specifications for all Dynamo resources
-**[Create Deployment](./deployment/create_deployment.md)** - Step-by-step deployment creation with DynamoGraphDeployment
-**[Create Deployment](/docs/kubernetes/deployment/create_deployment.md)** - Step-by-step deployment creation with DynamoGraphDeployment
-**[Operator Guide](./dynamo_operator.md)** - Dynamo operator configuration and management
-**[Operator Guide](/docs/kubernetes/dynamo_operator.md)** - Dynamo operator configuration and management
### Choosing Your Architecture Pattern
### Choosing Your Architecture Pattern
...
@@ -170,7 +170,7 @@ When creating a deployment, select the architecture pattern that best fits your
...
@@ -170,7 +170,7 @@ When creating a deployment, select the architecture pattern that best fits your
You can run the Frontend on one machine (e.g., a CPU node) and workers on different machines (GPU nodes). The Frontend serves as a framework-agnostic HTTP entry point that:
You can run the Frontend on one machine (e.g., a CPU node) and workers on different machines (GPU nodes). The Frontend serves as a framework-agnostic HTTP entry point that:
@@ -26,9 +26,9 @@ This directory contains practical examples demonstrating how to deploy and use D
...
@@ -26,9 +26,9 @@ This directory contains practical examples demonstrating how to deploy and use D
Learn fundamental Dynamo concepts through these introductory examples:
Learn fundamental Dynamo concepts through these introductory examples:
-**[Quickstart](basics/quickstart/README.md)** - Simple aggregated serving example with vLLM backend
-**[Quickstart](/examples/basics/quickstart/README.md)** - Simple aggregated serving example with vLLM backend
-**[Disaggregated Serving](basics/disaggregated_serving/README.md)** - Prefill/decode separation for enhanced performance and scalability
-**[Disaggregated Serving](/examples/basics/disaggregated_serving/README.md)** - Prefill/decode separation for enhanced performance and scalability
-**[Multi-node](basics/multinode/README.md)** - Distributed inference across multiple nodes and GPUs
-**[Multi-node](/examples/basics/multinode/README.md)** - Distributed inference across multiple nodes and GPUs
## Framework Support
## Framework Support
...
@@ -53,11 +53,11 @@ Platform-specific deployment guides for production environments:
...
@@ -53,11 +53,11 @@ Platform-specific deployment guides for production environments:
Low-level runtime examples for developers using Python<>Rust bindings:
Low-level runtime examples for developers using Python<>Rust bindings:
-**[Hello World](custom_backend/hello_world/README.md)** - Minimal Dynamo runtime service demonstrating basic concepts
-**[Hello World](/examples/custom_backend/hello_world/README.md)** - Minimal Dynamo runtime service demonstrating basic concepts
## Getting Started
## Getting Started
1.**Choose your deployment pattern**: Start with the [Quickstart](basics/quickstart/README.md) for a simple local deployment, or explore [Disaggregated Serving](basics/disaggregated_serving/README.md) for advanced architectures.
1.**Choose your deployment pattern**: Start with the [Quickstart](/examples/basics/quickstart/README.md) for a simple local deployment, or explore [Disaggregated Serving](/examples/basics/disaggregated_serving/README.md) for advanced architectures.
2.**Set up prerequisites**: Most examples require etcd and NATS services. You can start them using:
2.**Set up prerequisites**: Most examples require etcd and NATS services. You can start them using:
```bash
```bash
...
@@ -83,4 +83,4 @@ If you're running Kubernetes/cloud deployment examples (EKS, AKS, GKE), you'll a
...
@@ -83,4 +83,4 @@ If you're running Kubernetes/cloud deployment examples (EKS, AKS, GKE), you'll a
See the [Kubernetes Installation Guide](../docs/kubernetes/installation_guide.md#prerequisites) for detailed setup instructions and pre-deployment checks.
See the [Kubernetes Installation Guide](/docs/kubernetes/installation_guide.md#prerequisites) for detailed setup instructions and pre-deployment checks.
Note that this a very simple degenerate example which does not demonstrate the standard Dynamo FrontEnd-Backend deployment. The hello-world client is not a web server, it is a one-off function which sends the predefined text "world,sun,moon,star" to the backend. The example is meant to show the HelloWorldWorker. As such you will only see the HelloWorldWorker pod in deployment. The client will run and exit and the pod will not be operational.
Note that this a very simple degenerate example which does not demonstrate the standard Dynamo FrontEnd-Backend deployment. The hello-world client is not a web server, it is a one-off function which sends the predefined text "world,sun,moon,star" to the backend. The example is meant to show the HelloWorldWorker. As such you will only see the HelloWorldWorker pod in deployment. The client will run and exit and the pod will not be operational.
Follow the [Quickstart Guide](../../../docs/kubernetes/README.md) to install Dynamo Kubernetes Platform.
Follow the [Quickstart Guide](/docs/kubernetes/README.md) to install Dynamo Kubernetes Platform.