@@ -300,7 +300,7 @@ The server-side benchmarking solution:
...
@@ -300,7 +300,7 @@ The server-side benchmarking solution:
## Prerequisites
## Prerequisites
1.**Kubernetes cluster** with NVIDIA GPUs and Dynamo namespace setup (see [Dynamo Cloud/Platform docs](../guides/dynamo_deploy/README.md))
1.**Kubernetes cluster** with NVIDIA GPUs and Dynamo namespace setup (see [Dynamo Cloud/Platform docs](/docs/kubernetes/README.md))
2.**Storage and service account** PersistentVolumeClaim and service account configured with appropriate permissions (see [deploy/utils README](../../deploy/utils/README.md))
2.**Storage and service account** PersistentVolumeClaim and service account configured with appropriate permissions (see [deploy/utils README](../../deploy/utils/README.md))
3.**Docker image** containing the Dynamo benchmarking tools
3.**Docker image** containing the Dynamo benchmarking tools
Quick deployment guide for the disaggregated planner with automatic scaling.
Quick deployment guide for the disaggregated planner with automatic scaling.
> [!NOTE]
> [!NOTE]
> For high-level architecture and concepts, see [SLA-based Planner](../../architecture/sla_planner.md).
> For high-level architecture and concepts, see [SLA-based Planner](/docs/architecture/sla_planner.md).
## Architecture Overview
## Architecture Overview
...
@@ -23,11 +23,11 @@ flowchart LR
...
@@ -23,11 +23,11 @@ flowchart LR
## Prerequisites
## Prerequisites
- Kubernetes cluster with GPU nodes
- Kubernetes cluster with GPU nodes
-[Pre-Deployment Profiling](../../benchmarks/pre_deployment_profiling.md) completed and its results saved to `dynamo-pvc` PVC.
-[Pre-Deployment Profiling](/docs/benchmarks/pre_deployment_profiling.md) completed and its results saved to `dynamo-pvc` PVC.
- Prefill and decode worker uses the best parallelization mapping suggested by the pre-deployment profiling script.
- Prefill and decode worker uses the best parallelization mapping suggested by the pre-deployment profiling script.
> [!NOTE]
> [!NOTE]
> **Important**: The profiling that occurs before Planner deployment requires additional Kubernetes manifests (ServiceAccount, Role, RoleBinding, PVC) that are not included in standard Dynamo deployments. Apply these manifests in the same namespace as `$NAMESPACE`. For a complete setup, start with the [Quick Start guide](../../../deploy/utils/README.md#quick-start), which provides a fully encapsulated deployment including all required manifests.
> **Important**: The profiling that occurs before Planner deployment requires additional Kubernetes manifests (ServiceAccount, Role, RoleBinding, PVC) that are not included in standard Dynamo deployments. Apply these manifests in the same namespace as `$NAMESPACE`. For a complete setup, start with the [Quick Start guide](/deploy/utils/README.md#quick-start), which provides a fully encapsulated deployment including all required manifests.