Unverified Commit c1907d12 authored by Julien Mancuso's avatar Julien Mancuso Committed by GitHub
Browse files

fix: fix broken links (#3186)

parent 4a718028
...@@ -300,7 +300,7 @@ The server-side benchmarking solution: ...@@ -300,7 +300,7 @@ The server-side benchmarking solution:
## Prerequisites ## Prerequisites
1. **Kubernetes cluster** with NVIDIA GPUs and Dynamo namespace setup (see [Dynamo Cloud/Platform docs](../guides/dynamo_deploy/README.md)) 1. **Kubernetes cluster** with NVIDIA GPUs and Dynamo namespace setup (see [Dynamo Cloud/Platform docs](/docs/kubernetes/README.md))
2. **Storage and service account** PersistentVolumeClaim and service account configured with appropriate permissions (see [deploy/utils README](../../deploy/utils/README.md)) 2. **Storage and service account** PersistentVolumeClaim and service account configured with appropriate permissions (see [deploy/utils README](../../deploy/utils/README.md))
3. **Docker image** containing the Dynamo benchmarking tools 3. **Docker image** containing the Dynamo benchmarking tools
......
...@@ -3,7 +3,7 @@ ...@@ -3,7 +3,7 @@
Quick deployment guide for the disaggregated planner with automatic scaling. Quick deployment guide for the disaggregated planner with automatic scaling.
> [!NOTE] > [!NOTE]
> For high-level architecture and concepts, see [SLA-based Planner](../../architecture/sla_planner.md). > For high-level architecture and concepts, see [SLA-based Planner](/docs/architecture/sla_planner.md).
## Architecture Overview ## Architecture Overview
...@@ -23,11 +23,11 @@ flowchart LR ...@@ -23,11 +23,11 @@ flowchart LR
## Prerequisites ## Prerequisites
- Kubernetes cluster with GPU nodes - Kubernetes cluster with GPU nodes
- [Pre-Deployment Profiling](../../benchmarks/pre_deployment_profiling.md) completed and its results saved to `dynamo-pvc` PVC. - [Pre-Deployment Profiling](/docs/benchmarks/pre_deployment_profiling.md) completed and its results saved to `dynamo-pvc` PVC.
- Prefill and decode worker uses the best parallelization mapping suggested by the pre-deployment profiling script. - Prefill and decode worker uses the best parallelization mapping suggested by the pre-deployment profiling script.
> [!NOTE] > [!NOTE]
> **Important**: The profiling that occurs before Planner deployment requires additional Kubernetes manifests (ServiceAccount, Role, RoleBinding, PVC) that are not included in standard Dynamo deployments. Apply these manifests in the same namespace as `$NAMESPACE`. For a complete setup, start with the [Quick Start guide](../../../deploy/utils/README.md#quick-start), which provides a fully encapsulated deployment including all required manifests. > **Important**: The profiling that occurs before Planner deployment requires additional Kubernetes manifests (ServiceAccount, Role, RoleBinding, PVC) that are not included in standard Dynamo deployments. Apply these manifests in the same namespace as `$NAMESPACE`. For a complete setup, start with the [Quick Start guide](/deploy/utils/README.md#quick-start), which provides a fully encapsulated deployment including all required manifests.
```bash ```bash
export NAMESPACE=your-namespace export NAMESPACE=your-namespace
``` ```
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment