@@ -37,7 +37,7 @@ Currently, these setups are only supported with the kGateway based Inference Gat
...
@@ -37,7 +37,7 @@ Currently, these setups are only supported with the kGateway based Inference Gat
### 1. Install Dynamo Platform ###
### 1. Install Dynamo Platform ###
[See Quickstart Guide](../../docs/kubernetes/README.md) to install Dynamo Kubernetes Platform.
[See Quickstart Guide](../../docs/pages/kubernetes/README.md) to install Dynamo Kubernetes Platform.
### 2. Deploy Inference Gateway ###
### 2. Deploy Inference Gateway ###
...
@@ -220,7 +220,7 @@ Common Vars for Routing Configuration:
...
@@ -220,7 +220,7 @@ Common Vars for Routing Configuration:
- Set `DYNAMO_OVERLAP_SCORE_WEIGHT` to weigh how heavily the score uses token overlap (predicted KV cache hits) versus other factors (load, historical hit rate). Higher weight biases toward reusing workers with similar cached prefixes.
- Set `DYNAMO_OVERLAP_SCORE_WEIGHT` to weigh how heavily the score uses token overlap (predicted KV cache hits) versus other factors (load, historical hit rate). Higher weight biases toward reusing workers with similar cached prefixes.
- Set `DYNAMO_ROUTER_TEMPERATURE` to soften or sharpen the selection curve when combining scores. Low temperature makes the router pick the top candidate deterministically; higher temperature lets lower-scoring workers through more often (exploration).
- Set `DYNAMO_ROUTER_TEMPERATURE` to soften or sharpen the selection curve when combining scores. Low temperature makes the router pick the top candidate deterministically; higher temperature lets lower-scoring workers through more often (exploration).
- Set `DYNAMO_USE_KV_EVENTS=false` if you want to disable the workers sending KV events while using kv-routing
- Set `DYNAMO_USE_KV_EVENTS=false` if you want to disable the workers sending KV events while using kv-routing
- See the [Router Guide](../../docs/components/router/router_guide.md) for details.
- See the [Router Guide](../../docs/pages/components/router/router-guide.md) for details.
For detailed documentation on Observability (Prometheus metrics, tracing, and logging), please refer to [docs/observability/](../../docs/observability/).
For detailed documentation on Observability (Prometheus metrics, tracing, and logging), please refer to [docs/pages/observability/](../../docs/pages/observability/).
For detailed documentation on collecting and visualizing metrics on Kubernetes, see [docs/kubernetes/observability/metrics.md](../../../docs/kubernetes/observability/metrics.md).
For detailed documentation on collecting and visualizing metrics on Kubernetes, see [docs/pages/kubernetes/observability/metrics.md](../../../docs/pages/kubernetes/observability/metrics.md).
For detailed documentation on collecting and visualizing logs on Kubernetes, see [docs/kubernetes/observability/logging.md](../../../../docs/kubernetes/observability/logging.md).
For detailed documentation on collecting and visualizing logs on Kubernetes, see [docs/pages/kubernetes/observability/logging.md](../../../../docs/pages/kubernetes/observability/logging.md).
@@ -6,7 +6,7 @@ This directory contains utilities and manifests for Dynamo benchmarking and prof
...
@@ -6,7 +6,7 @@ This directory contains utilities and manifests for Dynamo benchmarking and prof
**Before using these utilities, you must first set up Dynamo Kubernetes Platform following the main installation guide:**
**Before using these utilities, you must first set up Dynamo Kubernetes Platform following the main installation guide:**
👉 **[Follow the Dynamo Kubernetes Platform installation guide](/docs/kubernetes/installation_guide.md) to install the Dynamo Kubernetes Platform first.**
👉 **[Follow the Dynamo Kubernetes Platform installation guide](/docs/pages/kubernetes/installation-guide.md) to install the Dynamo Kubernetes Platform first.**
This includes:
This includes:
1. Installing the Dynamo CRDs
1. Installing the Dynamo CRDs
...
@@ -144,8 +144,8 @@ kubectl delete pod pvc-access-pod -n $NAMESPACE
...
@@ -144,8 +144,8 @@ kubectl delete pod pvc-access-pod -n $NAMESPACE
#### Next Steps
#### Next Steps
For complete benchmarking and profiling workflows:
For complete benchmarking and profiling workflows:
-**Benchmarking Guide**: See [docs/benchmarks/benchmarking.md](../../docs/benchmarks/benchmarking.md) for comparing DynamoGraphDeployments and external endpoints
-**Benchmarking Guide**: See [docs/pages/benchmarks/benchmarking.md](../../docs/pages/benchmarks/benchmarking.md) for comparing DynamoGraphDeployments and external endpoints
-**Pre-Deployment Profiling**: See [docs/components/profiler/profiler_guide.md](../../docs/components/profiler/profiler_guide.md) for optimizing configurations before deployment
-**Pre-Deployment Profiling**: See [docs/pages/components/profiler/profiler-guide.md](../../docs/pages/components/profiler/profiler-guide.md) for optimizing configurations before deployment