Prometheus:# NOTE: this is set on Prometheus to ensure a service is created for the Prometheus component. This is a workaround and should be managed differently.
Prometheus:# NOTE: this is set on Prometheus to ensure a service is created for the Prometheus component. This is a workaround and should be managed differently.
@@ -138,16 +113,6 @@ Use the default pre-built image and inject custom configurations via PVC:
...
@@ -138,16 +113,6 @@ Use the default pre-built image and inject custom configurations via PVC:
> **Note**: All paths must start with `/data/` for security reasons. If you forget this prefix, the script will show a helpful error message with the correct path.
> **Note**: All paths must start with `/data/` for security reasons. If you forget this prefix, the script will show a helpful error message with the correct path.
3.**Set the config path for the profiling job:**
```bash
export DGD_CONFIG_FILE=/workspace/profiling_results/disagg.yaml # or your custom path
```
This approach allows you to:
- Customize DGD configurations without rebuilding container images
- Test different model configurations easily
- Version control your DGD configs alongside your code
> **Important**: For profiling, disagg configs should be run with Grove disabled by adding the annotation `nvidia.com/enable-grove: "false"` to avoid alpha Grove status issues.
> **Important**: For profiling, disagg configs should be run with Grove disabled by adding the annotation `nvidia.com/enable-grove: "false"` to avoid alpha Grove status issues.
**Step 2: Set SLA target**
**Step 2: Set SLA target**
...
@@ -173,13 +138,25 @@ spec:
...
@@ -173,13 +138,25 @@ spec:
-<vllm/sglang>
-<vllm/sglang>
```
```
**Step 3: Run profiling (required)**
**Step 3: Define the container image and config path**
1.**Set the container image:**
```bash
export DOCKER_IMAGE=nvcr.io/nvidia/ai-dynamo/vllm-runtime:0.4.1 # or any existing image tag (TODO: update to 0.5.0 upon release as profiling with 0.4.1 is broken)
```
3.**Set the config path for the profiling job:**
```bash
export DGD_CONFIG_FILE=/data/configs/disagg.yaml # should be the same path you set for --dest in Step 1