@@ -110,6 +110,19 @@ For more information about validating the deployment, see the [vLLM README](../.
## Set Up Metrics Collection
### Enable NIXL Telemetry (Optional)
To enable NIXL telemetry metrics in addition to Dynamo metrics, set the following environment variables in your worker component:
spec:
services:
YourWorker:
envs:
- name: NIXL_TELEMETRY_ENABLE
value: "y"
NIXL telemetry is disabled by default. When enabled, NIXL metrics will be exposed on the port specified by `NIXL_TELEMETRY_PROMETHEUS_PORT` (19090 by default).
### Create PodMonitors
The Prometheus Operator uses PodMonitor resources to automatically discover and scrape metrics from pods. To enable this discovery, the Dynamo operator automatically creates PodMonitor resource and adds these labels to all pods: