Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
dynamo
Commits
2c910b2e
Unverified
Commit
2c910b2e
authored
Nov 12, 2025
by
Ziqi Fan
Committed by
GitHub
Nov 12, 2025
Browse files
docs: update KVBM runbooks on metrics due to dynamo metrics change (#4266)
Signed-off-by:
Ziqi Fan
<
ziqif@nvidia.com
>
parent
744cda65
Changes
2
Show whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
6 additions
and
4 deletions
+6
-4
docs/kvbm/trtllm-setup.md
docs/kvbm/trtllm-setup.md
+3
-2
docs/kvbm/vllm-setup.md
docs/kvbm/vllm-setup.md
+3
-2
No files found.
docs/kvbm/trtllm-setup.md
View file @
2c910b2e
...
@@ -135,11 +135,12 @@ export DYN_KVBM_DISK_ZEROFILL_FALLBACK=true
...
@@ -135,11 +135,12 @@ export DYN_KVBM_DISK_ZEROFILL_FALLBACK=true
Follow below steps to enable metrics collection and view via Grafana dashboard:
Follow below steps to enable metrics collection and view via Grafana dashboard:
```
bash
```
bash
# Start the basic services (etcd & natsd), along with Prometheus and Grafana
# Start the basic services (etcd & natsd), along with Prometheus and Grafana
docker compose
-f
deploy/docker-
compose.yml
--profile
metrics
up
-d
docker compose
-f
deploy/docker-
observability.yml
up
-d
# Set env var DYN_KVBM_METRICS to true, when launch via dynamo
# Set env var DYN_KVBM_METRICS to true, when launch via dynamo
# Optionally set DYN_KVBM_METRICS_PORT to choose the /metrics port (default: 6880).
# Optionally set DYN_KVBM_METRICS_PORT to choose the /metrics port (default: 6880).
DYN_KVBM_METRICS
=
true
\
DYN_KVBM_METRICS
=
true
\
DYN_KVBM_CPU_CACHE_GB
=
20
\
python3
-m
dynamo.trtllm
\
python3
-m
dynamo.trtllm
\
--model-path
Qwen/Qwen3-0.6B
\
--model-path
Qwen/Qwen3-0.6B
\
--served-model-name
Qwen/Qwen3-0.6B
\
--served-model-name
Qwen/Qwen3-0.6B
\
...
@@ -149,7 +150,7 @@ python3 -m dynamo.trtllm \
...
@@ -149,7 +150,7 @@ python3 -m dynamo.trtllm \
sudo
ufw allow 6880/tcp
sudo
ufw allow 6880/tcp
```
```
View grafana metrics via http://localhost:300
1
(default login: dynamo/dynamo) and look for KVBM Dashboard
View grafana metrics via http://localhost:300
0
(default login: dynamo/dynamo) and look for KVBM Dashboard
## Benchmark KVBM
## Benchmark KVBM
...
...
docs/kvbm/vllm-setup.md
View file @
2c910b2e
...
@@ -127,12 +127,13 @@ export DYN_KVBM_DISK_ZEROFILL_FALLBACK=true
...
@@ -127,12 +127,13 @@ export DYN_KVBM_DISK_ZEROFILL_FALLBACK=true
Follow below steps to enable metrics collection and view via Grafana dashboard:
Follow below steps to enable metrics collection and view via Grafana dashboard:
```
bash
```
bash
# Start the basic services (etcd & natsd), along with Prometheus and Grafana
# Start the basic services (etcd & natsd), along with Prometheus and Grafana
docker compose
-f
deploy/docker-
compose.yml
--profile
metrics
up
-d
docker compose
-f
deploy/docker-
observability.yml
up
-d
# Set env var DYN_KVBM_METRICS to true, when launch via dynamo
# Set env var DYN_KVBM_METRICS to true, when launch via dynamo
# Optionally set DYN_KVBM_METRICS_PORT to choose the /metrics port (default: 6880).
# Optionally set DYN_KVBM_METRICS_PORT to choose the /metrics port (default: 6880).
# NOTE: update launch/disagg_kvbm.sh or launch/disagg_kvbm_2p2d.sh as needed
# NOTE: update launch/disagg_kvbm.sh or launch/disagg_kvbm_2p2d.sh as needed
DYN_KVBM_METRICS
=
true
\
DYN_KVBM_METRICS
=
true
\
DYN_KVBM_CPU_CACHE_GB
=
20
\
python
-m
dynamo.vllm
\
python
-m
dynamo.vllm
\
--model
Qwen/Qwen3-0.6B
\
--model
Qwen/Qwen3-0.6B
\
--enforce-eager
\
--enforce-eager
\
...
@@ -142,7 +143,7 @@ python -m dynamo.vllm \
...
@@ -142,7 +143,7 @@ python -m dynamo.vllm \
sudo
ufw allow 6880/tcp
sudo
ufw allow 6880/tcp
```
```
View grafana metrics via http://localhost:300
1
(default login: dynamo/dynamo) and look for KVBM Dashboard
View grafana metrics via http://localhost:300
0
(default login: dynamo/dynamo) and look for KVBM Dashboard
## Benchmark KVBM
## Benchmark KVBM
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment