@@ -55,11 +55,9 @@ cd $DYNAMO_HOME/components/backends/vllm
cd $DYNAMO_HOME/components/backends/vllm
./launch/disagg_kvbm_2p2d.sh
```
> [!NOTE]
> To tune the size of CPU or disk cache, set `DYN_KVBM_CPU_CACHE_GB` and `DYN_KVBM_DISK_CACHE_GB` accordingly. We only set `DYN_KVBM_CPU_CACHE_GB=20` in both scripts above.
> [!NOTE]
> Configure KVBM cache tiers (choose one of the following options):
> Configure or tune KVBM cache tiers (choose one of the following options):
> ```bash
> # Option 1: CPU cache only (GPU -> CPU offloading)
> # 4 means 4GB of pinned CPU memory would be used
...
...
@@ -86,7 +84,7 @@ cd $DYNAMO_HOME/components/backends/vllm
### Sample Request
```bash
# make a request to verify vLLM with KVBM is started up correctly
# Make a request to verify vLLM with KVBM is started up correctly
# NOTE: change the model name if served with a different one