Unverified Commit 1a6eb099 authored by Richard Huo's avatar Richard Huo Committed by GitHub
Browse files

fix: fix the lmcache args in the agg_lmcache.sh script (#3325)


Signed-off-by: default avatarrichardhuo-nv <rihuo@nvidia.com>
parent 0ab6bc2b
...@@ -12,4 +12,4 @@ ENABLE_LMCACHE=1 \ ...@@ -12,4 +12,4 @@ ENABLE_LMCACHE=1 \
LMCACHE_CHUNK_SIZE=256 \ LMCACHE_CHUNK_SIZE=256 \
LMCACHE_LOCAL_CPU=True \ LMCACHE_LOCAL_CPU=True \
LMCACHE_MAX_LOCAL_CPU_SIZE=20 \ LMCACHE_MAX_LOCAL_CPU_SIZE=20 \
python -m dynamo.vllm --model Qwen/Qwen3-0.6B python -m dynamo.vllm --model Qwen/Qwen3-0.6B --connector lmcache
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment