Unverified Commit 6c870810 authored by hhzhang16's avatar hhzhang16 Committed by GitHub
Browse files

fix: add --is-decode-worker flag (#5843)


Signed-off-by: default avatarHannah Zhang <hannahz@nvidia.com>
parent b88cb59b
......@@ -42,7 +42,7 @@ spec:
- /bin/sh
- -c
args:
- python3 -m dynamo.vllm --model meta-llama/Llama-3.1-70B-Instruct -tp 8
- python3 -m dynamo.vllm --model meta-llama/Llama-3.1-70B-Instruct -tp 8 --is-decode-worker
VllmPrefillWorker:
envFromSecret: hf-token-secret
componentType: worker
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment