Unverified commit ba99d24a, authored by Yan Ru Pei, committed by GitHub

chore: disagg + router k8s recipe minor changes (#4050)


Signed-off-by: PeaBrane <yanrpei@gmail.com>
parent e3ba40e8
@@ -34,6 +34,7 @@ spec:
args:
- --model
- Qwen/Qwen3-0.6B
+ - --is-decode-worker
VllmPrefillWorker:
dynamoNamespace: vllm-disagg
envFromSecret: hf-token-secret
......
@@ -36,11 +36,12 @@ spec:
args:
- --model
- Qwen/Qwen3-0.6B
+ - --is-decode-worker
VllmPrefillWorker:
dynamoNamespace: vllm-v1-disagg-router
envFromSecret: hf-token-secret
componentType: worker
- replicas: 1
+ replicas: 2
resources:
limits:
gpu: "1"
......