Deploy_VLLM_example.md 3.19 KB
Newer Older
Anish's avatar
Anish committed
1
2
3
4
5
# Steps to deploy vLLM example

## 1. Deploy Dynamo Graph

```
6
cd dynamo/examples/backends/vllm/deploy
Anish's avatar
Anish committed
7
8
9
10
11
12
13
vim agg_router.yaml    #under metadata add namespace: dynamo-cloud and change image to your built base image
kubectl apply -f agg_router.yaml
```

Your pods should be running like below

```
14
ubuntu@ip-192-168-83-157:~/dynamo/examples/backends/vllm/deploy$ kubectl get pods -A
Anish's avatar
Anish committed
15
16
17
18
19
20
21
22
23
24
25
26
27
NAMESPACE      NAME                                                              READY   STATUS    RESTARTS   AGE
dynamo-cloud   dynamo-platform-dynamo-operator-controller-manager-86795c5f4j4k   2/2     Running   0          4h17m
dynamo-cloud   dynamo-platform-etcd-0                                            1/1     Running   0          4h17m
dynamo-cloud   dynamo-platform-nats-0                                            2/2     Running   0          4h17m
dynamo-cloud   dynamo-platform-nats-box-5dbf45c748-bxqj7                         1/1     Running   0          4h17m
dynamo-cloud   vllm-agg-router-frontend-79d599bb9c-fg97p                         1/1     Running   0          4m9s
dynamo-cloud   vllm-agg-router-vllmdecodeworker-787d575485-hrcjp                 1/1     Running   0          4m9s
dynamo-cloud   vllm-agg-router-vllmdecodeworker-787d575485-zkwdd                 1/1     Running   0          4m9s
```

Test the Deployment

```
28
29
kubectl port-forward deployment/vllm-agg-router-frontend 8000:8000 -n dynamo-cloud
curl localhost:8000/v1/chat/completions \
Anish's avatar
Anish committed
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
  -H "Content-Type: application/json" \
  -d '{
    "model": "Qwen/Qwen3-0.6B",
    "messages": [
    {
        "role": "user",
        "content": "In the heart of Eldoria, an ancient land of boundless magic and mysterious creatures, lies the long-forgotten city of Aeloria. Once a beacon of knowledge and power, Aeloria was buried beneath the shifting sands of time, lost to the world for centuries. You are an intrepid explorer, known for your unparalleled curiosity and courage, who has stumbled upon an ancient map hinting at ests that Aeloria holds a secret so profound that it has the potential to reshape the very fabric of reality. Your journey will take you through treacherous deserts, enchanted forests, and across perilous mountain ranges. Your Task: Character Background: Develop a detailed background for your character. Describe their motivations for seeking out Aeloria, their skills and weaknesses, and any personal connections to the ancient city or its legends. Are they driven by a quest for knowledge, a search for lost familt clue is hidden."
    }
    ],
    "stream": false,
    "max_tokens": 30
  }'
```

You should output something similar to below

```
{"id":"chatcmpl-bbe52b36-90ed-4479-9872-89e1aa412aa7","choices":[{"index":0,"message":{"content":"<think>\nOkay, so the user wants me to develop a character background for an explorer named someone in Eldoria. The character is part of the","refusal":null,"tool_calls":null,"role":"assistant","function_call":null,"audio":null},"finish_reason":"stop","logprobs":null}],"created":1753417848,"model":"Qwen/Qwen3-0.6B","service_tier":null,"system_fingerprint":null,"object":"chat.completion","usage":{"prompt_tokens":196,"completion_tokens":29,"total_tokens":225,"prompt_tokens_details":null,"completion_tokens_details":null}}
```