quick_start_local.rst 1.3 KB
Newer Older
1
2
3
4
5
6
7
8
9
10
11
12
Get started with Dynamo locally in just a few commands:

**1. Install Dynamo**

.. code-block:: bash

   # Install uv (recommended Python package manager)
   curl -LsSf https://astral.sh/uv/install.sh | sh

   # Create virtual environment and install Dynamo
   uv venv venv
   source venv/bin/activate
13
14
   # Use prerelease flag to install RC versions of flashinfer and/or other dependencies
   uv pip install --prerelease=allow "ai-dynamo[sglang]"  # or [vllm], [trtllm]
15
16
17
18
19
20

**2. Start etcd/NATS**

.. code-block:: bash

   # Fetch and start etcd and NATS using Docker Compose
21
22
   VERSION=$(uv pip show ai-dynamo | grep Version | cut -d' ' -f2)
   curl -fsSL -o docker-compose.yml https://raw.githubusercontent.com/ai-dynamo/dynamo/refs/tags/v${VERSION}/deploy/docker-compose.yml
23
24
25
26
27
28
   docker compose -f docker-compose.yml up -d

**3. Run Dynamo**

.. code-block:: bash

29
   # Start the OpenAI compatible frontend (default port is 8000)
30
31
32
33
34
35
36
37
38
   python -m dynamo.frontend

   # In another terminal, start an SGLang worker
   python -m dynamo.sglang --model-path Qwen/Qwen3-0.6B

**4. Test your deployment**

.. code-block:: bash

39
   curl localhost:8000/v1/chat/completions \
40
41
42
43
44
45
     -H "Content-Type: application/json" \
     -d '{"model": "Qwen/Qwen3-0.6B",
          "messages": [{"role": "user", "content": "Hello!"}],
          "max_tokens": 50}'