release-artifacts.md 29.3 KB
Newer Older
1
2
3
---
# SPDX-FileCopyrightText: Copyright (c) 2025-2026 NVIDIA CORPORATION & AFFILIATES. All rights reserved.
# SPDX-License-Identifier: Apache-2.0
4
title: Release Artifacts
5
6
7
8
---

This document provides a comprehensive inventory of all Dynamo release artifacts including container images, Python wheels, Helm charts, and Rust crates.

9
> **See also:** [Support Matrix](support-matrix.md) for hardware and platform compatibility | [Feature Matrix](feature-matrix.md) for backend feature support
10
11
12

Release history in this document begins at v0.6.0.

13
## Current Release: Dynamo v1.0.1
14

15
16
- **GitHub Release:** [v1.0.1](https://github.com/ai-dynamo/dynamo/releases/tag/v1.0.1)
- **Docs:** [v1.0.1](https://docs.dynamo.nvidia.com/dynamo)
17
18
- **NGC Collection:** [ai-dynamo](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/collections/ai-dynamo)

19
20
> **Experimental:** [v1.1.0-dev.1](#pre-release-artifacts) is available as an experimental preview. See [Pre-Release Artifacts](#pre-release-artifacts) for container images, wheels, and Helm charts.

21
22
23
24
### Container Images

| Image:Tag | Description | Backend | CUDA | Arch | NGC | Notes |
|-----------|-------------|---------|------|------|-----|-------|
25
26
27
28
29
30
31
32
33
34
| `vllm-runtime:1.0.1` | Runtime container for vLLM backend | vLLM `v0.16.0` | `v12.9` | AMD64/ARM64 | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/vllm-runtime?version=1.0.1) | |
| `vllm-runtime:1.0.1-cuda13` | Runtime container for vLLM backend (CUDA 13) | vLLM `v0.16.0` | `v13.0` | AMD64/ARM64* | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/vllm-runtime?version=1.0.1-cuda13) | |
| `vllm-runtime:1.0.1-efa-amd64` | Runtime container for vLLM with AWS EFA | vLLM `v0.16.0` | `v12.9` | AMD64 | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/vllm-runtime?version=1.0.1-efa-amd64) | Experimental |
| `sglang-runtime:1.0.1` | Runtime container for SGLang backend | SGLang `v0.5.9` | `v12.9` | AMD64/ARM64 | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/sglang-runtime?version=1.0.1) | |
| `sglang-runtime:1.0.1-cuda13` | Runtime container for SGLang backend (CUDA 13) | SGLang `v0.5.9` | `v13.0` | AMD64/ARM64* | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/sglang-runtime?version=1.0.1-cuda13) | |
| `tensorrtllm-runtime:1.0.1` | Runtime container for TensorRT-LLM backend | TRT-LLM `v1.3.0rc5.post1` | `v13.1` | AMD64/ARM64 | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/tensorrtllm-runtime?version=1.0.1) | |
| `tensorrtllm-runtime:1.0.1-efa-amd64` | Runtime container for TensorRT-LLM with AWS EFA | TRT-LLM `v1.3.0rc5.post1` | `v13.1` | AMD64 | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/tensorrtllm-runtime?version=1.0.1-efa-amd64) | Experimental |
| `dynamo-frontend:1.0.1` | API gateway with Endpoint Prediction Protocol (EPP) | — | — | AMD64/ARM64 | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/dynamo-frontend?version=1.0.1) | |
| `kubernetes-operator:1.0.1` | Kubernetes operator for Dynamo deployments | — | — | AMD64/ARM64 | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/kubernetes-operator?version=1.0.1) | |
| `snapshot-agent:1.0.1` | Snapshot agent for fast GPU worker recovery via CRIU | — | — | AMD64/ARM64 | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/snapshot-agent?version=1.0.1) | Preview |
35
36
37
38
39
40
41
42
43

\* Multimodal inference on CUDA 13 images: works on AMD64 for all backends; works on ARM64 only for TensorRT-LLM (`vllm-runtime:*-cuda13` and `sglang-runtime:*-cuda13` do not support multimodality on ARM64).

### Python Wheels

We recommend using the TensorRT-LLM NGC container instead of the `ai-dynamo[trtllm]` wheel. See the [NGC container collection](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/collections/ai-dynamo) for supported images.

| Package | Description | Python | Platform | PyPI |
|---------|-------------|--------|----------|------|
44
45
46
| `ai-dynamo==1.0.1` | Main package with backend integrations (vLLM, SGLang, TRT-LLM) | `3.10``3.12` | Linux (glibc `v2.28+`) | [link](https://pypi.org/project/ai-dynamo/1.0.1/) |
| `ai-dynamo-runtime==1.0.1` | Core Python bindings for Dynamo runtime | `3.10``3.12` | Linux (glibc `v2.28+`) | [link](https://pypi.org/project/ai-dynamo-runtime/1.0.1/) |
| `kvbm==1.0.1` | KV Block Manager for disaggregated KV cache | `3.12` | Linux (glibc `v2.28+`) | [link](https://pypi.org/project/kvbm/1.0.1/) |
47
48
49
50
51

### Helm Charts

| Chart | Description | NGC |
|-------|-------------|-----|
52
53
| `dynamo-platform-1.0.1` | Platform services (etcd, NATS) and Dynamo Operator for Dynamo cluster | [link](https://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/dynamo-platform-1.0.1.tgz) |
| `snapshot-1.0.1` | Snapshot DaemonSet for fast GPU worker recovery | [link](https://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/snapshot-1.0.1.tgz) |
54

55
56
> [!NOTE]
> The `dynamo-crds` Helm chart is deprecated as of v1.0.0; CRDs are now managed by the Dynamo Operator. The `dynamo-graph` Helm chart is deprecated as of v0.9.0.
57
58
59
60
61

### Rust Crates

| Crate | Description | MSRV (Rust) | crates.io |
|-------|-------------|-------------|-----------|
62
63
64
65
66
67
68
69
70
| `dynamo-runtime@1.0.1` | Core distributed runtime library | `v1.82` | [link](https://crates.io/crates/dynamo-runtime/1.0.1) |
| `dynamo-llm@1.0.1` | LLM inference engine | `v1.82` | [link](https://crates.io/crates/dynamo-llm/1.0.1) |
| `dynamo-async-openai@1.0.1` | Async OpenAI-compatible API client | `v1.82` | [link](https://crates.io/crates/dynamo-async-openai/1.0.1) |
| `dynamo-parsers@1.0.1` | Protocol parsers (SSE, JSON streaming) | `v1.82` | [link](https://crates.io/crates/dynamo-parsers/1.0.1) |
| `dynamo-memory@1.0.1` | Memory management utilities | `v1.82` | [link](https://crates.io/crates/dynamo-memory/1.0.1) |
| `dynamo-config@1.0.1` | Configuration management | `v1.82` | [link](https://crates.io/crates/dynamo-config/1.0.1) |
| `dynamo-tokens@1.0.1` | Tokenizer bindings for LLM inference | `v1.82` | [link](https://crates.io/crates/dynamo-tokens/1.0.1) |
| `dynamo-mocker@1.0.1` | Inference engine simulator for benchmarking | `v1.82` | [link](https://crates.io/crates/dynamo-mocker/1.0.1) |
| `dynamo-kv-router@1.0.1` | KV-aware request routing library | `v1.82` | [link](https://crates.io/crates/dynamo-kv-router/1.0.1) |
71
72
73
74
75

## Quick Install Commands

### Container Images (NGC)

76
77
> [!TIP]
> For detailed run instructions, see the backend-specific guides: [vLLM](../backends/vllm/README.md) | [SGLang](../backends/sglang/README.md) | [TensorRT-LLM](../backends/trtllm/README.md)
78
79
80

```bash
# Runtime containers
81
82
83
docker pull nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.1
docker pull nvcr.io/nvidia/ai-dynamo/sglang-runtime:1.0.1
docker pull nvcr.io/nvidia/ai-dynamo/tensorrtllm-runtime:1.0.1
84
85

# CUDA 13 variants
86
87
docker pull nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.1-cuda13
docker pull nvcr.io/nvidia/ai-dynamo/sglang-runtime:1.0.1-cuda13
88

89
# EFA variants (AWS, AMD64 only, experimental)
90
91
docker pull nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.1-efa-amd64
docker pull nvcr.io/nvidia/ai-dynamo/tensorrtllm-runtime:1.0.1-efa-amd64
92
93

# Infrastructure containers
94
95
96
docker pull nvcr.io/nvidia/ai-dynamo/dynamo-frontend:1.0.1
docker pull nvcr.io/nvidia/ai-dynamo/kubernetes-operator:1.0.1
docker pull nvcr.io/nvidia/ai-dynamo/snapshot-agent:1.0.1
97
98
99
100
```

### Python Wheels (PyPI)

101
> [!TIP]
102
> For detailed installation instructions, see the [Local Quick Start](https://github.com/ai-dynamo/dynamo#local-quick-start) in the README.
103
104
105

```bash
# Install Dynamo with a specific backend (Recommended)
106
107
uv pip install "ai-dynamo[vllm]==1.0.1"
uv pip install "ai-dynamo[sglang]==1.0.1"
108
# TensorRT-LLM requires the NVIDIA PyPI index and pip
109
pip install --pre --extra-index-url https://pypi.nvidia.com "ai-dynamo[trtllm]==1.0.1"
110
111

# Install Dynamo core only
112
uv pip install ai-dynamo==1.0.1
113
114

# Install standalone KVBM (Python 3.12 only)
115
uv pip install kvbm==1.0.1
116
117
118
119
```

### Helm Charts (NGC)

120
> [!TIP]
121
> For Kubernetes deployment instructions, see the [Kubernetes Installation Guide](../kubernetes/installation-guide.md).
122
123

```bash
124
125
helm install dynamo-platform oci://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/dynamo-platform --version 1.0.1
helm install snapshot oci://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/snapshot --version 1.0.1
126
127
128
129
```

### Rust Crates (crates.io)

130
> [!TIP]
131
> For API documentation, see each crate on [docs.rs](https://docs.rs/). To build Dynamo from source, see [Building from Source](https://github.com/ai-dynamo/dynamo#building-from-source).
132
133

```bash
134
135
136
137
138
139
140
141
142
cargo add dynamo-runtime@1.0.1
cargo add dynamo-llm@1.0.1
cargo add dynamo-async-openai@1.0.1
cargo add dynamo-parsers@1.0.1
cargo add dynamo-memory@1.0.1
cargo add dynamo-config@1.0.1
cargo add dynamo-tokens@1.0.1
cargo add dynamo-mocker@1.0.1
cargo add dynamo-kv-router@1.0.1
143
144
```

145
**CUDA and Driver Requirements:** For detailed CUDA toolkit versions and minimum driver requirements for each container image, see the [Support Matrix](support-matrix.md#cuda-and-driver-requirements).
146
147
148

## Known Issues

149
For a complete list of known issues, refer to the release notes for each version:
150
- [v1.0.1 Release Notes](https://github.com/ai-dynamo/dynamo/releases/tag/v1.0.1)
151
- [v1.0.0 Release Notes](https://github.com/ai-dynamo/dynamo/releases/tag/v1.0.0)
152
- [v0.9.0 Release Notes](https://github.com/ai-dynamo/dynamo/releases/tag/v0.9.0)
153
154
155
156
157
158
- [v0.8.1 Release Notes](https://github.com/ai-dynamo/dynamo/releases/tag/v0.8.1)

### Known Artifact Issues

| Version | Artifact | Issue | Status |
|---------|----------|-------|--------|
159
| v0.9.0 | `dynamo-platform-0.9.0` | Helm chart sets operator image to `0.7.1` instead of `0.9.0`. | Fixed in v0.9.0.post1 |
160
161
162
163
164
165
166
167
| v0.8.1 | `vllm-runtime:0.8.1-cuda13` | Container fails to launch. | Known issue |
| v0.8.1 | `sglang-runtime:0.8.1-cuda13`, `vllm-runtime:0.8.1-cuda13` | Multimodality not expected to work on ARM64. Works on AMD64. | Known limitation |
| v0.8.0 | `sglang-runtime:0.8.0-cuda13` | CuDNN installation issue caused PyTorch `v2.9.1` compatibility problems with `nn.Conv3d`, resulting in performance degradation and excessive memory usage in multimodal workloads. | Fixed in v0.8.1 ([#5461](https://github.com/ai-dynamo/dynamo/pull/5461)) |

---

## Release History

168
169
- **v1.1.0-dev.1** *(experimental)*: Preview release. SGLang `v0.5.9`, TRT-LLM `v1.3.0rc5.post1`, vLLM `v0.17.1`, NIXL `v0.10.1`. Not recommended for production use.
- **v1.0.1**: Patch release. Same backend versions as v1.0.0: SGLang `v0.5.9`, TRT-LLM `v1.3.0rc5.post1`, vLLM `v0.16.0`, NIXL `v0.10.1`.
170
- **v1.0.0**: First major release. SGLang `v0.5.9`, TRT-LLM `v1.3.0rc5.post1` (CUDA 13.1), vLLM `v0.16.0`, NIXL `v0.10.1`. New `snapshot-agent` container and `snapshot` Helm chart (Preview). New EFA container variants for vLLM and TRT-LLM (Experimental, AMD64 only). New `dynamo-mocker` and `dynamo-kv-router` Rust crates. Deprecated `dynamo-crds` Helm chart (CRDs now managed by the Operator). `v1alpha1` CRDs deprecated.
171
- **v0.9.1**: Updated TRT-LLM to `v1.3.0rc3`. All other backend versions unchanged from v0.9.0.
172
173
174
- **v0.9.0.post1**: Fixed `dynamo-platform` Helm chart operator image tag (Helm chart only, NGC)
- **v0.9.0**: Updated vLLM to `v0.14.1`, SGLang to `v0.5.8`, TRT-LLM to `v1.3.0rc1`, NIXL to `v0.9.0`. New `dynamo-tokens` Rust crate. Deprecated `dynamo-graph` Helm chart.
- **v0.8.1.post1/.post2/.post3 Patches**: Experimental patch releases updating TRT-LLM only (PyPI wheels and TRT-LLM container). No other artifacts changed.
175
- **Standalone Frontend Container**: `dynamo-frontend` added in v0.8.0
176
- **EFA Runtimes**: Experimental AWS EFA variants for vLLM and TRT-LLM (AMD64 only) in v1.0.0
177
- **CUDA 13 Runtimes**: Experimental CUDA 13 runtime for SGLang and vLLM in v0.8.0
178
179
180
181
- **New Rust Crates**: `dynamo-memory` and `dynamo-config` added in v0.8.0

### GitHub Releases

182
183
184
185
186
| Version | Release Date | GitHub | Docs | Notes |
|---------|--------------|--------|------|-------|
| `v1.1.0-dev.1` | Mar 17, 2026 | [Branch](https://github.com/ai-dynamo/dynamo/tree/release/1.1.0-dev.1) | — | Experimental |
| `v1.0.1` | Mar 16, 2026 | [Release](https://github.com/ai-dynamo/dynamo/releases/tag/v1.0.1) | [Docs](https://docs.dynamo.nvidia.com/dynamo) | |
| `v1.0.0` | Mar 12, 2026 | [Release](https://github.com/ai-dynamo/dynamo/releases/tag/v1.0.0) | [Docs](https://docs.dynamo.nvidia.com/dynamo) | |
187
188
| `v0.9.1` | Mar 4, 2026 | [Release](https://github.com/ai-dynamo/dynamo/releases/tag/v0.9.1) | [Docs](https://docs.dynamo.nvidia.com/dynamo) |
| `v0.9.0` | Feb 11, 2026 | [Release](https://github.com/ai-dynamo/dynamo/releases/tag/v0.9.0) | [Docs](https://docs.dynamo.nvidia.com/dynamo/v-0-9-0/) |
189
190
191
192
193
194
| `v0.8.1` | Jan 23, 2026 | [Release](https://github.com/ai-dynamo/dynamo/releases/tag/v0.8.1) | [Docs](https://docs.nvidia.com/dynamo/v-0-8-1/) |
| `v0.8.0` | Jan 15, 2026 | [Release](https://github.com/ai-dynamo/dynamo/releases/tag/v0.8.0) | [Docs](https://docs.nvidia.com/dynamo/v-0-8-0/) |
| `v0.7.1` | Dec 15, 2025 | [Release](https://github.com/ai-dynamo/dynamo/releases/tag/v0.7.1) | [Docs](https://docs.nvidia.com/dynamo/v-0-7-1/) |
| `v0.7.0` | Nov 26, 2025 | [Release](https://github.com/ai-dynamo/dynamo/releases/tag/v0.7.0) | [Docs](https://docs.nvidia.com/dynamo/v-0-7-0/) |
| `v0.6.1` | Nov 6, 2025 | [Release](https://github.com/ai-dynamo/dynamo/releases/tag/v0.6.1) | — |
| `v0.6.0` | Oct 28, 2025 | [Release](https://github.com/ai-dynamo/dynamo/releases/tag/v0.6.0) | — |
195
196
197

### Container Images

198
199
200
201
> **NGC Collection:** [ai-dynamo](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/collections/ai-dynamo)
>
> To access a specific version, append `?version=TAG` to the container URL:
> `https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/{container}?version={tag}`
202
203
204
205
206

#### vllm-runtime

| Image:Tag | vLLM | Arch | CUDA | Notes |
|-----------|------|------|------|-------|
207
208
209
| `vllm-runtime:1.0.1` | `v0.16.0` | AMD64/ARM64 | `v12.9` | |
| `vllm-runtime:1.0.1-cuda13` | `v0.16.0` | AMD64/ARM64* | `v13.0` | |
| `vllm-runtime:1.0.1-efa-amd64` | `v0.16.0` | AMD64 | `v12.9` | Experimental |
210
211
212
| `vllm-runtime:1.0.0` | `v0.16.0` | AMD64/ARM64 | `v12.9` | |
| `vllm-runtime:1.0.0-cuda13` | `v0.16.0` | AMD64/ARM64* | `v13.0` | |
| `vllm-runtime:1.0.0-efa-amd64` | `v0.16.0` | AMD64 | `v12.9` | Experimental |
213
214
| `vllm-runtime:0.9.1` | `v0.14.1` | AMD64/ARM64 | `v12.9` | |
| `vllm-runtime:0.9.1-cuda13` | `v0.14.1` | AMD64/ARM64* | `v13.0` | Experimental |
215
216
| `vllm-runtime:0.9.0` | `v0.14.1` | AMD64/ARM64 | `v12.9` | |
| `vllm-runtime:0.9.0-cuda13` | `v0.14.1` | AMD64/ARM64* | `v13.0` | Experimental |
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
| `vllm-runtime:0.8.1` | `v0.12.0` | AMD64/ARM64 | `v12.9` | |
| `vllm-runtime:0.8.0` | `v0.12.0` | AMD64/ARM64 | `v12.9` | |
| `vllm-runtime:0.8.0-cuda13` | `v0.12.0` | AMD64/ARM64 | `v13.0` | Experimental |
| `vllm-runtime:0.7.0.post2` | `v0.11.2` | AMD64/ARM64 | `v12.8` | Patch |
| `vllm-runtime:0.7.1` | `v0.11.0` | AMD64/ARM64 | `v12.8` | |
| `vllm-runtime:0.7.0.post1` | `v0.11.0` | AMD64/ARM64 | `v12.8` | Patch |
| `vllm-runtime:0.7.0` | `v0.11.0` | AMD64/ARM64 | `v12.8` | |
| `vllm-runtime:0.6.1.post1` | `v0.11.0` | AMD64/ARM64 | `v12.8` | Patch |
| `vllm-runtime:0.6.1` | `v0.11.0` | AMD64/ARM64 | `v12.8` | |
| `vllm-runtime:0.6.0` | `v0.11.0` | AMD64 | `v12.8` | |

#### sglang-runtime

| Image:Tag | SGLang | Arch | CUDA | Notes |
|-----------|--------|------|------|-------|
232
233
| `sglang-runtime:1.0.1` | `v0.5.9` | AMD64/ARM64 | `v12.9` | |
| `sglang-runtime:1.0.1-cuda13` | `v0.5.9` | AMD64/ARM64* | `v13.0` | |
234
235
| `sglang-runtime:1.0.0` | `v0.5.9` | AMD64/ARM64 | `v12.9` | |
| `sglang-runtime:1.0.0-cuda13` | `v0.5.9` | AMD64/ARM64* | `v13.0` | |
236
237
| `sglang-runtime:0.9.1` | `v0.5.8` | AMD64/ARM64 | `v12.9` | |
| `sglang-runtime:0.9.1-cuda13` | `v0.5.8` | AMD64/ARM64* | `v13.0` | Experimental |
238
239
| `sglang-runtime:0.9.0` | `v0.5.8` | AMD64/ARM64 | `v12.9` | |
| `sglang-runtime:0.9.0-cuda13` | `v0.5.8` | AMD64/ARM64* | `v13.0` | Experimental |
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
| `sglang-runtime:0.8.1` | `v0.5.6.post2` | AMD64/ARM64 | `v12.9` | |
| `sglang-runtime:0.8.1-cuda13` | `v0.5.6.post2` | AMD64/ARM64 | `v13.0` | Experimental |
| `sglang-runtime:0.8.0` | `v0.5.6.post2` | AMD64/ARM64 | `v12.9` | |
| `sglang-runtime:0.8.0-cuda13` | `v0.5.6.post2` | AMD64/ARM64 | `v13.0` | Experimental |
| `sglang-runtime:0.7.1` | `v0.5.4.post3` | AMD64/ARM64 | `v12.9` | |
| `sglang-runtime:0.7.0.post1` | `v0.5.4.post3` | AMD64/ARM64 | `v12.9` | Patch |
| `sglang-runtime:0.7.0` | `v0.5.4.post3` | AMD64/ARM64 | `v12.9` | |
| `sglang-runtime:0.6.1.post1` | `v0.5.3.post2` | AMD64/ARM64 | `v12.9` | Patch |
| `sglang-runtime:0.6.1` | `v0.5.3.post2` | AMD64/ARM64 | `v12.9` | |
| `sglang-runtime:0.6.0` | `v0.5.3.post2` | AMD64 | `v12.8` | |

#### tensorrtllm-runtime

| Image:Tag | TRT-LLM | Arch | CUDA | Notes |
|-----------|---------|------|------|-------|
255
256
| `tensorrtllm-runtime:1.0.1` | `v1.3.0rc5.post1` | AMD64/ARM64 | `v13.1` | |
| `tensorrtllm-runtime:1.0.1-efa-amd64` | `v1.3.0rc5.post1` | AMD64 | `v13.1` | Experimental |
257
258
| `tensorrtllm-runtime:1.0.0` | `v1.3.0rc5.post1` | AMD64/ARM64 | `v13.1` | |
| `tensorrtllm-runtime:1.0.0-efa-amd64` | `v1.3.0rc5.post1` | AMD64 | `v13.1` | Experimental |
259
| `tensorrtllm-runtime:0.9.1` | `v1.3.0rc3` | AMD64/ARM64 | `v13.0` | |
260
261
| `tensorrtllm-runtime:0.9.0` | `v1.3.0rc1` | AMD64/ARM64 | `v13.0` | |
| `tensorrtllm-runtime:0.8.1.post3` | `v1.2.0rc6.post3` | AMD64/ARM64 | `v13.0` | Patch |
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
| `tensorrtllm-runtime:0.8.1.post1` | `v1.2.0rc6.post2` | AMD64/ARM64 | `v13.0` | Patch |
| `tensorrtllm-runtime:0.8.1` | `v1.2.0rc6.post1` | AMD64/ARM64 | `v13.0` | |
| `tensorrtllm-runtime:0.8.0` | `v1.2.0rc6.post1` | AMD64/ARM64 | `v13.0` | |
| `tensorrtllm-runtime:0.7.0.post2` | `v1.2.0rc2` | AMD64/ARM64 | `v13.0` | Patch |
| `tensorrtllm-runtime:0.7.1` | `v1.2.0rc3` | AMD64/ARM64 | `v13.0` | |
| `tensorrtllm-runtime:0.7.0.post1` | `v1.2.0rc3` | AMD64/ARM64 | `v13.0` | Patch |
| `tensorrtllm-runtime:0.7.0` | `v1.2.0rc2` | AMD64/ARM64 | `v13.0` | |
| `tensorrtllm-runtime:0.6.1-cuda13` | `v1.2.0rc1` | AMD64/ARM64 | `v13.0` | Experimental |
| `tensorrtllm-runtime:0.6.1.post1` | `v1.1.0rc5` | AMD64/ARM64 | `v12.9` | Patch |
| `tensorrtllm-runtime:0.6.1` | `v1.1.0rc5` | AMD64/ARM64 | `v12.9` | |
| `tensorrtllm-runtime:0.6.0` | `v1.1.0rc5` | AMD64/ARM64 | `v12.9` | |

#### dynamo-frontend

| Image:Tag | Arch | Notes |
|-----------|------|-------|
278
| `dynamo-frontend:1.0.1` | AMD64/ARM64 | |
279
| `dynamo-frontend:1.0.0` | AMD64/ARM64 | |
280
| `dynamo-frontend:0.9.1` | AMD64/ARM64 | |
281
| `dynamo-frontend:0.9.0` | AMD64/ARM64 | |
282
283
284
285
286
287
288
| `dynamo-frontend:0.8.1` | AMD64/ARM64 | |
| `dynamo-frontend:0.8.0` | AMD64/ARM64 | Initial |

#### kubernetes-operator

| Image:Tag | Arch | Notes |
|-----------|------|-------|
289
| `kubernetes-operator:1.0.1` | AMD64/ARM64 | |
290
| `kubernetes-operator:1.0.0` | AMD64/ARM64 | |
291
| `kubernetes-operator:0.9.1` | AMD64/ARM64 | |
292
| `kubernetes-operator:0.9.0` | AMD64/ARM64 | |
293
294
295
296
297
298
299
300
| `kubernetes-operator:0.8.1` | AMD64/ARM64 | |
| `kubernetes-operator:0.8.0` | AMD64/ARM64 | |
| `kubernetes-operator:0.7.1` | AMD64/ARM64 | |
| `kubernetes-operator:0.7.0.post1` | AMD64/ARM64 | Patch |
| `kubernetes-operator:0.7.0` | AMD64/ARM64 | |
| `kubernetes-operator:0.6.1` | AMD64/ARM64 | |
| `kubernetes-operator:0.6.0` | AMD64/ARM64 | |

301
302
303
304
#### snapshot-agent

| Image:Tag | Arch | Notes |
|-----------|------|-------|
305
| `snapshot-agent:1.0.1` | AMD64/ARM64 | Preview |
306
307
| `snapshot-agent:1.0.0` | AMD64/ARM64 | Preview |

308
309
### Python Wheels

310
311
312
> **PyPI:** [ai-dynamo](https://pypi.org/project/ai-dynamo/) | [ai-dynamo-runtime](https://pypi.org/project/ai-dynamo-runtime/) | [kvbm](https://pypi.org/project/kvbm/)
>
> To access a specific version: `https://pypi.org/project/{package}/{version}/`
313
314
315
316
317

#### ai-dynamo (wheel)

| Package | Python | Platform | Notes |
|---------|--------|----------|-------|
318
| `ai-dynamo==1.0.1` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
319
| `ai-dynamo==1.0.0` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
320
| `ai-dynamo==0.9.1` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
321
322
| `ai-dynamo==0.9.0` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
| `ai-dynamo==0.8.1.post3` | `3.10``3.12` | Linux (glibc `v2.28+`) | TRT-LLM `v1.2.0rc6.post3` |
323
324
325
326
327
328
329
330
331
332
333
334
| `ai-dynamo==0.8.1.post1` | `3.10``3.12` | Linux (glibc `v2.28+`) | TRT-LLM `v1.2.0rc6.post2` |
| `ai-dynamo==0.8.1` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
| `ai-dynamo==0.8.0` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
| `ai-dynamo==0.7.1` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
| `ai-dynamo==0.7.0` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
| `ai-dynamo==0.6.1` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
| `ai-dynamo==0.6.0` | `3.10``3.12` | Linux (glibc `v2.28+`) | |

#### ai-dynamo-runtime (wheel)

| Package | Python | Platform | Notes |
|---------|--------|----------|-------|
335
| `ai-dynamo-runtime==1.0.1` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
336
| `ai-dynamo-runtime==1.0.0` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
337
| `ai-dynamo-runtime==0.9.1` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
338
339
| `ai-dynamo-runtime==0.9.0` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
| `ai-dynamo-runtime==0.8.1.post3` | `3.10``3.12` | Linux (glibc `v2.28+`) | TRT-LLM `v1.2.0rc6.post3` |
340
341
342
343
344
345
346
347
348
349
350
351
| `ai-dynamo-runtime==0.8.1.post1` | `3.10``3.12` | Linux (glibc `v2.28+`) | TRT-LLM `v1.2.0rc6.post2` |
| `ai-dynamo-runtime==0.8.1` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
| `ai-dynamo-runtime==0.8.0` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
| `ai-dynamo-runtime==0.7.1` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
| `ai-dynamo-runtime==0.7.0` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
| `ai-dynamo-runtime==0.6.1` | `3.10``3.12` | Linux (glibc `v2.28+`) | |
| `ai-dynamo-runtime==0.6.0` | `3.10``3.12` | Linux (glibc `v2.28+`) | |

#### kvbm (wheel)

| Package | Python | Platform | Notes |
|---------|--------|----------|-------|
352
| `kvbm==1.0.1` | `3.12` | Linux (glibc `v2.28+`) | |
353
| `kvbm==1.0.0` | `3.12` | Linux (glibc `v2.28+`) | |
354
| `kvbm==0.9.1` | `3.12` | Linux (glibc `v2.28+`) | |
355
| `kvbm==0.9.0` | `3.12` | Linux (glibc `v2.28+`) | |
356
357
358
359
360
361
362
| `kvbm==0.8.1` | `3.12` | Linux (glibc `v2.28+`) | |
| `kvbm==0.8.0` | `3.12` | Linux (glibc `v2.28+`) | |
| `kvbm==0.7.1` | `3.12` | Linux (glibc `v2.28+`) | |
| `kvbm==0.7.0` | `3.12` | Linux (glibc `v2.28+`) | Initial |

### Helm Charts

363
364
365
> **NGC Helm Registry:** [ai-dynamo](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/collections/ai-dynamo)
>
> Direct download: `https://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/{chart}-{version}.tgz`
366

367
368
#### dynamo-crds (Helm chart) -- Deprecated

369
370
> [!NOTE]
> The `dynamo-crds` Helm chart is deprecated as of v1.0.0. CRDs are now managed by the Dynamo Operator.
371
372
373

| Chart | Notes |
|-------|-------|
374
| `dynamo-crds-0.9.1` | Last release |
375
| `dynamo-crds-0.9.0` | |
376
377
378
379
380
381
382
383
384
385
386
| `dynamo-crds-0.8.1` | |
| `dynamo-crds-0.8.0` | |
| `dynamo-crds-0.7.1` | |
| `dynamo-crds-0.7.0` | |
| `dynamo-crds-0.6.1` | |
| `dynamo-crds-0.6.0` | |

#### dynamo-platform (Helm chart)

| Chart | Notes |
|-------|-------|
387
| `dynamo-platform-1.0.1` | |
388
| `dynamo-platform-1.0.0` | |
389
| `dynamo-platform-0.9.1` | |
390
391
| `dynamo-platform-0.9.0-post1` | Helm fix: operator image tag |
| `dynamo-platform-0.9.0` | |
392
393
394
395
396
397
398
| `dynamo-platform-0.8.1` | |
| `dynamo-platform-0.8.0` | |
| `dynamo-platform-0.7.1` | |
| `dynamo-platform-0.7.0` | |
| `dynamo-platform-0.6.1` | |
| `dynamo-platform-0.6.0` | |

399
400
401
402
#### snapshot (Helm chart)

| Chart | Notes |
|-------|-------|
403
| `snapshot-1.0.1` | Preview |
404
405
| `snapshot-1.0.0` | Preview |

406
407
#### dynamo-graph (Helm chart) -- Deprecated

408
409
> [!NOTE]
> The `dynamo-graph` Helm chart is deprecated as of v0.9.0.
410
411
412

| Chart | Notes |
|-------|-------|
413
| `dynamo-graph-0.8.1` | Last release |
414
415
416
417
418
419
420
421
| `dynamo-graph-0.8.0` | |
| `dynamo-graph-0.7.1` | |
| `dynamo-graph-0.7.0` | |
| `dynamo-graph-0.6.1` | |
| `dynamo-graph-0.6.0` | |

### Rust Crates

422
> **crates.io:** [dynamo-runtime](https://crates.io/crates/dynamo-runtime) | [dynamo-llm](https://crates.io/crates/dynamo-llm) | [dynamo-async-openai](https://crates.io/crates/dynamo-async-openai) | [dynamo-parsers](https://crates.io/crates/dynamo-parsers) | [dynamo-memory](https://crates.io/crates/dynamo-memory) | [dynamo-config](https://crates.io/crates/dynamo-config) | [dynamo-tokens](https://crates.io/crates/dynamo-tokens)
423
424
>
> To access a specific version: `https://crates.io/crates/{crate}/{version}`
425
426
427
428
429

#### dynamo-runtime (crate)

| Crate | MSRV (Rust) | Notes |
|-------|-------------|-------|
430
| `dynamo-runtime@1.0.1` | `v1.82` | |
431
| `dynamo-runtime@1.0.0` | `v1.82` | |
432
| `dynamo-runtime@0.9.1` | `v1.82` | |
433
| `dynamo-runtime@0.9.0` | `v1.82` | |
434
435
436
437
438
439
440
441
442
443
444
| `dynamo-runtime@0.8.1` | `v1.82` | |
| `dynamo-runtime@0.8.0` | `v1.82` | |
| `dynamo-runtime@0.7.1` | `v1.82` | |
| `dynamo-runtime@0.7.0` | `v1.82` | |
| `dynamo-runtime@0.6.1` | `v1.82` | |
| `dynamo-runtime@0.6.0` | `v1.82` | |

#### dynamo-llm (crate)

| Crate | MSRV (Rust) | Notes |
|-------|-------------|-------|
445
| `dynamo-llm@1.0.1` | `v1.82` | |
446
| `dynamo-llm@1.0.0` | `v1.82` | |
447
| `dynamo-llm@0.9.1` | `v1.82` | |
448
| `dynamo-llm@0.9.0` | `v1.82` | |
449
450
451
452
453
454
455
456
457
458
459
| `dynamo-llm@0.8.1` | `v1.82` | |
| `dynamo-llm@0.8.0` | `v1.82` | |
| `dynamo-llm@0.7.1` | `v1.82` | |
| `dynamo-llm@0.7.0` | `v1.82` | |
| `dynamo-llm@0.6.1` | `v1.82` | |
| `dynamo-llm@0.6.0` | `v1.82` | |

#### dynamo-async-openai (crate)

| Crate | MSRV (Rust) | Notes |
|-------|-------------|-------|
460
| `dynamo-async-openai@1.0.1` | `v1.82` | |
461
| `dynamo-async-openai@1.0.0` | `v1.82` | |
462
| `dynamo-async-openai@0.9.1` | `v1.82` | |
463
| `dynamo-async-openai@0.9.0` | `v1.82` | |
464
465
466
467
468
469
470
471
472
473
474
| `dynamo-async-openai@0.8.1` | `v1.82` | |
| `dynamo-async-openai@0.8.0` | `v1.82` | |
| `dynamo-async-openai@0.7.1` | `v1.82` | |
| `dynamo-async-openai@0.7.0` | `v1.82` | |
| `dynamo-async-openai@0.6.1` | `v1.82` | |
| `dynamo-async-openai@0.6.0` | `v1.82` | |

#### dynamo-parsers (crate)

| Crate | MSRV (Rust) | Notes |
|-------|-------------|-------|
475
| `dynamo-parsers@1.0.1` | `v1.82` | |
476
| `dynamo-parsers@1.0.0` | `v1.82` | |
477
| `dynamo-parsers@0.9.1` | `v1.82` | |
478
| `dynamo-parsers@0.9.0` | `v1.82` | |
479
480
481
482
483
484
485
486
487
488
489
| `dynamo-parsers@0.8.1` | `v1.82` | |
| `dynamo-parsers@0.8.0` | `v1.82` | |
| `dynamo-parsers@0.7.1` | `v1.82` | |
| `dynamo-parsers@0.7.0` | `v1.82` | |
| `dynamo-parsers@0.6.1` | `v1.82` | |
| `dynamo-parsers@0.6.0` | `v1.82` | |

#### dynamo-memory (crate)

| Crate | MSRV (Rust) | Notes |
|-------|-------------|-------|
490
| `dynamo-memory@1.0.1` | `v1.82` | |
491
| `dynamo-memory@1.0.0` | `v1.82` | |
492
| `dynamo-memory@0.9.1` | `v1.82` | |
493
| `dynamo-memory@0.9.0` | `v1.82` | |
494
495
496
497
498
499
500
| `dynamo-memory@0.8.1` | `v1.82` | |
| `dynamo-memory@0.8.0` | `v1.82` | Initial |

#### dynamo-config (crate)

| Crate | MSRV (Rust) | Notes |
|-------|-------------|-------|
501
| `dynamo-config@1.0.1` | `v1.82` | |
502
| `dynamo-config@1.0.0` | `v1.82` | |
503
| `dynamo-config@0.9.1` | `v1.82` | |
504
| `dynamo-config@0.9.0` | `v1.82` | |
505
506
| `dynamo-config@0.8.1` | `v1.82` | |
| `dynamo-config@0.8.0` | `v1.82` | Initial |
507
508
509
510
511

#### dynamo-tokens (crate)

| Crate | MSRV (Rust) | Notes |
|-------|-------------|-------|
512
| `dynamo-tokens@1.0.1` | `v1.82` | |
513
| `dynamo-tokens@1.0.0` | `v1.82` | |
514
| `dynamo-tokens@0.9.1` | `v1.82` | |
515
| `dynamo-tokens@0.9.0` | `v1.82` | Initial |
516
517
518
519
520

#### dynamo-mocker (crate)

| Crate | MSRV (Rust) | Notes |
|-------|-------------|-------|
521
| `dynamo-mocker@1.0.1` | `v1.82` | |
522
523
524
525
526
527
| `dynamo-mocker@1.0.0` | `v1.82` | Initial |

#### dynamo-kv-router (crate)

| Crate | MSRV (Rust) | Notes |
|-------|-------------|-------|
528
| `dynamo-kv-router@1.0.1` | `v1.82` | |
529
| `dynamo-kv-router@1.0.0` | `v1.82` | Initial |
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578

---

## Pre-Release Artifacts

> [!WARNING]
> **Pre-Release artifacts do not go through QA validation.** Pre-release versions are experimental previews intended for early testing and feedback. They may contain bugs, breaking changes, or incomplete features. Use stable releases for production workloads.

### v1.1.0-dev.1

- **Branch:** [release/1.1.0-dev.1](https://github.com/ai-dynamo/dynamo/tree/release/1.1.0-dev.1)
- **GitHub Tag:** [v1.1.0-dev.1](https://github.com/ai-dynamo/dynamo/releases/tag/v1.1.0-dev.1)
- **Backends:** SGLang `v0.5.9` | TensorRT-LLM `v1.3.0rc5.post1` | vLLM `v0.17.1` | NIXL `v0.10.1`

#### Container Images

| Image:Tag | Backend | CUDA | Arch |
|-----------|---------|------|------|
| `vllm-runtime:1.1.0-dev.1` | vLLM `v0.17.1` | `v12.9` | AMD64/ARM64 |
| `vllm-runtime:1.1.0-dev.1-cuda13` | vLLM `v0.17.1` | `v13.0` | AMD64/ARM64* |
| `vllm-runtime:1.1.0-dev.1-efa-amd64` | vLLM `v0.17.1` | `v12.9` | AMD64 |
| `sglang-runtime:1.1.0-dev.1` | SGLang `v0.5.9` | `v12.9` | AMD64/ARM64 |
| `sglang-runtime:1.1.0-dev.1-cuda13` | SGLang `v0.5.9` | `v13.0` | AMD64/ARM64* |
| `tensorrtllm-runtime:1.1.0-dev.1` | TRT-LLM `v1.3.0rc5.post1` | `v13.1` | AMD64/ARM64 |
| `tensorrtllm-runtime:1.1.0-dev.1-efa-amd64` | TRT-LLM `v1.3.0rc5.post1` | `v13.1` | AMD64 |
| `dynamo-frontend:1.1.0-dev.1` | — | — | AMD64/ARM64 |
| `kubernetes-operator:1.1.0-dev.1` | — | — | AMD64/ARM64 |
| `snapshot-agent:1.1.0-dev.1` | — | — | AMD64/ARM64 |

#### Python Wheels

Available from [pypi.nvidia.com](https://pypi.nvidia.com/) (pre-release index):

```bash
uv pip install --pre --extra-index-url https://pypi.nvidia.com/ ai-dynamo==1.1.0.dev1
uv pip install --pre --extra-index-url https://pypi.nvidia.com/ ai-dynamo-runtime==1.1.0.dev1
uv pip install --pre --extra-index-url https://pypi.nvidia.com/ kvbm==1.1.0.dev1
```

#### Helm Charts

| Chart | NGC |
|-------|-----|
| `dynamo-platform-1.1.0-dev.1` | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/helm-charts/dynamo-platform?version=1.1.0-dev.1) |
| `snapshot-1.1.0-dev.1` | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/helm-charts/snapshot?version=1.1.0-dev.1) |

#### Rust Crates

Not shipped for pre-release versions.