**v0.9.0.post1** is a Helm-chart-only patch release on NGC (no GitHub release). It fixes the `dynamo-platform` Helm chart which incorrectly set the operator image tag to `0.7.1` instead of `0.9.0`. Only the `dynamo-platform` chart was patched; all other artifacts remain at v0.9.0.
**v0.9.0.post1** is a Helm-chart-only patch release on NGC (no GitHub release). It fixes the `dynamo-platform` Helm chart which incorrectly set the operator image tag to `0.7.1` instead of `0.9.0`. Only the `dynamo-platform` chart was patched; all other artifacts remain at v0.9.0. Users upgrading to v0.9.1 do not need this patch.
| Artifact | Version | Change | Link |
| Artifact | Version | Change | Link |
|----------|---------|--------|------|
|----------|---------|--------|------|
...
@@ -31,13 +31,13 @@ Release history in this document begins at v0.6.0.
...
@@ -31,13 +31,13 @@ Release history in this document begins at v0.6.0.
| Image:Tag | Description | Backend | CUDA | Arch | NGC | Notes |
| Image:Tag | Description | Backend | CUDA | Arch | NGC | Notes |
| `dynamo-frontend:0.9.0` | API gateway with Endpoint Prediction Protocol (EPP) | — | — | AMD64/ARM64 | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/dynamo-frontend?version=0.9.0) | |
| `dynamo-frontend:0.9.1` | API gateway with Endpoint Prediction Protocol (EPP) | — | — | AMD64/ARM64 | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/dynamo-frontend?version=0.9.1) | |
| `kubernetes-operator:0.9.0` | Kubernetes operator for Dynamo deployments | — | — | AMD64/ARM64 | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/kubernetes-operator?version=0.9.0) | |
| `kubernetes-operator:0.9.1` | Kubernetes operator for Dynamo deployments | — | — | AMD64/ARM64 | [link](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/kubernetes-operator?version=0.9.1) | |
\* Multimodal inference on CUDA 13 images: works on AMD64 for all backends; works on ARM64 only for TensorRT-LLM (`vllm-runtime:*-cuda13` and `sglang-runtime:*-cuda13` do not support multimodality on ARM64).
\* Multimodal inference on CUDA 13 images: works on AMD64 for all backends; works on ARM64 only for TensorRT-LLM (`vllm-runtime:*-cuda13` and `sglang-runtime:*-cuda13` do not support multimodality on ARM64).
...
@@ -47,16 +47,16 @@ We recommend using the TensorRT-LLM NGC container instead of the `ai-dynamo[trtl
...
@@ -47,16 +47,16 @@ We recommend using the TensorRT-LLM NGC container instead of the `ai-dynamo[trtl
> For API documentation, see each crate on [docs.rs](https://docs.rs/). To build Dynamo from source, see [Building from Source](https://github.com/ai-dynamo/dynamo#building-from-source).
> For API documentation, see each crate on [docs.rs](https://docs.rs/). To build Dynamo from source, see [Building from Source](https://github.com/ai-dynamo/dynamo#building-from-source).
```bash
```bash
cargo add dynamo-runtime@0.9.0
cargo add dynamo-runtime@0.9.1
cargo add dynamo-llm@0.9.0
cargo add dynamo-llm@0.9.1
cargo add dynamo-async-openai@0.9.0
cargo add dynamo-async-openai@0.9.1
cargo add dynamo-parsers@0.9.0
cargo add dynamo-parsers@0.9.1
cargo add dynamo-memory@0.9.0
cargo add dynamo-memory@0.9.1
cargo add dynamo-config@0.9.0
cargo add dynamo-config@0.9.1
cargo add dynamo-tokens@0.9.0
cargo add dynamo-tokens@0.9.1
```
```
**CUDA and Driver Requirements:** For detailed CUDA toolkit versions and minimum driver requirements for each container image, see the [Support Matrix](support-matrix.md#cuda-and-driver-requirements).
**CUDA and Driver Requirements:** For detailed CUDA toolkit versions and minimum driver requirements for each container image, see the [Support Matrix](support-matrix.md#cuda-and-driver-requirements).
...
@@ -155,6 +155,7 @@ For a complete list of known issues, refer to the release notes for each version
...
@@ -155,6 +155,7 @@ For a complete list of known issues, refer to the release notes for each version
## Release History
## Release History
-**v0.9.1**: Updated TRT-LLM to `v1.3.0rc3`. All other backend versions unchanged from v0.9.0.
-**v0.9.0**: Updated vLLM to `v0.14.1`, SGLang to `v0.5.8`, TRT-LLM to `v1.3.0rc1`, NIXL to `v0.9.0`. New `dynamo-tokens` Rust crate. Deprecated `dynamo-graph` Helm chart.
-**v0.9.0**: Updated vLLM to `v0.14.1`, SGLang to `v0.5.8`, TRT-LLM to `v1.3.0rc1`, NIXL to `v0.9.0`. New `dynamo-tokens` Rust crate. Deprecated `dynamo-graph` Helm chart.
-**v0.8.1.post1/.post2/.post3 Patches**: Experimental patch releases updating TRT-LLM only (PyPI wheels and TRT-LLM container). No other artifacts changed.
-**v0.8.1.post1/.post2/.post3 Patches**: Experimental patch releases updating TRT-LLM only (PyPI wheels and TRT-LLM container). No other artifacts changed.
...
@@ -166,7 +167,8 @@ For a complete list of known issues, refer to the release notes for each version
...
@@ -166,7 +167,8 @@ For a complete list of known issues, refer to the release notes for each version