Commits · ed5e4fd021d7f92a7468751b20293ac242a5c8c2 · OpenDAS / dynamo

02 Jan, 2026 1 commit
- chore: update all copyright headers in repo to 2026 (#5130) · cf433e68
  Tushar Sharma authored Jan 02, 2026
```
Signed-off-by: Tushar Sharma <tusharma@nvidia.com>
```
  cf433e68
05 Dec, 2025 1 commit
- fix: use channel to avoid race condition from async nats registration task (#4758) · 501ef021
  Biswa Panda authored Dec 05, 2025
  
  501ef021
04 Dec, 2025 1 commit
- refactor: remove legacy NATS metrics and stats_handler (#4680) · 00b64ae0
  Keiven C authored Dec 04, 2025
```
Signed-off-by: Keiven Chang <keivenchang@users.noreply.github.com>
```
  00b64ae0
26 Nov, 2025 1 commit
- fix: allow router to be registered as general transport type (#4633) · ea9d00e7
  Yan Ru Pei authored Nov 26, 2025
```
Signed-off-by: PeaBrane <yanrpei@gmail.com>
```
  ea9d00e7
25 Nov, 2025 2 commits
- chore(runtime): Make nats_client private, refactor NATS stats scraping (#4591) · 17dcffe8
  Graham King authored Nov 25, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  17dcffe8
- chore(runtime): Tidy up component paths (#4560) · 51c1b9f1
  Graham King authored Nov 25, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  51c1b9f1
21 Nov, 2025 1 commit
- chore: Make nats_client private at crate level, various tidy up (#4513) · f05f7629
  Graham King authored Nov 21, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  f05f7629
19 Nov, 2025 1 commit
- feat: Only monitor NATS metrics if using NATS request plane (#4442) · 69797b5a
  Graham King authored Nov 19, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  69797b5a
17 Nov, 2025 1 commit
- feat: Command line flag to set request plane mode: tcp, http or nats (#4365) · 886506c1
  Graham King authored Nov 17, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  886506c1
13 Nov, 2025 1 commit
- feat: transport agnostic request plane for dynamo - natless (#4246) · 06b0ebef
  Biswa Panda authored Nov 13, 2025
  
  06b0ebef
11 Nov, 2025 1 commit
- chore: Remove static mode (#4235) · e1af3af6
  Graham King authored Nov 11, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  e1af3af6
10 Nov, 2025 1 commit
- refactor: Make the Runtime and DistributedRuntime fields private (#4193) · cf630bf7
  Graham King authored Nov 10, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  cf630bf7
08 Nov, 2025 1 commit
- fix: refactor to use service discovery (#4092) · 09b26bf6
  mohammedabdulwahhab authored Nov 08, 2025
```
Signed-off-by: mohammedabdulwahhab <furkhan324@berkeley.edu>
```
  09b26bf6
28 Oct, 2025 1 commit
- chore(runtime): Do not expose etcd lease ID (#3915) · c78b5901
  Graham King authored Oct 28, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  c78b5901
24 Oct, 2025 1 commit

refactor: redesign the metrics API from Trait to composition to make the code... · cbe0b177

Keiven C authored Oct 24, 2025


refactor: redesign the metrics API from Trait to composition to make the code cleaner and easier to understand (#3687)
Signed-off-by: Keiven Chang <keivenchang@users.noreply.github.com>

cbe0b177

23 Oct, 2025 1 commit
- chore: Use KeyValueStoreManager instead of etcd::Client (#3822) · 7731b024
  Graham King authored Oct 23, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  7731b024
21 Oct, 2025 1 commit
- refactor(runtime): Replace std::sync::Mutex with parking_lot::Mutex (#3740) · 9ae98ed7
  Graham King authored Oct 21, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  9ae98ed7
20 Oct, 2025 1 commit
- chore: Replace ServiceConfigBuilder with add_stats_service (#3736) · f6ed01b1
  Graham King authored Oct 20, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  f6ed01b1
17 Oct, 2025 1 commit
- refactor: Make `nats_client` optional internally (#3705) · 66fd6f84
  Graham King authored Oct 17, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  66fd6f84
10 Oct, 2025 1 commit
- feat: Introduce storage_client in DistributedRuntime (#3507) · 7a7d397c
  Graham King authored Oct 10, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  7a7d397c
07 Oct, 2025 1 commit
- feat(etcd): Version the etcd keys (#3458) · a5371bfc
  Graham King authored Oct 07, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  a5371bfc
30 Sep, 2025 2 commits
- chore: Add Key abstraction in our KeyValueStore (#3322) · 50cdae5f
  Graham King authored Sep 30, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  50cdae5f
- chore: Move model_input, model_type from ModelEntry to ModelDeploymentCard (#3292) · 6ffd20a8
  Graham King authored Sep 30, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  6ffd20a8
04 Sep, 2025 1 commit

fix: reduce nats stats query frequency (#2847) · 4df2e2d6

Keiven C authored Sep 04, 2025


Signed-off-by: Keiven Chang <keivenchang@users.noreply.github.com>
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>

4df2e2d6

22 Aug, 2025 3 commits
- fix: move metrics registration to service creation (#2664) · 92151e3e
  Keiven C authored Aug 22, 2025
```
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
```
  92151e3e
- chore: Rust to 1.89 and edition 2024 (#2659) · bce74588
  Graham King authored Aug 22, 2025
  
  bce74588
- chore(llm): Rename protocols::Endpoint to EndpointId (#2615) · 6a358f7c
  Graham King authored Aug 22, 2025
  
  6a358f7c
19 Aug, 2025 1 commit

fix: use tokio spawn / interval.tick(), make nats metric names clearer, fix... · bec1dd54

Keiven C authored Aug 18, 2025


fix: use tokio spawn / interval.tick(), make nats metric names clearer, fix tests sharing environment variables (temp_env) (#2506)
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>

bec1dd54

18 Aug, 2025 2 commits
- feat(http): TLS support (#2492) · a4bbe492
  Graham King authored Aug 18, 2025
  
  a4bbe492
- fix: replace metrics callback with background scraping to prevent tim… (#2480) · 04442173
  Keiven C authored Aug 18, 2025
```
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
```
  04442173
15 Aug, 2025 1 commit
- feat(metrics): add NATS client metrics to prometheus_metrics_fmt (#2292) · acbdabc4
  Keiven C authored Aug 14, 2025
```
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
```
  acbdabc4
14 Aug, 2025 1 commit
- feat: Add a "model" label to Component metrics (#2389) · 3a3f5bf2
  Tzu-Ling Kan authored Aug 14, 2025
  
  3a3f5bf2
23 Jul, 2025 1 commit
- feat: health check changes based on endpoint served (#1996) · b127d95f
  Neelay Shah authored Jul 22, 2025
  
  b127d95f
22 Jul, 2025 1 commit

feat: add a hierarchical Prometheus MetricsRegistry trait for... · e5a8628f

Keiven C authored Jul 22, 2025

feat: add a hierarchical Prometheus MetricsRegistry trait for DistributedRuntime, Namespace, Components, and Endpoint (#2008)
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
Co-authored-by: Ryan Olson <rolson@nvidia.com>

e5a8628f

11 Jun, 2025 1 commit
- refactor: move kv store to runtime (#1459) · 08355da6
  Ryan Olson authored Jun 11, 2025
  
  08355da6
23 May, 2025 1 commit

fix: etcd.rs - linear increasing watch with number of requests (#1081) · 3f9c3ffe

Yan Ru Pei authored May 23, 2025

Signed-off-by: Michael Feil <63565275+michaelfeil@users.noreply.github.com>
Co-authored-by: Michael Feil <63565275+michaelfeil@users.noreply.github.com>
Co-authored-by: jthomson04 <jwillthomson19@gmail.com>
Co-authored-by: Ryan Olson <ryanolson@users.noreply.github.com>

3f9c3ffe

21 May, 2025 1 commit

chore: Fix model removal on instance stop, refactor discovery (#1142) · b520bf44

Graham King authored May 21, 2025

- Stop advertising a model when it's last instance stops. Previously was when any instance stops.
- Faster locks on model manager.
- Move discovery code out of http, as it is used by all inputs.

b520bf44

19 May, 2025 1 commit

feat: Support multiple models on single ingress node (#1127) · aeb79e62

Graham King authored May 19, 2025

We can now do this:

- Node 1:

```
dynamo-run in=http out=dyn
```

- Node 2 and 3, two instances of component 'backend' in the nemotron_ultra pipeline:

```
dynamo-run in=dyn://nemotron_ultra.backend.generate out=vllm /data/models/NemotronUltra
```

- Node 4 and 5, two instances of the 'backend' component in nemotron_super pipeline:

```
dynamo-run in=dyn://nemotron_super.backend.generate out=vllm /data/models/NemotronSuper
```

The ingress node will discover all four instances and route correctly. We have been planning for this for a long time now.

As part of this auto-discovery is now always `out=dyn`, with no extra URL parts. Previously it could only route to a single pipeline.

Also:
- Refactor endpoint / instance naming now that I understand them
- Fix removing models when their instance stops.

aeb79e62

15 May, 2025 1 commit

chore: Prevent duplicate components with different models. (#1103) · 641234cd

Graham King authored May 15, 2025

Each namespace is for a single pipeline, so a component must be model-unique. The means we can have several components with the same name running the same model (data parallel), their traffic will be routed according to `--router-mode`, but we cannot have several components with the same name running different models.

Add an `ensure_unique` check to prevent that happening.

641234cd

06 May, 2025 1 commit

feat: dynamo-run <-> python interop (#934) · 99cd9d85

Graham King authored May 05, 2025

Adding this to a Python script makes it register on the network so that `dynamo-run` can discover it and send it requests:
```
from dynamo.llm import register_llm

MODEL = "Qwen/Qwen2.5-0.5B-Instruct"
await register_llm(endpoint, MODEL, 3)
```

Full vllm example, with pre-processing in dynamo:
- `dynamo-run in=text out=dyn://dynamo.backend.generate`
- `cd lib/bindings/python/examples/hello_world`
- `python server_vllm.py`

This builds on top of the work to move pre-processor to ingress side. It means we can decouple Rust and Python using NATS as the bus.

The `register_llm` call does this:

- Download the model from HF if necessary
- Load the model deployment card from the HF folder or extract from GGUF
- Push the tokenizer config etc into NATS object store so ingress can access it from a different machine
- Publish the model deployment card to ETCD

99cd9d85