Commits · f6ed01b14c8caf0de4d90a05cb766c787a904af7 · OpenDAS / dynamo

20 Oct, 2025 1 commit
- chore: Replace ServiceConfigBuilder with add_stats_service (#3736) · f6ed01b1
  Graham King authored Oct 20, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  f6ed01b1
13 Oct, 2025 1 commit
- chore: pre-0.6.0 activities (#3592) · cd2389ba
  Harrison Saturley-Hall authored Oct 13, 2025
```
Signed-off-by: Harrison Saturley-Hall <hsaturleyhal@nvidia.com>
```
  cd2389ba
30 Sep, 2025 1 commit
- chore: Move model_input, model_type from ModelEntry to ModelDeploymentCard (#3292) · 6ffd20a8
  Graham King authored Sep 30, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  6ffd20a8
26 Sep, 2025 1 commit
- feat: add Rayon compute pool for CPU-intensive operations (#2969) · a13c4cb6
  Ryan Olson authored Sep 26, 2025
```
Signed-off-by: Ryan Olson <rolson@nvidia.com>
```
  a13c4cb6
24 Sep, 2025 1 commit
- chore: bump versions ahead of 0.5.1 release (#3209) · 980727bb
  Harrison Saturley-Hall authored Sep 24, 2025
```
Signed-off-by: Harrison Saturley-Hall <hsaturleyhal@nvidia.com>
```
  980727bb
19 Sep, 2025 1 commit
- chore: Upgrade Rust to 1.90 (#3147) · 7a5a0bd6
  Graham King authored Sep 19, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  7a5a0bd6
16 Sep, 2025 1 commit
- chore(runtime): Shorten the license header (#3059) · 02a22cbc
  Graham King authored Sep 16, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  02a22cbc
05 Sep, 2025 2 commits
- fix: Load the tokenizer JSON once for chat and completions. (#2910) · cb5a657a
  Graham King authored Sep 05, 2025
```
Signed-off-by: Graham King <grahamk@nvidia.com>
```
  cb5a657a
- docs: change docs to default port 8000 (#2876) · 1995ef9a
  Yan Ru Pei authored Sep 04, 2025
```
Signed-off-by: PeaBrane <yanrpei@gmail.com>
```
  1995ef9a
02 Sep, 2025 1 commit
- chore: bump version numbers ahead of 0.5.0 release (#2812) · 561ecb98
  Harrison Saturley-Hall authored Sep 02, 2025
```
Signed-off-by: Harrison King Saturley-Hall <hsaturleyhal@nvidia.com>
```
  561ecb98
22 Aug, 2025 1 commit
- chore: Rust to 1.89 and edition 2024 (#2659) · bce74588
  Graham King authored Aug 22, 2025
  
  bce74588
21 Aug, 2025 1 commit
- feat: Add model label for vllm backend metrics (#2474) · 57728909
  Tzu-Ling Kan authored Aug 21, 2025
```
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
```
  57728909
20 Aug, 2025 1 commit
- chore: Bumped Dynamo version to 0.4.1 (#2545) · 9a021885
  Dmitry Tokarev authored Aug 19, 2025
  
  9a021885
19 Aug, 2025 1 commit
- feat: Rename dynamo_component_concurrent_requests (#2515) · 6f7f6b12
  Tzu-Ling Kan authored Aug 19, 2025
```
Signed-off-by: Tzu-Ling Kan <tzulingk@nvidia.com>
```
  6f7f6b12
18 Aug, 2025 1 commit
- fix: replace metrics callback with background scraping to prevent tim… (#2480) · 04442173
  Keiven C authored Aug 18, 2025
```
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
```
  04442173
15 Aug, 2025 2 commits
- fix: remove kvmanager feature from python 3.12 ai-dynamo-runtime wheel (#2456) · ffae72b7
  Harrison Saturley-Hall authored Aug 15, 2025
  
  ffae72b7
- feat(metrics): add NATS client metrics to prometheus_metrics_fmt (#2292) · acbdabc4
  Keiven C authored Aug 14, 2025
```
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
```
  acbdabc4
14 Aug, 2025 1 commit
- feat: Add a "model" label to Component metrics (#2389) · 3a3f5bf2
  Tzu-Ling Kan authored Aug 14, 2025
  
  3a3f5bf2
13 Aug, 2025 1 commit
- feat: Allow an endpoint to serve multiple models (#2418) · 72ec5f5c
  Graham King authored Aug 13, 2025
  
  72ec5f5c
11 Aug, 2025 1 commit
- refactor: rename to system status server for consistency (#2354) · c91e2e49
  Keiven C authored Aug 11, 2025
```
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
```
  c91e2e49
07 Aug, 2025 2 commits
- chore(metrics): Remove the Arc (#2357) · a3f7a39f
  Graham King authored Aug 07, 2025
  
  a3f7a39f
- refactor: Rename HTTP server to metrics server in worker process (#2318) · 254f4819
  Yingge He authored Aug 06, 2025
  
  254f4819
01 Aug, 2025 1 commit
- fix: dynamo_component to be added in metric names (#2180) · efd863d6
  Keiven C authored Jul 31, 2025
```
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
```
  efd863d6
30 Jul, 2025 1 commit
- chore: Version bump to 0.4.0 (#2179) · 4c90b1b9
  Dmitry Tokarev authored Jul 30, 2025
  
  4c90b1b9
28 Jul, 2025 1 commit
- feat: Base metrics: add generic ingress handler metrics (#2090) · 615580d8
  Keiven C authored Jul 28, 2025
```
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
```
  615580d8
25 Jul, 2025 1 commit
- fix: move docker-compose.yml to deploy/, and update frontend port (#2121) · 4498a77d
  Keiven C authored Jul 25, 2025
```
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
```
  4498a77d
22 Jul, 2025 1 commit

feat: add a hierarchical Prometheus MetricsRegistry trait for... · e5a8628f

Keiven C authored Jul 22, 2025

feat: add a hierarchical Prometheus MetricsRegistry trait for DistributedRuntime, Namespace, Components, and Endpoint (#2008)
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
Co-authored-by: Ryan Olson <rolson@nvidia.com>

e5a8628f

16 Jul, 2025 1 commit
- perf(router): Remove lock from router hot path (#1963) · aba60996
  Graham King authored Jul 16, 2025
  
  aba60996
08 Jul, 2025 1 commit
- feat: Build DistributedRuntime-level HTTP server with /health /metrics (#1656) · ece76a62
  ZichengMa authored Jul 08, 2025
  
  ece76a62
07 Jul, 2025 1 commit
- chore: update versions for 0.3.2 release (#1793) · c4935b34
  Anant Sharma authored Jul 07, 2025
  
  c4935b34
03 Jul, 2025 1 commit
- chore(engines): Upgrade mistralrs to 0.6.0 (#1767) · 4ab47617
  Graham King authored Jul 03, 2025
  
  4ab47617
13 Jun, 2025 1 commit
- chore: update dynamo and nixl versions for 0.3.1 (#1517) · 99e67e60
  Anant Sharma authored Jun 13, 2025
  
  99e67e60
29 May, 2025 1 commit
- chore: update dynamo and nixl versions for 0.3.0 (#1240) · 9d9a1d9b
  Anant Sharma authored May 29, 2025
  
  9d9a1d9b
23 May, 2025 1 commit
- chore: Upgrade Rust to 1.87 (#1189) · a4c49fe5
  Graham King authored May 23, 2025
  
  a4c49fe5
19 May, 2025 1 commit

feat: Support multiple models on single ingress node (#1127) · aeb79e62

Graham King authored May 19, 2025

We can now do this:

- Node 1:

```
dynamo-run in=http out=dyn
```

- Node 2 and 3, two instances of component 'backend' in the nemotron_ultra pipeline:

```
dynamo-run in=dyn://nemotron_ultra.backend.generate out=vllm /data/models/NemotronUltra
```

- Node 4 and 5, two instances of the 'backend' component in nemotron_super pipeline:

```
dynamo-run in=dyn://nemotron_super.backend.generate out=vllm /data/models/NemotronSuper
```

The ingress node will discover all four instances and route correctly. We have been planning for this for a long time now.

As part of this auto-discovery is now always `out=dyn`, with no extra URL parts. Previously it could only route to a single pipeline.

Also:
- Refactor endpoint / instance naming now that I understand them
- Fix removing models when their instance stops.

aeb79e62

16 May, 2025 1 commit
- test: Add doc tests to Rust CI (#1102) · 34f3fc6d
  Ryan McCormick authored May 16, 2025
  
  34f3fc6d
09 May, 2025 2 commits
- chore: bump versions and NIXL dependencies for 0.2.1 (#1012) · e9cb035a
  Harrison Saturley-Hall authored May 09, 2025
  
  e9cb035a
- feat: allow adding auth to etcd (#980) · b2e401bc
  wxsm authored May 09, 2025
```
Allow both password or TLS auth, if none of these is provided fallback to no auth

Closes #657
```
  b2e401bc
29 Apr, 2025 1 commit

chore: Split PushRouter from Client (#817) · a1a10365

Graham King authored Apr 29, 2025

In a distributed system we don't know if the remote workers need pre-processing done ingress-side or not. Previously Client required us to decide this before discovering the remote endpoints, which was fine because pre-processing was worker-side.

As part of moving pre-processing back to ingress-side we need to split this into two steps:
- Client discovers the endpoints, and (later PR) will fetch their Model Deployment Card.
- PushRouter will use the Model Deployment Card to decide if they need pre-processing or not, which affects the types of the generic parameters.

Part of #743

a1a10365

25 Apr, 2025 1 commit
- chore: bump NIXL version and package versions (#836) · 0715d469
  Harrison Saturley-Hall authored Apr 25, 2025
```
Signed-off-by: Harrison Saturley-Hall <454891+saturley-hall@users.noreply.github.com>
```
  0715d469