Commits · 3c49a02c60d70dfcc0434dd82e424e3ea7cd6267 · OpenDAS / dynamo

03 Apr, 2025 1 commit
- chore: rename duration to timeout (#503) · 3c49a02c
  tlipoca9 authored Apr 03, 2025
  
  3c49a02c
31 Mar, 2025 1 commit
- refactor: prometheus upgrade (#452) · de290537
  Ryan Olson authored Mar 31, 2025
  
  de290537
17 Mar, 2025 1 commit
- feat: expose Python binding for KVEventPublisher. Use event pub/sub trait for KV events (#169) · 6e09681e
  GuanLuo authored Mar 17, 2025
  
  6e09681e
07 Mar, 2025 1 commit
- refactor: Use library constant for kv-hit-rate subject (#48) · 2ee29443
  Ryan McCormick authored Mar 07, 2025
```
Replaces hard-coded "kv-hit-rate" string in multiple places with KV_HIT_RATE_SUBJECT constant in lib/llm.
```
  2ee29443
06 Mar, 2025 1 commit
- feat: Add estimated kv cache hit metric events (#30) · 09656f6c
  Ryan McCormick authored Mar 06, 2025
  
  09656f6c
27 Feb, 2025 1 commit
- refactor: service/endpoint stats_handler (#282) · 85cc7b67
  Ryan Olson authored Feb 27, 2025
  
  85cc7b67
25 Feb, 2025 1 commit

refactor: move libs to lib dir · 08fcd7e9

Neelay Shah authored Feb 24, 2025


Signed-off-by: Neelay Shah <neelays@nvidia.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>

08fcd7e9

21 Feb, 2025 2 commits

feat(tio): Distributed inference! (#235) · 32a748e4

Graham King authored Feb 21, 2025

Add support in tio for distributed components and discovery.

Node 1:
```
tio in=http out=tdr://ns/backend/mistralrs
```

Node 2:
```
tio in=tdr://ns/backend/mistralrs out=mistralrs ~/llm_models/Llama-3.2-3B-Instruct
```

This will use etcd to auto-discover the model and NATS to talk to it. You can run multiple workers on the same endpoint and it will pick one at random each time.

The `ns/backend/mistralrs` are purely symbolic, pick anything as long as it has three parts, and it matches the other node.

32a748e4

feat: event plane + count · 3b7a462d

Ryan Olson authored Feb 21, 2025


Signed-off-by: Ryan Olson <ryanolson@users.noreply.github.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>

3b7a462d

18 Feb, 2025 1 commit

feat: Add KV publisher and receiver. Add KV aware routing example. · 8588e33a

GuanLuo authored Feb 18, 2025


Signed-off-by: Neelay Shah <neelays@nvidia.com>
Co-authored-by: aflowers <aflowers@nvidia.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
Co-authored-by: hongkuanz <hongkuanz@nvidia.com>
Co-authored-by: Neelay Shah <neelays@nvidia.com>

8588e33a

15 Feb, 2025 1 commit
- refactor: adding facades for runtimes (#187) · fd0bcfa2
  Ryan Olson authored Feb 15, 2025
  
  fd0bcfa2
11 Feb, 2025 1 commit
- refactor: Use tracing crate (#161) · a62a8627
  Graham King authored Feb 11, 2025
  
  a62a8627
10 Feb, 2025 1 commit
- doc: Fix doc links (#149) · c7f8acd4
  Graham King authored Feb 10, 2025
  
  c7f8acd4
05 Feb, 2025 1 commit
- ci: Add Copyright Verification Scripts w/ Automation (#110) · c9130f8f
  J Wyman authored Feb 05, 2025
  
  c9130f8f
04 Feb, 2025 1 commit
- feat: rust - initial commit · 5ed8c1c0
  Ryan Olson authored Feb 03, 2025
```
the journey begins
```
  5ed8c1c0