Commits · 0a1d1fbe3d048959a55568d0232b2f7dda3f9bf9 · OpenDAS / dynamo

28 May, 2025 15 commits
- feat(dynamo-llm): Remove bring-your-own-engine (#1216) · 0a1d1fbe
  Graham King authored May 28, 2025
```
It was removed from the docs in 0.2.1 and replaced with writing a [standalone Python engine](https://github.com/ai-dynamo/dynamo/blob/main/docs/guides/dynamo_run.md#writing-your-own-engine-in-python).

Also remove the associated `dynamo-run` feature `python`.

Releasing this in 0.3.0 will resolve #784 and #1109.
```
  0a1d1fbe
- fix: Fix async_on_start syntax (#1243) · edc6fdea
  Kris Hung authored May 28, 2025
  
  edc6fdea
- feat: Enable dynamo-run out=trtllm (#1223) · 1b1e089a
  Tanmay Verma authored May 28, 2025
  
  1b1e089a
- fix: ignore setuptools warning (#1239) · fc31a510
  mohammedabdulwahhab authored May 28, 2025
  
  fc31a510
- fix: update kv-router usage (#1238) · 761f67e0
  Hongkuan Zhou authored May 28, 2025
  
  761f67e0
- fix: dynamo-run pass proper args using register-llm (#1230) · cc40af70
  Alec authored May 28, 2025
  
  cc40af70
- fix: dynamo-run add warning if block-size different (#1233) · e450c2c7
  Alec authored May 28, 2025
  
  e450c2c7
- chore: remove pa build (#1231) · 4426e937
  Neelay Shah authored May 28, 2025
  
  4426e937
- fix: resolve regex library warnings (#1237) · cd7a301b
  Emmanuel Ferdman authored May 28, 2025
```
Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>
```
  cd7a301b
- fix: fix operator unit tests (#1227) · 9abe8dff
  julienmancuso authored May 28, 2025
  
  9abe8dff
- fix: devcontainer small qol fixes (#1228) · 07e4720d
  Alec authored May 28, 2025
  
  07e4720d
- chore: bump sglang version (#1219) · 811b10a6
  ishandhanani authored May 27, 2025
  
  811b10a6
- feat: fluxcd guide to managing custom resources (#1220) · c12f61a6
  mohammedabdulwahhab authored May 27, 2025
```
Signed-off-by: mohammedabdulwahhab <furkhan324@berkeley.edu>
```
  c12f61a6
- feat: portable dynamo build (#1215) · 68ac71c4
  Biswa Panda authored May 27, 2025
  
  68ac71c4
- feat: document model caching using Fluid (#1218) · 0594235b
  julienmancuso authored May 27, 2025
```
Signed-off-by: julienmancuso <161955438+julienmancuso@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
```
  0594235b
27 May, 2025 11 commits
- feat(http): add health check endpoint (#1037) · 39d01eac
  ishandhanani authored May 27, 2025
  
  39d01eac
- feat(sglang): add dockerfile/pyproject toml entry + steps to run dsr1 disagg (#1193) · 5c5cec3d
  ishandhanani authored May 27, 2025
```
Signed-off-by: ishandhanani <82981111+ishandhanani@users.noreply.github.com>
Co-authored-by: Neelay Shah <neelays@nvidia.com>
```
  5c5cec3d
- fix: add liveness and readiness probes to Dynamo SDK (#1187) · 088f7eeb
  mohammedabdulwahhab authored May 27, 2025
```
Co-authored-by: Anna Tchernych <atchernych@nvidia.com>
```
  088f7eeb
- feat: Add Hello World Multinode example (#624) · 69dcba7b
  kYLe authored May 27, 2025
  
  69dcba7b
- fix: Add block-size parameter to Router in the example (#1210) · b4f23a13
  Shuaiyi Zhang authored May 28, 2025
```
Signed-off-by: Shuaiyi Zhang <zhangsy28@lenovo.com>
Co-authored-by: Shuaiyi Zhang <zhangsy28@lenovo.com>
Co-authored-by: Yan Ru Pei <yanrpei@gmail.com>
```
  b4f23a13
- docs: fix minor typo (#1206) · a8bdc0be
  Akash authored May 28, 2025
```
Signed-off-by: Akash <akpaul@nvidia.com>
```
  a8bdc0be
- chore: fix loading logs in dynamo serve (#1213) · bd91a175
  ishandhanani authored May 27, 2025
  
  bd91a175
- fix: ignore setuptools warning in pytest (#1212) · 030ceadf
  mohammedabdulwahhab authored May 27, 2025
  
  030ceadf
- feat: NIXL Based RDMA Support w/ Multimodal Example (#1060) · 75e774d4
  J Wyman authored May 27, 2025
  
  75e774d4
- feat: Add metrics and event publishers (#1192) · 9acaa8d1
  Tanmay Verma authored May 27, 2025
  
  9acaa8d1
- docs: Fix broken link to `support_matrix.md` in `README.md` (#1201) · b8272a98
  Hyogeun Oh (오효근) authored May 28, 2025
```
Signed-off-by: Hyogeun Oh <ohg3417@gmail.com>
```
  b8272a98
24 May, 2025 1 commit
- feat: kvbm offload fixes and tests (#1191) · 6d9aac77
  jthomson04 authored May 24, 2025
  
  6d9aac77
23 May, 2025 8 commits
- chore: Add code owners for multimodal examples (#1194) · e5845b53
  Kris Hung authored May 23, 2025
  
  e5845b53
- feat: add dynamo-run example for vllm v0 (#1186) · 7cd0d680
  Hongkuan Zhou authored May 23, 2025
  
  7cd0d680
- chore: rm duplicate fwd pass metric (#1190) · 9d944c27
  Yan Ru Pei authored May 23, 2025
  
  9d944c27
- chore: Upgrade Rust to 1.87 (#1189) · a4c49fe5
  Graham King authored May 23, 2025
  
  a4c49fe5
- fix: etcd.rs - linear increasing watch with number of requests (#1081) · 3f9c3ffe
  Yan Ru Pei authored May 23, 2025
```
Signed-off-by: Michael Feil <63565275+michaelfeil@users.noreply.github.com>
Co-authored-by: Michael Feil <63565275+michaelfeil@users.noreply.github.com>
Co-authored-by: jthomson04 <jwillthomson19@gmail.com>
Co-authored-by: Ryan Olson <ryanolson@users.noreply.github.com>
```
  3f9c3ffe
- feat: add dynamo operator overview doc (#688) · 4eae238f
  julienmancuso authored May 23, 2025
  
  4eae238f
- feat: support k8s target in dynamo deploy command (#1104) · 33e72720
  hhzhang16 authored May 23, 2025
  
  33e72720
- feat: adding arena allocator for storage objects (#1178) · 31ff2370
  Ryan Olson authored May 23, 2025
  
  31ff2370
22 May, 2025 5 commits

fix: add blocking mode for k8s connector in planner (#1176) · 14e1d446
julienmancuso authored May 22, 2025

14e1d446
feat: Add standalone script for TRTLLM integration into dynamo-run (#1162) · 3d4fe574
Tanmay Verma authored May 22, 2025

3d4fe574

feat(dynamo-run): Allow setting KV cache block size (#1175) · 183f2b32

Graham King authored May 22, 2025

Example:
```
dynamo-run out=<engine> <model> --kv-cache-block-size 64
```

In a distributed system this goes on the worker node and is propagated to ingress via the model deployment card.

Previously hard coded to 16, which is now the default.

- Load context_length from model. Closes #1172
- Store context length and KV cache block size in Model Deployment Card #1170

183f2b32

feat: Add TTFT and ITL Interpolation to Profiling Script (#1159) · 7860861f
Hongkuan Zhou authored May 22, 2025
```
Co-authored-by: root <root@kkranen-dt.nvidia.com>
```
7860861f

fix: Fix race condition in kv_router unit test (#1174) · 3bde1e45

Graham King authored May 22, 2025

Removed the hard coded sleeps, explained what we're testing.

Closes https://github.com/ai-dynamo/dynamo/issues/1132

The race condition is that `apply_event` sends a message on a channel, it does not directly apply the event. At some later point the tokio runtime schedules the task running the channel receiver, which applies the event. If that had not happened yet the test would fail.

3bde1e45