- 28 May, 2025 15 commits
-
-
Graham King authored
It was removed from the docs in 0.2.1 and replaced with writing a [standalone Python engine](https://github.com/ai-dynamo/dynamo/blob/main/docs/guides/dynamo_run.md#writing-your-own-engine-in-python). Also remove the associated `dynamo-run` feature `python`. Releasing this in 0.3.0 will resolve #784 and #1109.
-
Kris Hung authored
-
Tanmay Verma authored
-
mohammedabdulwahhab authored
-
Hongkuan Zhou authored
-
Alec authored
-
Alec authored
-
Neelay Shah authored
-
Emmanuel Ferdman authored
Signed-off-by:Emmanuel Ferdman <emmanuelferdman@gmail.com>
-
julienmancuso authored
-
Alec authored
-
ishandhanani authored
-
mohammedabdulwahhab authored
Signed-off-by:mohammedabdulwahhab <furkhan324@berkeley.edu>
-
Biswa Panda authored
-
julienmancuso authored
Signed-off-by:
julienmancuso <161955438+julienmancuso@users.noreply.github.com> Co-authored-by:
Copilot <175728472+Copilot@users.noreply.github.com>
-
- 27 May, 2025 11 commits
-
-
ishandhanani authored
-
ishandhanani authored
Signed-off-by:
ishandhanani <82981111+ishandhanani@users.noreply.github.com> Co-authored-by:
Neelay Shah <neelays@nvidia.com>
-
mohammedabdulwahhab authored
Co-authored-by:Anna Tchernych <atchernych@nvidia.com>
-
kYLe authored
-
Shuaiyi Zhang authored
Signed-off-by:
Shuaiyi Zhang <zhangsy28@lenovo.com> Co-authored-by:
Shuaiyi Zhang <zhangsy28@lenovo.com> Co-authored-by:
Yan Ru Pei <yanrpei@gmail.com>
-
Akash authored
Signed-off-by:Akash <akpaul@nvidia.com>
-
ishandhanani authored
-
mohammedabdulwahhab authored
-
J Wyman authored
-
Tanmay Verma authored
-
Hyogeun Oh (오효근) authored
Signed-off-by:Hyogeun Oh <ohg3417@gmail.com>
-
- 24 May, 2025 1 commit
-
-
jthomson04 authored
-
- 23 May, 2025 8 commits
-
-
Kris Hung authored
-
Hongkuan Zhou authored
-
Yan Ru Pei authored
-
Graham King authored
-
Yan Ru Pei authored
Signed-off-by:
Michael Feil <63565275+michaelfeil@users.noreply.github.com> Co-authored-by:
Michael Feil <63565275+michaelfeil@users.noreply.github.com> Co-authored-by:
jthomson04 <jwillthomson19@gmail.com> Co-authored-by:
Ryan Olson <ryanolson@users.noreply.github.com>
-
julienmancuso authored
-
hhzhang16 authored
-
Ryan Olson authored
-
- 22 May, 2025 5 commits
-
-
julienmancuso authored
-
Tanmay Verma authored
-
Graham King authored
Example: ``` dynamo-run out=<engine> <model> --kv-cache-block-size 64 ``` In a distributed system this goes on the worker node and is propagated to ingress via the model deployment card. Previously hard coded to 16, which is now the default. - Load context_length from model. Closes #1172 - Store context length and KV cache block size in Model Deployment Card #1170
-
Hongkuan Zhou authored
Co-authored-by:root <root@kkranen-dt.nvidia.com>
-
Graham King authored
Removed the hard coded sleeps, explained what we're testing. Closes https://github.com/ai-dynamo/dynamo/issues/1132 The race condition is that `apply_event` sends a message on a channel, it does not directly apply the event. At some later point the tokio runtime schedules the task running the channel receiver, which applies the event. If that had not happened yet the test would fail.
-