- 07 Apr, 2025 5 commits
-
-
Graham King authored
As a first step towards KV routing: - introduce a `--router-mode` in dynamo-run that only does random and round-robin right now. Not that interesting yet. - Make the vllm engine publish the KV events received from our patched vllm. Now we "just" need to connect the two. Easy right?
-
ishandhanani authored
-
tlipoca9 authored
Co-authored-by:ishandhanani <82981111+ishandhanani@users.noreply.github.com>
-
julienmancuso authored
-
ishandhanani authored
Co-authored-by:finofliu <finofliu@tencent.com>
-
- 05 Apr, 2025 4 commits
-
-
julienmancuso authored
-
Dmitry Tokarev authored
-
mohammedabdulwahhab authored
Co-authored-by:mabdulwahhab <mabdulwahhab@nvidia.com>
-
ishandhanani authored
-
- 04 Apr, 2025 7 commits
-
-
Yan Ru Pei authored
-
Yan Ru Pei authored
-
Graham King authored
-
Graham King authored
Also upgrade the cargo resolver to v3, the default. New clippy lints: - `next_back()` instead of `last()` for a double-ended iterator. That avoids walking the whole list. - ` repeat_n` instead of `repeat.take`. That avoids cloning. - Doc indenting
-
Graham King authored
-
Graham King authored
Adds `@dynamo_worker(static = True)` to create a static worker which has a predictable name and hence does not require discovery or `etcd` to be running. There can only be a single static worker per namespace / component / endpoint trio. This contrasts with the default dynamic `dynamo_worker` endpoints we have now, which get a unique random name (based on namespace/component/endpoint), and are discovered by ingress components using etcd. Also change the hello_world example to use `dynamo_worker(static = True)` so that it is exercised and demonstrated somewhere. For NIM.
-
Kyungmin Lee authored
-
- 03 Apr, 2025 8 commits
-
-
ishandhanani authored
-
Ryan Olson authored
Moved all of `lib/llm/src/engines` to their own crates as e.g. `lib/engines/mistralrs`. This will allow publishing of the `dynamo-llm` crate as it won't have any github dependencies. The only engines in dynamo-llm will be the demo `echo` ones. Co-authored-by:Graham King <grahamk@nvidia.com>
-
Tanmay Verma authored
Co-authored-by:Neelay Shah <neelays@nvidia.com>
-
Graham King authored
-
hanweisen authored
Signed-off-by:hanweisen <hanweisen_yewu@cmss.chinamobile.com>
-
Graham King authored
-
tlipoca9 authored
-
Ryan Olson authored
-
- 02 Apr, 2025 8 commits
-
-
julienmancuso authored
-
Tushar Sharma authored
-
Tushar Sharma authored
-
Harrison Saturley-Hall authored
-
mohammedabdulwahhab authored
Co-authored-by:mabdulwahhab <mabdulwahhab@nvidia.com>
-
jon-chuang authored
Co-authored-by:Ryan Olson <ryanolson@users.noreply.github.com>
-
Ryan Olson authored
-
julienmancuso authored
-
- 01 Apr, 2025 7 commits
-
-
ishandhanani authored
Co-authored-by:hongkuanz <hongkuanz@nvidia.com>
-
Tushar Sharma authored
-
Ryan Olson authored
-
hhzhang16 authored
-
ishandhanani authored
-
Kiv Chen authored
-
Pavithra Vijayakrishnan authored
-
- 31 Mar, 2025 1 commit
-
-
Pavithra Vijayakrishnan authored
-