- 10 Apr, 2025 10 commits
-
-
Anant Sharma authored
-
Anant Sharma authored
-
ishandhanani authored
-
Ziqi Fan authored
-
Alec authored
-
Zhan Lu authored
-
Anant Sharma authored
-
Carsten Csiky authored
Co-authored-by:Meenakshi Sharma <163925564+nvda-mesharma@users.noreply.github.com>
-
tlipoca9 authored
Neelay approved merge
-
mohammedabdulwahhab authored
Co-authored-by:
Hannah Zhang <hannahz@nvidia.com> Co-authored-by:
Julien Mancuso <jmancuso@nvidia.com> Co-authored-by:
mabdulwahhab <mabdulwahhab@nvidia.com>
-
- 09 Apr, 2025 10 commits
-
-
jon-chuang authored
feat: Extract Common Configs + Log Configs on Init + Add `test_` to `sdk/tests` filenames required for pytest (#434) Co-authored-by:ishandhanani <82981111+ishandhanani@users.noreply.github.com>
-
Tanmay Verma authored
-
ishandhanani authored
-
Jacky authored
Signed-off-by:
Jacky <18255193+kthui@users.noreply.github.com> Co-authored-by:
Piotr Marcinkiewicz <piotrm@nvidia.com> Co-authored-by:
Neelay Shah <neelays@nvidia.com>
-
Suman Tatiraju authored
-
julienmancuso authored
-
Anant Sharma authored
-
cdgamarose-nv authored
#### Overview: Updated the dynamo run doc `docs/guides/dynamo_run.md` #### Details: - Updated the instructions to make it clear which binary to use for built backends - Reformatted the doc to make it more readable - Added missing cmake library for ubuntu Signed-off-by:Chantal D Gama Rose <cdgamarose@nvidia.com>
-
Pavithra Vijayakrishnan authored
Signed-off-by:Pavithra Vijayakrishnan <160681768+pvijayakrish@users.noreply.github.com>
-
Jacky authored
Signed-off-by:
Jacky <18255193+kthui@users.noreply.github.com> Co-authored-by:
Neelay Shah <neelays@nvidia.com>
-
- 08 Apr, 2025 4 commits
-
-
Tushar Sharma authored
-
Hongkuan Zhou authored
-
Ziqi Fan authored
-
Tanmay Verma authored
Signed-off-by:
Tanmay Verma <tanmay2592@gmail.com> Co-authored-by:
Dmitry Tokarev <dtokarev@nvidia.com>
-
- 07 Apr, 2025 6 commits
-
-
ishandhanani authored
Signed-off-by:
ishandhanani <82981111+ishandhanani@users.noreply.github.com> Co-authored-by:
Neelay Shah <neelays@nvidia.com>
-
Graham King authored
As a first step towards KV routing: - introduce a `--router-mode` in dynamo-run that only does random and round-robin right now. Not that interesting yet. - Make the vllm engine publish the KV events received from our patched vllm. Now we "just" need to connect the two. Easy right?
-
ishandhanani authored
-
tlipoca9 authored
Co-authored-by:ishandhanani <82981111+ishandhanani@users.noreply.github.com>
-
julienmancuso authored
-
ishandhanani authored
Co-authored-by:finofliu <finofliu@tencent.com>
-
- 05 Apr, 2025 4 commits
-
-
julienmancuso authored
-
Dmitry Tokarev authored
-
mohammedabdulwahhab authored
Co-authored-by:mabdulwahhab <mabdulwahhab@nvidia.com>
-
ishandhanani authored
-
- 04 Apr, 2025 6 commits
-
-
Yan Ru Pei authored
-
Yan Ru Pei authored
-
Graham King authored
-
Graham King authored
Also upgrade the cargo resolver to v3, the default. New clippy lints: - `next_back()` instead of `last()` for a double-ended iterator. That avoids walking the whole list. - ` repeat_n` instead of `repeat.take`. That avoids cloning. - Doc indenting
-
Graham King authored
-
Graham King authored
Adds `@dynamo_worker(static = True)` to create a static worker which has a predictable name and hence does not require discovery or `etcd` to be running. There can only be a single static worker per namespace / component / endpoint trio. This contrasts with the default dynamic `dynamo_worker` endpoints we have now, which get a unique random name (based on namespace/component/endpoint), and are discovered by ingress components using etcd. Also change the hello_world example to use `dynamo_worker(static = True)` so that it is exercised and demonstrated somewhere. For NIM.
-