- 24 Oct, 2025 1 commit
-
-
Keiven C authored
refactor: redesign the metrics API from Trait to composition to make the code cleaner and easier to understand (#3687) Signed-off-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
- 23 Oct, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 21 Oct, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 17 Oct, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 11 Oct, 2025 1 commit
-
-
Keiven C authored
Signed-off-by:
Keiven Chang <keivenchang@users.noreply.github.com> Co-authored-by:
Keiven Chang <keivenchang@users.noreply.github.com>
-
- 10 Oct, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 09 Oct, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 08 Oct, 2025 1 commit
-
-
Keiven C authored
Signed-off-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
- 18 Sep, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 17 Sep, 2025 1 commit
-
-
Tzu-Ling Kan authored
Signed-off-by:tzulingk@nvidia.com <tzulingk@nvidia.com>
-
- 16 Sep, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 03 Sep, 2025 1 commit
-
-
Hongkuan Zhou authored
Signed-off-by:
hongkuanz <hongkuanz@nvidia.com> Signed-off-by:
hongkuan <hongkuanz@nvidia.com>
-
- 29 Aug, 2025 1 commit
-
-
Keiven C authored
Signed-off-by:Keiven C <213854356+keivenchang@users.noreply.github.com>
-
- 25 Aug, 2025 1 commit
-
-
Keiven C authored
Co-authored-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
- 22 Aug, 2025 2 commits
-
-
Keiven C authored
Co-authored-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
Graham King authored
-
- 21 Aug, 2025 1 commit
-
-
Keiven C authored
Co-authored-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
- 19 Aug, 2025 2 commits
-
-
Ryan Olson authored
Signed-off-by:
Ryan Olson <rolson@nvidia.com> Co-authored-by:
Olga Andreeva <oandreeva@nvidia.com> Co-authored-by:
Ziqi Fan <ziqif@nvidia.com> Co-authored-by:
John Thompson <jothomson@nvidia.com> Co-authored-by:
Richard Huo <rihuo@nvidia.com> Co-authored-by:
Zicheng Ma <zichengm@nvidia.com>
-
Keiven C authored
fix: use tokio spawn / interval.tick(), make nats metric names clearer, fix tests sharing environment variables (temp_env) (#2506) Co-authored-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
- 18 Aug, 2025 1 commit
-
-
Keiven C authored
Co-authored-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
- 15 Aug, 2025 1 commit
-
-
Keiven C authored
Co-authored-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
- 14 Aug, 2025 1 commit
-
-
Tzu-Ling Kan authored
-
- 13 Aug, 2025 1 commit
-
-
jthomson04 authored
Signed-off-by:jthomson04 <jwillthomson19@gmail.com>
-
- 11 Aug, 2025 1 commit
-
-
Keiven C authored
Co-authored-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
- 07 Aug, 2025 1 commit
-
-
Yingge He authored
-
- 05 Aug, 2025 1 commit
-
-
Yingge He authored
-
- 28 Jul, 2025 1 commit
-
-
Keiven C authored
Co-authored-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
- 23 Jul, 2025 1 commit
-
-
Neelay Shah authored
-
- 22 Jul, 2025 1 commit
-
-
Keiven C authored
feat: add a hierarchical Prometheus MetricsRegistry trait for DistributedRuntime, Namespace, Components, and Endpoint (#2008) Co-authored-by:
Keiven Chang <keivenchang@users.noreply.github.com> Co-authored-by:
Ryan Olson <rolson@nvidia.com>
-
- 15 Jul, 2025 1 commit
-
-
Keiven C authored
Co-authored-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
- 08 Jul, 2025 1 commit
-
-
ZichengMa authored
-
- 11 Jun, 2025 1 commit
-
-
Ryan Olson authored
-
- 23 May, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:
Michael Feil <63565275+michaelfeil@users.noreply.github.com> Co-authored-by:
Michael Feil <63565275+michaelfeil@users.noreply.github.com> Co-authored-by:
jthomson04 <jwillthomson19@gmail.com> Co-authored-by:
Ryan Olson <ryanolson@users.noreply.github.com>
-
- 07 Apr, 2025 1 commit
-
-
Graham King authored
As a first step towards KV routing: - introduce a `--router-mode` in dynamo-run that only does random and round-robin right now. Not that interesting yet. - Make the vllm engine publish the KV events received from our patched vllm. Now we "just" need to connect the two. Easy right?
-
- 04 Apr, 2025 1 commit
-
-
Graham King authored
Adds `@dynamo_worker(static = True)` to create a static worker which has a predictable name and hence does not require discovery or `etcd` to be running. There can only be a single static worker per namespace / component / endpoint trio. This contrasts with the default dynamic `dynamo_worker` endpoints we have now, which get a unique random name (based on namespace/component/endpoint), and are discovered by ingress components using etcd. Also change the hello_world example to use `dynamo_worker(static = True)` so that it is exercised and demonstrated somewhere. For NIM.
-
- 25 Feb, 2025 1 commit
-
-
Neelay Shah authored
Signed-off-by:
Neelay Shah <neelays@nvidia.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
- 18 Feb, 2025 2 commits
-
-
Ryan Olson authored
Co-authored-by:Ryan McCormick <rmccormick@nvidia.com>
-
GuanLuo authored
Signed-off-by:
Neelay Shah <neelays@nvidia.com> Co-authored-by:
aflowers <aflowers@nvidia.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com> Co-authored-by:
hongkuanz <hongkuanz@nvidia.com> Co-authored-by:
Neelay Shah <neelays@nvidia.com>
-
- 05 Feb, 2025 1 commit
-
-
J Wyman authored
-
- 04 Feb, 2025 1 commit
-
-
Ryan Olson authored
the journey begins
-