- 24 Oct, 2025 1 commit
-
-
Keiven C authored
refactor: redesign the metrics API from Trait to composition to make the code cleaner and easier to understand (#3687)
Signed-off-by: Keiven Chang <keivenchang@users.noreply.github.com>
-
- 17 Oct, 2025 1 commit
-
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 16 Sep, 2025 1 commit
-
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 15 Aug, 2025 1 commit
-
-
Keiven C authored
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
-
- 14 Aug, 2025 1 commit
-
-
Tzu-Ling Kan authored
-
- 31 Jul, 2025 1 commit
-
-
Keiven C authored
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
-
- 22 Jul, 2025 1 commit
-
-
Keiven C authored
feat: add a hierarchical Prometheus MetricsRegistry trait for DistributedRuntime, Namespace, Components, and Endpoint (#2008)
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
Co-authored-by: Ryan Olson <rolson@nvidia.com>
-
- 17 Mar, 2025 1 commit
-
-
GuanLuo authored
-
- 07 Mar, 2025 1 commit
-
-
Ryan McCormick authored
Replaces hard-coded "kv-hit-rate" string in multiple places with KV_HIT_RATE_SUBJECT constant in lib/llm.
-
- 06 Mar, 2025 1 commit
-
-
Ryan McCormick authored
-
- 25 Feb, 2025 1 commit
-
-
Neelay Shah authored
Signed-off-by: Neelay Shah <neelays@nvidia.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
-
- 22 Feb, 2025 1 commit
-
-
Ryan Olson authored
Enables `#[tokio::test]` via `Runtime::from_current()`. This uses the current handle as both the primary and secondary.
-
- 21 Feb, 2025 1 commit
-
-
Ryan Olson authored
Signed-off-by: Ryan Olson <ryanolson@users.noreply.github.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
-