- 10 Mar, 2026 1 commit
-
-
jh-nv authored
feat: Add Loki log aggregation, and enable unified OTLP ingestion for both traces and logs through a single endpoint (#6974)
-
- 02 Mar, 2026 1 commit
-
-
Ryan McCormick authored
-
- 12 Feb, 2026 1 commit
-
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 16 Jan, 2026 1 commit
-
-
Neelay Shah authored
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
-
- 09 Jan, 2026 1 commit
-
-
jh-nv authored
Co-authored-by: Ishan Dhanani <ishandhanani@gmail.com>
-
- 02 Jan, 2026 1 commit
-
-
Tushar Sharma authored
Signed-off-by: Tushar Sharma <tusharma@nvidia.com>
-
- 18 Dec, 2025 1 commit
-
-
jh-nv authored
Signed-off-by: Jie Hao <jihao@nvidia.com>
-
- 03 Dec, 2025 1 commit
-
-
Keiven C authored
fix: integration test failures. Support DYN_SYSTEM_PORT=0 for random port binding, and update etcd test (#4687)
Signed-off-by: Keiven Chang <keivenchang@users.noreply.github.com>
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
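The random port binding mentioned in this commit presumably relies on the standard behavior that binding to port 0 lets the operating system pick a free port. A minimal sketch of that behavior using only the Rust standard library follows. The helper names `bind_system_port` and `port_from_env` are hypothetical, and falling back to 0 when `DYN_SYSTEM_PORT` is unset or unparsable is an assumption made for illustration, not something the commit states.

```rust
use std::env;
use std::net::TcpListener;

/// Bind a listener on localhost. Port 0 asks the OS to choose any free port.
fn bind_system_port(port: u16) -> std::io::Result<TcpListener> {
    TcpListener::bind(("127.0.0.1", port))
}

/// Hypothetical helper: read DYN_SYSTEM_PORT from the environment.
/// Defaulting to 0 when the variable is unset or invalid is an assumption.
fn port_from_env() -> u16 {
    env::var("DYN_SYSTEM_PORT")
        .ok()
        .and_then(|v| v.parse().ok())
        .unwrap_or(0)
}

fn main() -> std::io::Result<()> {
    let listener = bind_system_port(port_from_env())?;
    // The port actually assigned is only known after binding.
    let actual = listener.local_addr()?.port();
    println!("system port bound to {actual}");
    Ok(())
}
```

Reading the assigned port back from `local_addr()` is what lets integration tests run in parallel without colliding on a fixed port.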
-
- 17 Nov, 2025 1 commit
-
-
Keiven C authored
Signed-off-by: Keiven Chang <keivenchang@users.noreply.github.com>
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
-
- 13 Nov, 2025 1 commit
-
-
Biswa Panda authored
-
- 10 Nov, 2025 1 commit
-
-
Keiven C authored
Signed-off-by: Keiven Chang <keivenchang@users.noreply.github.com>
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
-
- 04 Nov, 2025 1 commit
-
-
Thomas Montfort authored
Co-authored-by: mohammedabdulwahhab <furkhan324@berkeley.edu>
-
- 01 Nov, 2025 1 commit
-
-
Thomas Montfort authored
-
- 23 Oct, 2025 1 commit
-
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 13 Oct, 2025 1 commit
-
-
mohammedabdulwahhab authored
Signed-off-by: mohammedabdulwahhab <furkhan324@berkeley.edu>
-
- 22 Aug, 2025 2 commits
-
-
Graham King authored
-
Waël Boukhobza authored
-
- 07 Aug, 2025 1 commit
-
-
Neelay Shah authored
Signed-off-by: Neelay Shah <neelays@nvidia.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
Co-authored-by: Olga Andreeva <124622579+oandreeva-nv@users.noreply.github.com>
-
- 28 Jul, 2025 1 commit
-
-
Neelay Shah authored
Signed-off-by: Neelay Shah <neelays@nvidia.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
-
- 17 Jul, 2025 2 commits
-
-
Kris Hung authored
-
Graham King authored
-
- 21 Apr, 2025 1 commit
-
-
Abrar Shivani authored
-
- 01 Apr, 2025 1 commit
-
-
Ryan Olson authored
-
- 14 Mar, 2025 1 commit
-
-
Ryan McCormick authored
-
- 08 Mar, 2025 1 commit
-
-
Neelay Shah authored
Co-authored-by: Biswa Panda <biswa.panda@gmail.com>
-
- 05 Mar, 2025 1 commit
-
-
Neelay Shah authored
Co-authored-by: Graham King <grahamk@nvidia.com>
-
- 25 Feb, 2025 1 commit
-
-
Neelay Shah authored
Signed-off-by: Neelay Shah <neelays@nvidia.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
-
- 22 Feb, 2025 1 commit
-
-
Ryan Olson authored
Enables `#[tokio::test]` via `Runtime::from_current()`. This uses the current handle as both the primary and secondary.
-
- 21 Feb, 2025 1 commit
-
-
Graham King authored
Add support in tio for distributed components and discovery.

Node 1:
```
tio in=http out=tdr://ns/backend/mistralrs
```

Node 2:
```
tio in=tdr://ns/backend/mistralrs out=mistralrs ~/llm_models/Llama-3.2-3B-Instruct
```

This will use etcd to auto-discover the model and NATS to talk to it. You can run multiple workers on the same endpoint and it will pick one at random each time.

The `ns/backend/mistralrs` parts are purely symbolic; pick anything as long as it has three parts and matches the other node.
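The three-part rule for `tdr://` addresses can be illustrated with a small parser. This is only a sketch of the constraint described in the commit message, not the code tio uses. The type `EndpointPath`, its field names (`namespace`, `component`, `endpoint`), and the function `parse_tdr` are all hypothetical; the message says only that the path needs three parts and must match on both nodes.

```rust
/// Hypothetical representation of a three-part tdr:// path.
/// The field names are assumptions chosen for illustration.
#[derive(Debug, PartialEq)]
struct EndpointPath {
    namespace: String,
    component: String,
    endpoint: String,
}

/// Parse `tdr://a/b/c`, rejecting anything without exactly three non-empty parts.
fn parse_tdr(url: &str) -> Option<EndpointPath> {
    let rest = url.strip_prefix("tdr://")?;
    let parts: Vec<&str> = rest.split('/').collect();
    match parts.as_slice() {
        [a, b, c] if !a.is_empty() && !b.is_empty() && !c.is_empty() => Some(EndpointPath {
            namespace: (*a).to_string(),
            component: (*b).to_string(),
            endpoint: (*c).to_string(),
        }),
        _ => None,
    }
}

fn main() {
    // Both nodes must use the same three-part path to find each other.
    let node1 = parse_tdr("tdr://ns/backend/mistralrs");
    let node2 = parse_tdr("tdr://ns/backend/mistralrs");
    println!("paths match: {}", node1.is_some() && node1 == node2);

    if let Some(p) = node1 {
        println!("{} / {} / {}", p.namespace, p.component, p.endpoint);
    }
}
```

Discovery through etcd, transport over NATS, and the random choice among workers are outside the scope of this sketch.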
-
- 18 Feb, 2025 1 commit
-
-
Ryan Olson authored
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
-
- 13 Feb, 2025 1 commit
-
-
Ryan Olson authored
-