- 18 Feb, 2026 1 commit
-
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
-
- 14 Feb, 2026 1 commit
-
-
Ryan Olson authored
Signed-off-by: Ryan Olson <rolson@nvidia.com>
-
- 06 Feb, 2026 1 commit
-
-
Anant Sharma authored
Signed-off-by: Anant Sharma <anants@nvidia.com>
-
- 28 Jan, 2026 1 commit
-
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 27 Jan, 2026 1 commit
-
-
Pavithra Vijayakrishnan authored
Signed-off-by: pvijayakrish <pvijayakrish@nvidia.com>
-
- 19 Dec, 2025 1 commit
-
-
Anant Sharma authored
Signed-off-by: Anant Sharma <anants@nvidia.com>
-
- 18 Dec, 2025 1 commit
-
-
Anant Sharma authored
Signed-off-by: Anant Sharma <anants@nvidia.com>
-
- 21 Nov, 2025 1 commit
-
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 19 Nov, 2025 1 commit
-
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 11 Nov, 2025 1 commit
-
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 10 Nov, 2025 2 commits
-
-
Anant Sharma authored
Signed-off-by: Anant Sharma <anants@nvidia.com>
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 07 Nov, 2025 1 commit
-
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 27 Oct, 2025 1 commit
-
-
Tushar Sharma authored
Signed-off-by: Tushar Sharma <tusharma@nvidia.com>
-
- 21 Oct, 2025 1 commit
-
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 13 Oct, 2025 1 commit
-
-
Harrison Saturley-Hall authored
Signed-off-by: Harrison Saturley-Hall <hsaturleyhal@nvidia.com>
-
- 30 Sep, 2025 1 commit
-
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 24 Sep, 2025 1 commit
-
-
Harrison Saturley-Hall authored
Signed-off-by: Harrison Saturley-Hall <hsaturleyhal@nvidia.com>
-
- 05 Sep, 2025 1 commit
-
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 02 Sep, 2025 1 commit
-
-
Harrison Saturley-Hall authored
Signed-off-by: Harrison King Saturley-Hall <hsaturleyhal@nvidia.com>
-
- 22 Aug, 2025 1 commit
-
-
Graham King authored
-
- 20 Aug, 2025 1 commit
-
-
Dmitry Tokarev authored
-
- 18 Aug, 2025 1 commit
-
-
Keiven C authored
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
-
- 13 Aug, 2025 1 commit
-
-
Graham King authored
-
- 07 Aug, 2025 1 commit
-
-
Graham King authored
-
- 30 Jul, 2025 1 commit
-
-
Dmitry Tokarev authored
-
- 28 Jul, 2025 1 commit
-
-
Keiven C authored
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
-
- 22 Jul, 2025 1 commit
-
-
Keiven C authored
feat: add a hierarchical Prometheus MetricsRegistry trait for DistributedRuntime, Namespace, Components, and Endpoint (#2008)
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
Co-authored-by: Ryan Olson <rolson@nvidia.com>
-
- 16 Jul, 2025 1 commit
-
-
Graham King authored
-
- 08 Jul, 2025 1 commit
-
-
ZichengMa authored
-
- 07 Jul, 2025 1 commit
-
-
Anant Sharma authored
-
- 03 Jul, 2025 1 commit
-
-
Graham King authored
-
- 13 Jun, 2025 1 commit
-
-
Anant Sharma authored
-
- 29 May, 2025 1 commit
-
-
Anant Sharma authored
-
- 09 May, 2025 2 commits
-
-
Harrison Saturley-Hall authored
-
wxsm authored
Allow either password or TLS auth; if neither is provided, fall back to no auth. Closes #657
-
- 25 Apr, 2025 2 commits
-
-
Harrison Saturley-Hall authored
Signed-off-by: Harrison Saturley-Hall <454891+saturley-hall@users.noreply.github.com>
-
Graham King authored
This will allow an ingress-side pre-processor to see it without needing a model checkout.

Currently pre-processing is done in the worker, which has local access to the model deployment card ("MDC") files (`config.json`, `tokenizer.json` and `tokenizer_config.json`). We want to move the pre-processor to the ingress side to support KV routing. That requires the ingress side (i.e. the HTTP server), which runs on a different machine than the worker, to be able to see those three files.

To support that, this PR makes the worker upload the contents of those files to the NATS object store and publish the MDC, with those NATS URLs, to the key-value store. The key-value store has an interface, so any store (NATS, etcd, Redis, etc.) can be supported. Implementations for memory and NATS are provided.

Fetching the MDC from the store, doing pre-processing on the ingress side, and publishing a card backed by a GGUF are all left for a later commit.

Part of #743
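
The store interface described in this commit message can be pictured roughly as follows. This is a minimal sketch only, not the code from the PR: the names `KeyValueStore`, `MemoryStore`, `ModelDeploymentCard`, `publish_card` and `fetch_card`, the synchronous method signatures, and the newline-separated encoding are all assumptions made for illustration. It shows the general idea of a card that carries URLs for the three files, published through a store trait that an in-memory backend (or, hypothetically, a NATS or etcd backend) could implement.

```rust
use std::collections::HashMap;
use std::sync::Mutex;

/// Hypothetical store interface; the real trait in the PR may differ.
pub trait KeyValueStore {
    fn put(&self, key: &str, value: Vec<u8>);
    fn get(&self, key: &str) -> Option<Vec<u8>>;
}

/// In-memory backend, useful for tests. Other backends would implement the same trait.
#[derive(Default)]
pub struct MemoryStore {
    entries: Mutex<HashMap<String, Vec<u8>>>,
}

impl KeyValueStore for MemoryStore {
    fn put(&self, key: &str, value: Vec<u8>) {
        self.entries.lock().unwrap().insert(key.to_string(), value);
    }

    fn get(&self, key: &str) -> Option<Vec<u8>> {
        self.entries.lock().unwrap().get(key).cloned()
    }
}

/// Hypothetical card: holds URLs pointing at the uploaded files, not their contents.
#[derive(Debug, Clone, PartialEq)]
pub struct ModelDeploymentCard {
    pub config_url: String,
    pub tokenizer_url: String,
    pub tokenizer_config_url: String,
}

impl ModelDeploymentCard {
    /// Illustrative encoding: one URL per line (assumes URLs contain no newlines).
    fn encode(&self) -> Vec<u8> {
        format!(
            "{}\n{}\n{}",
            self.config_url, self.tokenizer_url, self.tokenizer_config_url
        )
        .into_bytes()
    }

    /// Returns None if the bytes are not UTF-8 or fewer than three lines are present.
    fn decode(bytes: &[u8]) -> Option<Self> {
        let text = std::str::from_utf8(bytes).ok()?;
        let mut lines = text.lines();
        Some(Self {
            config_url: lines.next()?.to_string(),
            tokenizer_url: lines.next()?.to_string(),
            tokenizer_config_url: lines.next()?.to_string(),
        })
    }
}

/// Worker side: publish the card under a key.
pub fn publish_card(store: &dyn KeyValueStore, key: &str, card: &ModelDeploymentCard) {
    store.put(key, card.encode());
}

/// Ingress side: look the card up again.
pub fn fetch_card(store: &dyn KeyValueStore, key: &str) -> Option<ModelDeploymentCard> {
    store.get(key).and_then(|bytes| ModelDeploymentCard::decode(&bytes))
}
```

Because both functions take `&dyn KeyValueStore`, a worker and an ingress process could share the same calling code while the backend is swapped; in this sketch, passing a `MemoryStore` to `publish_card` and then to `fetch_card` with the same key returns the original card.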
-
- 09 Apr, 2025 1 commit
-
-
Anant Sharma authored
-
- 31 Mar, 2025 1 commit
-
-
Ryan Olson authored
-