- 19 Sep, 2025 6 commits
-
-
nachiketb-nvidia authored
Signed-off-by:nachiketb <nachiketb@nvidia.com>
-
Yan Ru Pei authored
feat: allow router to not track active blocks (prefill), and to not track cached blocks (decode) (#3135) Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
Olga Andreeva authored
feat: KVBM connector : enabling vectorized copy from pinned memory to device memory and vice versa (#2989) Signed-off-by:
Olga Andreeva <oandreeva@nvidia.com> Signed-off-by:
oandreeva-nv <oandreeva-nv@nvidia.com> Co-authored-by:
Ziqi Fan <ziqif@nvidia.com> Co-authored-by:
oandreeva-nv <oandreeva-nv@nvidia.com>
-
KrishnanPrash authored
Signed-off-by:Krishnan Prashanth <kprashanth@nvidia.com>
-
Jacky authored
Signed-off-by:Jacky <18255193+kthui@users.noreply.github.com>
-
- 18 Sep, 2025 5 commits
-
-
zhongdaor-nv authored
feat: enhance GPT OSS frontend with improved harmony tool calling parser and reasoning parser (#2999) Signed-off-by:zhongdaor <zhongdaor@nvidia.com>
-
Elyas Mehtabuddin authored
Signed-off-by:Elyas Mehtabuddin <emehtabuddin@nvidia.com>
-
Harrison Saturley-Hall authored
Signed-off-by:Harrison Saturley-Hall <hsaturleyhal@nvidia.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
Ayush Agarwal authored
Signed-off-by:ayushag <ayushag@nvidia.com>
-
- 17 Sep, 2025 10 commits
-
-
Richard Huo authored
fix: Update the KVBM <> TRT-LLM integration interface to match the latest TRT-LLM connector API (#2979) Signed-off-by:richardhuo-nv <rihuo@nvidia.com>
-
Ayush Agarwal authored
Signed-off-by:
ayushag <ayushag@nvidia.com> Signed-off-by:
Graham King <grahamk@nvidia.com> Co-authored-by:
Graham King <grahamk@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
Greg Clark authored
Signed-off-by:Greg Clark <grclark@nvidia.com>
-
Chi McIsaac authored
Signed-off-by:Chi McIsaac <chixie.mcisaac@gmail.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
Tzu-Ling Kan authored
Signed-off-by:tzulingk@nvidia.com <tzulingk@nvidia.com>
-
Michael Feil authored
Signed-off-by:michaelfeil <me@michaelfeil.eu>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 16 Sep, 2025 10 commits
-
-
Michael Feil authored
Signed-off-by:michaelfeil <me@michaelfeil.eu>
-
Biswa Panda authored
Signed-off-by:Biswa Panda <biswa.panda@gmail.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
Keiven C authored
Signed-off-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
ryan-lempka authored
Signed-off-by:Ryan Lempka <rlempka@nvidia.com>
-
Ryan Olson authored
Signed-off-by:Ryan Olson <rolson@nvidia.com>
-
Ayush Agarwal authored
Signed-off-by:ayushag <ayushag@nvidia.com>
-
- 15 Sep, 2025 4 commits
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
Elyas Mehtabuddin authored
Signed-off-by:
ayushag <ayushag@nvidia.com> Signed-off-by:
Biswa Panda <biswa.panda@gmail.com> Co-authored-by:
ayushag <ayushag@nvidia.com> Co-authored-by:
Biswa Panda <biswa.panda@gmail.com>
-
Ziqi Fan authored
Signed-off-by:Ziqi Fan <ziqif@nvidia.com>
-
Ziqi Fan authored
Signed-off-by:Ziqi Fan <ziqif@nvidia.com>
-
- 11 Sep, 2025 1 commit
-
-
Alec authored
Signed-off-by:alec-flowers <aflowers@nvidia.com>
-
- 10 Sep, 2025 4 commits
-
-
blarson-b10 authored
Signed-off-by:Brian Larson <brian.larson@baseten.co>
-
Neelay Shah authored
Signed-off-by:nnshah1 <neelays@nvidia.com>
-
Jacky authored
Signed-off-by:Jacky <18255193+kthui@users.noreply.github.com>
-
Michael Feil authored
Signed-off-by:michaelfeil <me@michaelfeil.eu>
-