- 08 Nov, 2025 2 commits
-
-
Ryan McCormick authored
feat: Add support for skip_special_tokens parameter in v1/completions and v1/chat/completions endpoints (#4175)
-
milesial authored
Signed-off-by:Alexandre Milesi <milesial@users.noreply.github.com>
-
- 07 Nov, 2025 4 commits
-
-
zhongdaor-nv authored
feat: enable HTTP completion endpoint to accept arrays of prompts and generate multiple completions per prompt (#3953) Signed-off-by:zhongdaor <zhongdaor@nvidia.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
ryan-lempka authored
-
Ryan McCormick authored
-
- 06 Nov, 2025 4 commits
-
-
Jacky authored
Signed-off-by:Jacky <18255193+kthui@users.noreply.github.com>
-
Nathan Barry authored
Co-authored-by:Ryan McCormick <rmccormick@nvidia.com>
-
KrishnanPrash authored
Signed-off-by:Krishnan Prashanth <kprashanth@nvidia.com>
-
Jacky authored
Signed-off-by:
Jacky <18255193+kthui@users.noreply.github.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
- 05 Nov, 2025 2 commits
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
GuanLuo authored
Signed-off-by:Guan Luo <41310872+GuanLuo@users.noreply.github.com>
-
- 04 Nov, 2025 1 commit
-
-
milesial authored
Signed-off-by:Alexandre Milesi <milesial@users.noreply.github.com>
-
- 03 Nov, 2025 2 commits
-
-
KrishnanPrash authored
Signed-off-by:Krishnan Prashanth <kprashanth@nvidia.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 01 Nov, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 31 Oct, 2025 4 commits
-
-
Richard Huo authored
Signed-off-by:
Anant Sharma <anants@nvidia.com> Co-authored-by:
Anant Sharma <anants@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
milesial authored
Signed-off-by:Alexandre Milesi <milesial@users.noreply.github.com>
-
Biswa Panda authored
-
- 30 Oct, 2025 4 commits
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
GuanLuo authored
Signed-off-by:
Guan Luo <41310872+GuanLuo@users.noreply.github.com> Signed-off-by:
GuanLuo <41310872+GuanLuo@users.noreply.github.com>
-
Tushar Sharma authored
Signed-off-by:Tushar Sharma <tusharma@nvidia.com>
-
Kris Hung authored
Signed-off-by:krishung5 <krish@nvidia.com>
-
- 29 Oct, 2025 2 commits
-
-
Ayush Agarwal authored
Signed-off-by:
ayushag <ayushag@nvidia.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 28 Oct, 2025 3 commits
-
-
jthomson04 authored
Signed-off-by:jthomson04 <jwillthomson19@gmail.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 27 Oct, 2025 3 commits
-
-
milesial authored
Signed-off-by:Alexandre Milesi <30204471+milesial@users.noreply.github.com>
-
zhongdaor-nv authored
Signed-off-by:zhongdaor <zhongdaor@nvidia.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 25 Oct, 2025 1 commit
-
-
ryan-lempka authored
-
- 24 Oct, 2025 7 commits
-
-
ryan-lempka authored
Signed-off-by:
Ryan Lempka <rlempka@nvidia.com> Signed-off-by:
ryan-lempka <rlempka@nvidia.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
Keiven C authored
refactor: redesign the metrics API from Trait to composition to make the code cleaner and easier to understand (#3687) Signed-off-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
KrishnanPrash authored
Signed-off-by:
Krishnan Prashanth <kprashanth@nvidia.com> Signed-off-by:
KrishnanPrash <140860868+KrishnanPrash@users.noreply.github.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
ryan-lempka authored
Signed-off-by:Ryan Lempka <rlempka@nvidia.com>
-
zhongdaor-nv authored
Signed-off-by:zhongdaor <zhongdaor@nvidia.com>
-