- 09 Dec, 2025 1 commit
-
-
Vladislav Nosivskoy authored
Signed-off-by:Vladislav Nosivskoy <vladnosiv@gmail.com>
-
- 04 Dec, 2025 1 commit
-
-
Vladislav Nosivskoy authored
Signed-off-by:Vladislav Nosivskoy <vladnosiv@gmail.com>
-
- 03 Dec, 2025 1 commit
-
-
GuanLuo authored
Signed-off-by:
Guan Luo <gluo@nvidia.com> Signed-off-by:
GuanLuo <41310872+GuanLuo@users.noreply.github.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
- 02 Dec, 2025 3 commits
-
-
GuanLuo authored
fix: ModelDeploymentCard obtains full set of eos_token_ids by taking union from different files (#3192) Signed-off-by:
Guan Luo <gluo@nvidia.com> Signed-off-by:
Guan Luo <41310872+GuanLuo@users.noreply.github.com>
-
William Zhang authored
* Why? We would like the ability to configure different parser types. Prior to this commit, only the JSON parser could be configured. * What? This commit refactors the tool parser config in the following ways: - the `format` and `json` fields of `ToolParserConfig` are merged into a single `config` field that is a "discriminated union" type. Each parser type can declare its own configuration options. - a `XmlParserConfig` is defined with a default factory method that corresponds to the Qwen3 coder configuration. - affected calls and tests are adjusted.
-
Biswa Panda authored
-
- 25 Nov, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 21 Nov, 2025 2 commits
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
Ayush Agarwal authored
Signed-off-by:ayushag <ayushag@nvidia.com>
-
- 20 Nov, 2025 1 commit
-
-
Tanmay Verma authored
-
- 19 Nov, 2025 1 commit
-
-
Zhongxuan (Daniel) Wang authored
Signed-off-by:Zhongxuan Wang <daniewang@nvidia.com>
-
- 18 Nov, 2025 2 commits
-
-
tangcy98 authored
Signed-off-by:
zhangzhang <tangchenyu@xiaohongshu.com> Co-authored-by:
zhangzhang <tangchenyu@xiaohongshu.com> Co-authored-by:
Ayush Agarwal <ayushag@nvidia.com>
-
Vladislav Nosivskoy authored
Signed-off-by:Vladislav Nosivskoy <vladnosiv@gmail.com>
-
- 17 Nov, 2025 2 commits
-
-
Keiven C authored
Signed-off-by:
Keiven Chang <keivenchang@users.noreply.github.com> Co-authored-by:
Keiven Chang <keivenchang@users.noreply.github.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 13 Nov, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 11 Nov, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 10 Nov, 2025 2 commits
-
-
Keiven C authored
Signed-off-by:
Keiven Chang <keivenchang@users.noreply.github.com> Co-authored-by:
Keiven Chang <keivenchang@users.noreply.github.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 08 Nov, 2025 3 commits
-
-
mohammedabdulwahhab authored
Signed-off-by:mohammedabdulwahhab <furkhan324@berkeley.edu>
-
Ayush Agarwal authored
Signed-off-by:ayushag <ayushag@nvidia.com>
-
Ryan McCormick authored
feat: Add support for skip_special_tokens parameter in v1/completions and v1/chat/completions endpoints (#4175)
-
- 07 Nov, 2025 2 commits
-
-
zhongdaor-nv authored
feat: enable HTTP completion endpoint to accept arrays of prompts and generate multiple completions per prompt (#3953) Signed-off-by:zhongdaor <zhongdaor@nvidia.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 06 Nov, 2025 1 commit
-
-
KrishnanPrash authored
Signed-off-by:Krishnan Prashanth <kprashanth@nvidia.com>
-
- 05 Nov, 2025 1 commit
-
-
GuanLuo authored
Signed-off-by:Guan Luo <41310872+GuanLuo@users.noreply.github.com>
-
- 04 Nov, 2025 1 commit
-
-
milesial authored
Signed-off-by:Alexandre Milesi <milesial@users.noreply.github.com>
-
- 03 Nov, 2025 1 commit
-
-
KrishnanPrash authored
Signed-off-by:Krishnan Prashanth <kprashanth@nvidia.com>
-
- 31 Oct, 2025 1 commit
-
-
milesial authored
Signed-off-by:Alexandre Milesi <milesial@users.noreply.github.com>
-
- 30 Oct, 2025 1 commit
-
-
GuanLuo authored
Signed-off-by:
Guan Luo <41310872+GuanLuo@users.noreply.github.com> Signed-off-by:
GuanLuo <41310872+GuanLuo@users.noreply.github.com>
-
- 29 Oct, 2025 1 commit
-
-
Ayush Agarwal authored
Signed-off-by:
ayushag <ayushag@nvidia.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
- 28 Oct, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 27 Oct, 2025 2 commits
-
-
milesial authored
Signed-off-by:Alexandre Milesi <30204471+milesial@users.noreply.github.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 24 Oct, 2025 2 commits
-
-
KrishnanPrash authored
Signed-off-by:
Krishnan Prashanth <kprashanth@nvidia.com> Signed-off-by:
KrishnanPrash <140860868+KrishnanPrash@users.noreply.github.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
ryan-lempka authored
Signed-off-by:Ryan Lempka <rlempka@nvidia.com>
-
- 23 Oct, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 22 Oct, 2025 1 commit
-
-
GuanLuo authored
fix: enhance gRPC frontend to return output in raw content field for Triton client compatibility (#3600) Signed-off-by:
Guan Luo <gluo@nvidia.com> Signed-off-by:
Guan Luo <41310872+GuanLuo@users.noreply.github.com>
-
- 21 Oct, 2025 1 commit
-
-
Elyas Mehtabuddin authored
-
- 17 Oct, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-