- 11 Nov, 2025 1 commit
-
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 07 Nov, 2025 2 commits
-
-
zhongdaor-nv authored
feat: enable HTTP completion endpoint to accept arrays of prompts and generate multiple completions per prompt (#3953) Signed-off-by:zhongdaor <zhongdaor@nvidia.com>
-
Keiven C authored
Signed-off-by:
Keiven Chang <keivenchang@users.noreply.github.com> Co-authored-by:
Keiven Chang <keivenchang@users.noreply.github.com>
-
- 30 Oct, 2025 1 commit
-
-
GuanLuo authored
Signed-off-by:
Guan Luo <41310872+GuanLuo@users.noreply.github.com> Signed-off-by:
GuanLuo <41310872+GuanLuo@users.noreply.github.com>
-
- 27 Oct, 2025 1 commit
-
-
Richard Huo authored
-
- 24 Oct, 2025 1 commit
-
-
KrishnanPrash authored
Signed-off-by:
Krishnan Prashanth <kprashanth@nvidia.com> Signed-off-by:
KrishnanPrash <140860868+KrishnanPrash@users.noreply.github.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com>
-
- 22 Oct, 2025 1 commit
-
-
GuanLuo authored
fix: enhance gRPC frontend to return output in raw content field for Triton client compatibility (#3600) Signed-off-by:
Guan Luo <gluo@nvidia.com> Signed-off-by:
Guan Luo <41310872+GuanLuo@users.noreply.github.com>
-
- 16 Oct, 2025 1 commit
-
-
zhongdaor-nv authored
Signed-off-by:
zhongdaor <zhongdaor@nvidia.com> Signed-off-by:
zhongdaor-nv <zhongdaor@nvidia.com> Co-authored-by:
coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
-
- 14 Oct, 2025 1 commit
-
-
zhongdaor-nv authored
Signed-off-by:zhongdaor <zhongdaor@nvidia.com>
-
- 12 Oct, 2025 1 commit
-
-
Alec authored
Signed-off-by:alec-flowers <aflowers@nvidia.com>
-
- 07 Oct, 2025 1 commit
-
-
zhongdaor-nv authored
Signed-off-by:
zhongdaor <zhongdaor@nvidia.com> Signed-off-by:
zhongdaor-nv <zhongdaor@nvidia.com>
-