- 13 Aug, 2025 4 commits
-
-
Hongkuan Zhou authored
-
Graham King authored
-
ishandhanani authored
-
Hongkuan Zhou authored
Signed-off-by:
Hongkuan Zhou <tedzhouhk@gmail.com> Co-authored-by:
coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
-
- 12 Aug, 2025 7 commits
-
-
GuanLuo authored
Co-authored-by:krishung5 <krish@nvidia.com>
-
Tanmay Verma authored
-
Ryan McCormick authored
-
KrishnanPrash authored
feat: Add frontend support for `min_tokens` and `ignore_eos` (outside of `nvext`) and Structured Output / Guided Decoding (#2380) Signed-off-by:
KrishnanPrash <140860868+KrishnanPrash@users.noreply.github.com> Co-authored-by:
Ryan McCormick <rmccormick@nvidia.com> Co-authored-by:
Ayush Agarwal <ayushag@nvidia.com>
-
Tushar Sharma authored
-
Hongkuan Zhou authored
Signed-off-by:
Hongkuan Zhou <tedzhouhk@gmail.com> Co-authored-by:
coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
-
Kris Hung authored
-
- 11 Aug, 2025 10 commits
-
-
hhzhang16 authored
-
Keiven C authored
Co-authored-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
Hongkuan Zhou authored
-
julienmancuso authored
-
ishandhanani authored
-
Tushar Sharma authored
-
Graham King authored
-
Ryan Olson authored
-
Neal Vaidya authored
-
Faradawn Yang authored
-
- 08 Aug, 2025 4 commits
-
-
J Wyman authored
-
mohammedabdulwahhab authored
-
Tushar Sharma authored
-
Anant Sharma authored
-
- 07 Aug, 2025 12 commits
-
-
Yan Ru Pei authored
Signed-off-by:Yan Ru Pei <yanrpei@gmail.com>
-
Yan Ru Pei authored
-
Tushar Sharma authored
Signed-off-by:
Tushar Sharma <tusharma@nvidia.com> Co-authored-by:
Anant Sharma <anants@nvidia.com>
-
Ayush Agarwal authored
Co-authored-by:Ryan McCormick <rmccormick@nvidia.com>
-
Graham King authored
-
Neelay Shah authored
Signed-off-by:
Neelay Shah <neelays@nvidia.com> Co-authored-by:
coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com> Co-authored-by:
Olga Andreeva <124622579+oandreeva-nv@users.noreply.github.com>
-
Graham King authored
-
Keiven C authored
Co-authored-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
Yingge He authored
-
Richard Huo authored
feat: DIS-323 [trtllm backend publisher] only publish kv event with the biggest window size to support kv routing with variable sliding window attention (#2241)
-
Ryan McCormick authored
fix: Reduce disk space usage in pre-merge-rust action (removes unnecessary sccache artifacts) (#2337)
-
ZichengMa authored
Signed-off-by:
ZichengMa <zichengma1225@gmail.com> Co-authored-by:
Ziqi Fan <ziqif@nvidia.com>
-
- 06 Aug, 2025 3 commits
-
-
Graham King authored
-
Graham King authored
-
Dan Aloni authored
Signed-off-by:Dan Aloni <dan.aloni@vastdata.com>
-