- 04 Sep, 2025 1 commit
-
-
zhongdaor-nv authored
Co-authored-by:Zhongdao Ren <zhongdaor@ipp1-3309.ipp1a1.colossus.nvidia.com>
-
- 03 Sep, 2025 1 commit
-
-
Olga Andreeva authored
refactor: Split ModelType to ModelInput for request and response type; ModelType for the supported workloads (#2714) Signed-off-by:
Guan Luo <gluo@nvidia.com> Signed-off-by:
GuanLuo <41310872+GuanLuo@users.noreply.github.com> Co-authored-by:
Guan Luo <gluo@nvidia.com> Co-authored-by:
GuanLuo <41310872+GuanLuo@users.noreply.github.com>
-
- 02 Sep, 2025 3 commits
-
-
Ayush Agarwal authored
Signed-off-by:Ayush Agarwal <ayushag@nvidia.com>
-
Ayush Agarwal authored
Signed-off-by:Ayush Agarwal <ayushag@nvidia.com>
-
Tzu-Ling Kan authored
Signed-off-by:tzulingk@nvidia.com <tzulingk@nvidia.com>
-
- 30 Aug, 2025 1 commit
-
-
Bhuvan Agrawal authored
Signed-off-by:
Bhuvan Agrawal <11240550+bhuvan002@users.noreply.github.com> Co-authored-by:
coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
-
- 29 Aug, 2025 3 commits
-
-
nachiketb-nvidia authored
Signed-off-by:nachiketb <nachiketb@nvidia.com>
-
Indrajit Bhosale authored
Signed-off-by:Indrajit Bhosale <iamindrajitb@gmail.com>
-
Alec authored
Signed-off-by:alec-flowers <aflowers@nvidia.com>
-
- 27 Aug, 2025 1 commit
-
-
Tzu-Ling Kan authored
-
- 26 Aug, 2025 1 commit
-
-
nachiketb-nvidia authored
-
- 25 Aug, 2025 1 commit
-
-
Ryan McCormick authored
-
- 22 Aug, 2025 2 commits
-
-
Richard Huo authored
fix: [TRTLLM+ LLAMA4 + Eagle 3] Remove the ‘two-models config’ and set the ‘one-model’ solution as the default (#2661)
-
Tanmay Verma authored
-
- 21 Aug, 2025 1 commit
-
-
Biswa Panda authored
-
- 20 Aug, 2025 1 commit
-
-
Ryan McCormick authored
-
- 19 Aug, 2025 2 commits
-
-
Tanmay Verma authored
-
Anish authored
-
- 18 Aug, 2025 3 commits
-
-
mohammedabdulwahhab authored
-
julienmancuso authored
-
Ryan McCormick authored
docs: Bring back some missed release/0.4.0 doc changes, fix broken links, add lychee link checker github action (#2482)
-
- 16 Aug, 2025 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:Yan Ru Pei <yanrpei@gmail.com>
-
- 15 Aug, 2025 3 commits
-
-
julienmancuso authored
-
Graham King authored
-
Tanmay Verma authored
-
- 14 Aug, 2025 4 commits
-
-
mohammedabdulwahhab authored
Signed-off-by:mohammedabdulwahhab <furkhan324@berkeley.edu>
-
Jorge António authored
Co-authored-by:Yan Ru Pei <yanrpei@gmail.com>
-
Biswa Panda authored
-
Anish authored
Signed-off-by:Anish <80174047+athreesh@users.noreply.github.com>
-
- 13 Aug, 2025 2 commits
-
-
Tanmay Verma authored
-
Ryan McCormick authored
-
- 12 Aug, 2025 1 commit
-
-
Tanmay Verma authored
-
- 11 Aug, 2025 1 commit
-
-
Neal Vaidya authored
-
- 07 Aug, 2025 1 commit
-
-
Richard Huo authored
feat: DIS-323 [trtllm backend publisher] only publish kv event with the biggest window size to support kv routing with variable sliding window attention (#2241)
-
- 06 Aug, 2025 1 commit
-
-
Indrajit Bhosale authored
-
- 05 Aug, 2025 5 commits
-
-
Neal Vaidya authored
-
Neal Vaidya authored
-
Richard Huo authored
-
Neal Vaidya authored
Signed-off-by:
jthomson04 <jwillthomson19@gmail.com> Co-authored-by:
jthomson04 <jwillthomson19@gmail.com> Co-authored-by:
John Thomson (DLAlgo) <jothomson@nvidia.com> Co-authored-by:
Neelay Shah <neelays@nvidia.com>
-
Ryan McCormick authored
-