- 23 Apr, 2026 2 commits
-
-
Tushar Sharma authored
Signed-off-by:Tushar Sharma <tusharma@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 22 Apr, 2026 11 commits
-
-
Jacky authored
refactor: Removes obsolete TRT-LLM request-abort disable wiring and re-enables cancellation test coverage (#8562) Signed-off-by:Jacky <18255193+kthui@users.noreply.github.com>
-
Biswa Panda authored
-
Dmitry Tokarev authored
Signed-off-by:
Dmitry Tokarev <dtokarev@nvidia.com> Co-authored-by:
Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Richard Huo authored
fix: remove the tool calling flaky test as the small model is not stable in generating Json format (#8523)
-
GuanLuo authored
Signed-off-by:
Guan Luo <41310872+GuanLuo@users.noreply.github.com> Signed-off-by:
GuanLuo <41310872+GuanLuo@users.noreply.github.com>
-
Jacky authored
Signed-off-by:Jacky <18255193+kthui@users.noreply.github.com>
-
ishandhanani authored
Signed-off-by:
Anant Sharma <anants@nvidia.com> Co-authored-by:
Anant Sharma <anants@nvidia.com>
-
Tzu-Ling Kan authored
Signed-off-by:Tzu-Ling <tzulingk@nvidia.com>
-
Qi Wang authored
Co-authored-by:Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Krishnan Prashanth authored
Signed-off-by:Krishnan Prashanth <kprashanth@nvidia.com>
-
Kris Hung authored
-
- 21 Apr, 2026 11 commits
-
-
Richard Huo authored
fix: Delete SQL tool call test for now since small models could occasionally generate malformed arguments with guided decoding (#8474)
-
Dmitry Tokarev authored
test: stabilize nightly — skip engine-init failures, convert xfails to skips, fix http URL validation regression (#8443) Signed-off-by:
Dmitry Tokarev <dtokarev@nvidia.com> Co-authored-by:
Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Tushar Sharma authored
Co-authored-by:Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Tushar Sharma authored
Co-authored-by:Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Ran Rubin authored
Signed-off-by:rrubin <rrubin@nvidia.com>
-
Indrajit Bhosale authored
fix: replace torch.load with safetensors and enable Rust frontend media decoding for TRT-LLM multimodal (#8295) Signed-off-by:
Indrajit Bhosale <iamindrajitb@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
Dmitry Tokarev authored
Signed-off-by:Dmitry Tokarev <dtokarev@nvidia.com>
-
Dmitry Tokarev authored
test(fault_tolerance): skip flaky sglang migration NATS combo, drop stale graceful_shutdown xfail (#8404) Signed-off-by:
Dmitry Tokarev <dtokarev@nvidia.com> Co-authored-by:
Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Dmitry Tokarev authored
Signed-off-by:
Dmitry Tokarev <dtokarev@nvidia.com> Co-authored-by:
Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Krishnan Prashanth authored
Signed-off-by:Krishnan Prashanth <kprashanth@nvidia.com>
-
zhongdaor-nv authored
Signed-off-by:zhongdaor <zhongdaor@nvidia.com>
-
- 20 Apr, 2026 3 commits
-
-
Richard Huo authored
fix: update the tool calling functionalities for sglang frontend processor to match with the latest sglang implementation (#8269)
-
GuanLuo authored
Signed-off-by:
Guan Luo <41310872+GuanLuo@users.noreply.github.com> Signed-off-by:
GuanLuo <41310872+GuanLuo@users.noreply.github.com> Co-authored-by:
devin-ai-integration[bot] <158243242+devin-ai-integration[bot]@users.noreply.github.com>
-
Tzu-Ling Kan authored
Signed-off-by:Tzu-Ling <tzulingk@nvidia.com>
-
- 17 Apr, 2026 2 commits
-
-
Qi Wang authored
Signed-off-by:
furionw <qiwa@nvidia.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
Schwinn Saereesitthipitak authored
Signed-off-by:Schwinn Saereesitthipitak <schwinns@nvidia.com>
-
- 16 Apr, 2026 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
- 15 Apr, 2026 4 commits
-
-
Schwinn Saereesitthipitak authored
Signed-off-by:
Schwinn Saereesitthipitak <schwinns@nvidia.com> Co-authored-by:
Dan Gil <dagil@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
hhzhang16 authored
Co-authored-by:
Schwinn Saereesitthipitak <schwinns@nvidia.com> Co-authored-by:
Dmitry Tokarev <dtokarev@nvidia.com>
-
Dmitry Tokarev authored
Signed-off-by:Dmitry Tokarev <dtokarev@nvidia.com>
-
- 14 Apr, 2026 6 commits
-
-
Tanmay Verma authored
Signed-off-by:Tanmay Verma <tanmayv@nvidia.com>
-
Biswa Panda authored
-
Biswa Panda authored
Signed-off-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
Dmitry Tokarev authored
Signed-off-by:Dmitry Tokarev <dtokarev@nvidia.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
Ashna Mehrotra authored
Signed-off-by:ashnamehrotra <ashnamehrotra@gmail.com>
-