- 10 Mar, 2026 1 commit
-
-
zhongdaor-nv authored
fix(perf): Skip duplicate image downloads and unnecessary image processing in MM Router (vLLM) (#7080)
Signed-off-by: zhongdaor <zhongdaor@nvidia.com>
Signed-off-by: zhongdaor-nv <zhongdaor@nvidia.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
- 09 Mar, 2026 3 commits
-
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
ishandhanani authored
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
-
- 06 Mar, 2026 8 commits
-
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
-
Schwinn Saereesitthipitak authored
-
Biswa Panda authored
-
MatejKosec authored
Signed-off-by: Marko Kosec <mkosec@nvidia.com>
Signed-off-by: Matej Kosec <mkosec@nvidia.com>
Signed-off-by: Vasilis Vagias <vvagias@nvidia.com>
Co-authored-by: vvagias <vasilis.n.vagias@gmail.com>
Co-authored-by: ishandhanani <82981111+ishandhanani@users.noreply.github.com>
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
Signed-off-by: PeaBrane <yanrpei@gmail.com>
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
-
Michal Guzek authored
fix: TRT-LLM multimodal preprocessor - remove default_multimodal_input_loader from the embedding paths (#6924)
Signed-off-by: Michal Guzek <mguzek@nvidia.com>
-
Yuewei Na authored
Signed-off-by: Yuewei Na <nv-yna@users.noreply.github.com>
Co-authored-by: Yuewei Na <nv-yna@users.noreply.github.com>
-
- 05 Mar, 2026 2 commits
-
-
Ameen Patel authored
Signed-off-by: AmeenP <ameenp360@gmail.com>
Co-authored-by: Biswa Panda <biswa.panda@gmail.com>
-
Graham King authored
Signed-off-by: Graham King <grahamk@nvidia.com>
-
- 04 Mar, 2026 2 commits
-
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
-
- 03 Mar, 2026 6 commits
-
-
GuanLuo authored
fix: properly setup and register vLLM worker for external / hybrid load balancing. Update launch script (#6695)
Signed-off-by: Guan Luo <41310872+GuanLuo@users.noreply.github.com>
-
Anant Sharma authored
Signed-off-by: Anant Sharma <anants@nvidia.com>
-
Ziqi Fan authored
Signed-off-by: Ziqi Fan <ziqif@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
-
Ryan Olson authored
Signed-off-by: Ryan Olson <rolson@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
-
- 02 Mar, 2026 4 commits
-
-
Kris Hung authored
-
MatejKosec authored
feat: Full Anthropic Messages API cache_control support (top-level, per-block, system block arrays) (#6629)
Signed-off-by: Matej Kosec <mkosec@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
-
Neal Vaidya authored
Signed-off-by: Neal Vaidya <nealv@nvidia.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
-
- 28 Feb, 2026 2 commits
-
-
Michael Feil authored
Signed-off-by: michaelfeil <63565275+michaelfeil@users.noreply.github.com>
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
-
- 27 Feb, 2026 3 commits
-
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
-
Nate Mailhot authored
-
ishandhanani authored
Co-authored-by: Claude <noreply@anthropic.com>
-
- 26 Feb, 2026 4 commits
-
-
Biswa Panda authored
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
-
William Arnold authored
Signed-off-by: William Arnold <7565007+Aphoh@users.noreply.github.com>
-
Neal Vaidya authored
Signed-off-by: Neal Vaidya <nealv@nvidia.com>
-
- 25 Feb, 2026 5 commits
-
-
Nikita authored
Signed-off-by: Nikita Sukharev <kaonael@gmail.com>
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
-
Nikita authored
Signed-off-by: Nikita Sukharev <kaonael@gmail.com>
-
Keiven C authored
Signed-off-by: Keiven Chang <keivenchang@users.noreply.github.com>
Co-authored-by: Keiven Chang <keivenchang@users.noreply.github.com>
Co-authored-by: Cursor <cursoragent@cursor.com>
-
Yongming Ding authored
Signed-off-by: Yongming Ding <yongmingd@nvidia.com>
-