- 02 Apr, 2026 2 commits
-
-
Keiven C authored
Signed-off-by: Keiven Chang <keivenchang@users.noreply.github.com>
-
Yi Yao authored
Signed-off-by: Yi Yao <yi.a.yao@intel.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
-
- 31 Mar, 2026 1 commit
-
-
Zhongdongming Dai authored
Signed-off-by: Zhongdongming Dai <zhongdongmin@nvidia.com>
-
- 30 Mar, 2026 1 commit
-
-
Ayush Agarwal authored
Signed-off-by: ayushag <ayushag@nvidia.com>
-
- 27 Mar, 2026 1 commit
-
-
Yao Qing authored
Signed-off-by: Yao, Qing <qing.yao@intel.com>
-
- 26 Mar, 2026 2 commits
-
-
ZhengHongming888 authored
Signed-off-by: Hongming Zheng <hongming.zheng@intel.com>
Co-authored-by: Zhan Xue <zhan.xue@intel.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
-
Sihan Chen authored
Signed-off-by: Spycsh <sihan.chen@intel.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
-
- 24 Mar, 2026 1 commit
-
-
atchernych authored
Signed-off-by: Anna Tchernych <atchernych@nvidia.com>
-
- 20 Mar, 2026 1 commit
-
-
GuanLuo authored
Signed-off-by: Guan Luo <41310872+GuanLuo@users.noreply.github.com>
Signed-off-by: GuanLuo <41310872+GuanLuo@users.noreply.github.com>
-
- 18 Mar, 2026 1 commit
-
-
Keiven C authored
feat: GPU VRAM profiler via memory fraction injection + profiled test markers (part 2 - vLLM only) (#6719)
Signed-off-by: Keiven Chang <keivenchang@users.noreply.github.com>
-
- 17 Mar, 2026 1 commit
-
-
atchernych authored
Signed-off-by: Anna Tchernych <atchernych@nvidia.com>
-
- 16 Mar, 2026 1 commit
-
-
Ayush Agarwal authored
Signed-off-by: ayushag <ayushag@nvidia.com>
-
- 15 Mar, 2026 1 commit
-
-
Dmitry Tokarev authored
Signed-off-by: Dmitry Tokarev <dtokarev@nvidia.com>
-
- 12 Mar, 2026 1 commit
-
-
Ayush Agarwal authored
Signed-off-by: ayushag <ayushag@nvidia.com>
-
- 11 Mar, 2026 2 commits
-
-
daiyaanarfeen authored
Signed-off-by: Daiyaan <darfeen@nvidia.com>
Signed-off-by: athreesh <anish.maddipoti@utexas.edu>
Signed-off-by: Anish <80174047+athreesh@users.noreply.github.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-authored-by: athreesh <anish.maddipoti@utexas.edu>
Co-authored-by: Anish <80174047+athreesh@users.noreply.github.com>
-
Keiven C authored
Signed-off-by: Keiven Chang <keivenchang@users.noreply.github.com>
-
- 10 Mar, 2026 4 commits
-
-
dagil-nvidia authored
Signed-off-by: Dan Gil <dagil@nvidia.com>
-
Kris Hung authored
-
zhongdaor-nv authored
fix(perf): Skip duplicate image downloads and unnecessary image processing in MM Router (vLLM) (#7080)
Signed-off-by: zhongdaor <zhongdaor@nvidia.com>
Signed-off-by: zhongdaor-nv <zhongdaor@nvidia.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
Keiven C authored
Signed-off-by: Keiven Chang <keivenchang@users.noreply.github.com>
-
- 06 Mar, 2026 2 commits
-
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by: PeaBrane <yanrpei@gmail.com>
-
- 05 Mar, 2026 2 commits
-
-
Tushar Sharma authored
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: Dmitry Tokarev <dtokarev@nvidia.com>
-
Qi Wang authored
-
- 04 Mar, 2026 1 commit
-
-
atchernych authored
Signed-off-by: Anna Tchernych <atchernych@nvidia.com>
-
- 03 Mar, 2026 5 commits
-
-
GuanLuo authored
fix: properly setup and register vLLM worker for external / hybrid load balancing. Update launch script (#6695)
Signed-off-by: Guan Luo <41310872+GuanLuo@users.noreply.github.com>
-
Qi Wang authored
-
Alec authored
Signed-off-by: alec-flowers <aflowers@nvidia.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
-
Yue Yu authored
Signed-off-by: zhuofan1123 <zhuofanl@nvidia.com>
Co-authored-by: zhuofan1123 <zhuofanl@nvidia.com>
-
Alec authored
Signed-off-by: alec-flowers <aflowers@nvidia.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: ishandhanani <ishandhanani@gmail.com>
-
- 02 Mar, 2026 5 commits
-
-
Biswa Panda authored
-
atchernych authored
Signed-off-by: Anna Tchernych <atchernych@nvidia.com>
-
Hongkuan Zhou authored
Signed-off-by: hongkuanz <hongkuanz@nvidia.com>
-
GuanLuo authored
Signed-off-by: Guan Luo <41310872+GuanLuo@users.noreply.github.com>
-
Neal Vaidya authored
Signed-off-by: Neal Vaidya <nealv@nvidia.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
-
- 27 Feb, 2026 1 commit
-
-
KrishnanPrash authored
Signed-off-by: Krishnan Prashanth <kprashanth@nvidia.com>
-
- 26 Feb, 2026 1 commit
-
-
zhongdaor-nv authored
Signed-off-by: zhongdaor <zhongdaor@nvidia.com>
Signed-off-by: zhongdaor-nv <zhongdaor@nvidia.com>
Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>
-
- 25 Feb, 2026 3 commits
-
-
daiyaanarfeen authored
Signed-off-by: Daiyaan <darfeen@nvidia.com>
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
-
Qi Wang authored
-
Alec authored
Signed-off-by: alec-flowers <aflowers@nvidia.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
-