- 22 Jan, 2026 17 commits
-
-
Julien Mancuso authored
Signed-off-by:Julien Mancuso <jmancuso@nvidia.com>
-
Ayush Agarwal authored
Signed-off-by:ayushag <ayushag@nvidia.com>
-
dagil-nvidia authored
Signed-off-by:Dan Gil <dagil@nvidia.com>
-
atchernych authored
Signed-off-by:Anna Tchernych <atchernych@nvidia.com>
-
Alec authored
Signed-off-by:
alec-flowers <aflowers@nvidia.com> Signed-off-by:
Alec <35311602+alec-flowers@users.noreply.github.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Hongkuan Zhou authored
Signed-off-by:hongkuanz <hongkuanz@nvidia.com>
-
atchernych authored
Signed-off-by:Anna Tchernych <atchernych@nvidia.com>
-
Keiven C authored
Co-authored-by:
Keiven Chang <keivenchang@users.noreply.github.com> Co-authored-by:
Alec <35311602+alec-flowers@users.noreply.github.com>
-
dagil-nvidia authored
Signed-off-by:Dan Gil <dagil@nvidia.com>
-
Pavithra Vijayakrishnan authored
Signed-off-by:pvijayakrish <pvijayakrish@nvidia.com>
-
Pavithra Vijayakrishnan authored
Signed-off-by:Pavithra Vijayakrishnan <160681768+pvijayakrish@users.noreply.github.com>
-
dagil-nvidia authored
Signed-off-by:Dan Gil <dagil@nvidia.com>
-
Pavithra Vijayakrishnan authored
Signed-off-by:pvijayakrish <pvijayakrish@nvidia.com>
-
Hongkuan Zhou authored
Signed-off-by:hongkuanz <hongkuanz@nvidia.com>
-
Alec authored
Signed-off-by:alec-flowers <aflowers@nvidia.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
- 21 Jan, 2026 11 commits
-
-
Ayush Agarwal authored
Signed-off-by:ayushag <ayushag@nvidia.com>
-
Olga Andreeva authored
-
Shang Wang authored
chore: [vLLM] Update the import paths from `vllm.entrypoints.openai` to align with vLLM latest `main` (#5447) Signed-off-by:
Shang Wang <shangw@nvidia.com> Signed-off-by:
Shang Wang <samshang.wang@mail.utoronto.ca> Signed-off-by:
Qidong Su <qidongs@nvidia.com> Co-authored-by:
Qidong Su <qidongs@nvidia.com>
-
Biswa Panda authored
-
Hongkuan Zhou authored
Signed-off-by:hongkuanz <hongkuanz@nvidia.com>
-
milesial authored
Signed-off-by:Alexandre Milesi <milesial@users.noreply.github.com>
-
Ryan Olson authored
Signed-off-by:Ryan Olson <rolson@nvidia.com>
-
Anant Sharma authored
Signed-off-by:Anant Sharma <anants@nvidia.com>
-
Ayush Agarwal authored
Signed-off-by:ayushag <ayushag@nvidia.com>
-
blarson-b10 authored
Signed-off-by:Brian Larson <brian.larson@baseten.co>
-
Janelle Cai authored
-
- 20 Jan, 2026 11 commits
-
-
ishandhanani authored
Co-authored-by:Claude Opus 4.5 <noreply@anthropic.com>
-
Schwinn Saereesitthipitak authored
-
ishandhanani authored
-
MatejKosec authored
This ensures that only new tokens are returned by sglang which avoids the overhead from creating copies of the entire token sequences per each iteration. These copies can become a bottleneck particularly for long sequence lengths and large concurrency counts. Signed-off-by:Matej Kosec <mkosec@nvidia.com>
-
Keiven C authored
Co-authored-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
jthomson04 authored
Signed-off-by:jthomson04 <jwillthomson19@gmail.com>
-
Schwinn Saereesitthipitak authored
Signed-off-by:
Schwinn Saereesitthipitak <schwinns@nvidia.com> Co-authored-by:
Claude Opus 4.5 <noreply@anthropic.com>
-
Schwinn Saereesitthipitak authored
Signed-off-by:
Schwinn Saereesitthipitak <schwinns@nvidia.com> Co-authored-by:
Claude Opus 4.5 <noreply@anthropic.com>
-
Michael Feil authored
Signed-off-by:
Michael Feil <63565275+michaelfeil@users.noreply.github.com> Co-authored-by:
Anant Sharma <anants@nvidia.com>
-
Graham King authored
Signed-off-by:Graham King <grahamk@nvidia.com>
-
Schwinn Saereesitthipitak authored
Signed-off-by:
Schwinn Saereesitthipitak <schwinns@nvidia.com> Co-authored-by:
Claude Opus 4.5 <noreply@anthropic.com>
-
- 18 Jan, 2026 1 commit
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-