"vllm/vscode:/vscode.git/clone" did not exist on "48eb976dd21d6b94d75995934619af8bf7de8dd9"
- 22 Apr, 2026 5 commits
-
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Hongkuan Zhou authored
Signed-off-by:hongkuanz <hongkuanz@nvidia.com>
-
Jie Hao authored
-
Hongkuan Zhou authored
Signed-off-by:hongkuanz <hongkuanz@nvidia.com>
-
Hongkuan Zhou authored
Signed-off-by:
hongkuanz <hongkuanz@nvidia.com> Co-authored-by:
Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
- 21 Apr, 2026 29 commits
-
-
Richard Huo authored
fix: Delete SQL tool call test for now since small models could occasionally generate malformed arguments with guided decoding (#8474)
-
jthomson04 authored
Signed-off-by:
jthomson04 <jwillthomson19@gmail.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
MatejKosec authored
Signed-off-by:Matej Kosec <mkosec@nvidia.com>
-
Biswa Panda authored
-
Hongkuan Zhou authored
Signed-off-by:
Hongkuan Zhou <hongkuanz@nvidia.com> Signed-off-by:
hongkuanz <hongkuanz@nvidia.com>
-
Ziqi Fan authored
Signed-off-by:Ziqi Fan <ziqif@nvidia.com>
-
Hongkuan Zhou authored
Signed-off-by:
hongkuanz <hongkuanz@nvidia.com> Co-authored-by:
Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Dmitry Tokarev authored
test: stabilize nightly — skip engine-init failures, convert xfails to skips, fix http URL validation regression (#8443) Signed-off-by:
Dmitry Tokarev <dtokarev@nvidia.com> Co-authored-by:
Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Nate Mailhot authored
Signed-off-by:
Nate Mailhot <nmailhot@nvidia.com> Co-authored-by:
Dmitry Tokarev <dtokarev@nvidia.com>
-
Tushar Sharma authored
Co-authored-by:Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Tushar Sharma authored
Co-authored-by:Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Tushar Sharma authored
Co-authored-by:Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Ran Rubin authored
Signed-off-by:rrubin <rrubin@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Hongkuan Zhou authored
fix(planner): use dynamo-planner image for profiler job and planner pods [DYN-2733][DYN-2746] (#8407) Signed-off-by:
hongkuanz <hongkuanz@nvidia.com> Co-authored-by:
Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Indrajit Bhosale authored
fix: replace torch.load with safetensors and enable Rust frontend media decoding for TRT-LLM multimodal (#8295) Signed-off-by:
Indrajit Bhosale <iamindrajitb@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
Dmitry Tokarev authored
Signed-off-by:Dmitry Tokarev <dtokarev@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Dmitry Tokarev authored
test(fault_tolerance): skip flaky sglang migration NATS combo, drop stale graceful_shutdown xfail (#8404) Signed-off-by:
Dmitry Tokarev <dtokarev@nvidia.com> Co-authored-by:
Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Dmitry Tokarev authored
Signed-off-by:
Dmitry Tokarev <dtokarev@nvidia.com> Co-authored-by:
Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Ran Rubin authored
Signed-off-by:rrubin <rrubin@nvidia.com>
-
Krishnan Prashanth authored
Signed-off-by:Krishnan Prashanth <kprashanth@nvidia.com>
-
Dr. Stefan Schimanski authored
Signed-off-by:Dr. Stefan Schimanski <sschimanski@nvidia.com>
-
Neelay Shah authored
Co-authored-by:Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
Schwinn Saereesitthipitak authored
Signed-off-by:Schwinn Saereesitthipitak <schwinns@nvidia.com>
-
zhongdaor-nv authored
Signed-off-by:zhongdaor <zhongdaor@nvidia.com>
-
Yan Ru Pei authored
Signed-off-by:PeaBrane <yanrpei@gmail.com>
-
Yuewei Na authored
Signed-off-by:
Yuewei Na <nv-yna@users.noreply.github.com> Co-authored-by:
Yuewei Na <nv-yna@users.noreply.github.com>
-
Dmitry Tokarev authored
Signed-off-by:
Dmitry Tokarev <dtokarev@nvidia.com> Co-authored-by:
Claude Opus 4.7 (1M context) <noreply@anthropic.com>
-
- 20 Apr, 2026 6 commits
-
-
Ayush Agarwal authored
Signed-off-by:ayushag <ayushag@nvidia.com>
-
Keiven C authored
Signed-off-by:Keiven Chang <keivenchang@users.noreply.github.com>
-
Karen Chung authored
-
Richard Huo authored
fix: update the tool calling functionalities for sglang frontend processor to match with the latest sglang implementation (#8269)
-
Dr. Stefan Schimanski authored
Signed-off-by:Dr. Stefan Schimanski <sschimanski@nvidia.com>
-
Krishnan Prashanth authored
Signed-off-by:Krishnan Prashanth <kprashanth@nvidia.com>
-