- 14 Apr, 2026 1 commit
-
-
Mark McLoughlin authored
[Core][Metrics][BugFix] Replace num_cached_tokens/num_external_computed_tokens with PrefillStats (#37460)
Related to `Counters can only be incremented by non-negative amounts` error with the `vllm:prompt_tokens_by_source_total` metric.
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
Co-authored-by: Or Ozeri <or@ozery.com>
-
- 10 Apr, 2026 2 commits
-
-
Nick Hill authored
Signed-off-by: Nick Hill <nickhill123@gmail.com>
-
Chauncey authored
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
- 09 Apr, 2026 3 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Nick Hill authored
Signed-off-by: Nick Hill <nickhill123@gmail.com>
-
wang.yuqi authored
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
-
- 08 Apr, 2026 3 commits
-
-
triangleXIV authored
[BugFix] --max-model-len=-1 causes over-limit requests to hang and starve the entire service (#39102)
Signed-off-by: triangle14 <y1019026570@gmail.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
Shengqi Chen authored
Signed-off-by: Shengqi Chen <harry-chen@outlook.com>
Co-authored-by: Jason Li <jasonlizhengjian@gmail.com>
Co-authored-by: Roger Wang <hey@rogerw.io>
-
haosdent authored
Signed-off-by: haosdent <haosdent@gmail.com>
-
- 30 Mar, 2026 1 commit
-
-
fangyuchu authored
Signed-off-by: fangyuchu <fangyuchu@qq.com>
Signed-off-by: Nick Hill <nickhill123@gmail.com>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
-
- 27 Mar, 2026 1 commit
-
-
Bvicii authored
Signed-off-by: Bvicii <yizhanhuang2002@gmail.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
- 25 Mar, 2026 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 24 Mar, 2026 1 commit
-
-
Sungjae Lee authored
Signed-off-by: Sungjae Lee <33976427+llsj14@users.noreply.github.com>
Signed-off-by: Sungjae Lee <sung-jae.lee@navercorp.com>
Signed-off-by: Chauncey <chaunceyjiang@gmail.com>
Co-authored-by: Chauncey <chaunceyjiang@gmail.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 21 Mar, 2026 1 commit
-
-
Brandon Pelfrey authored
Signed-off-by: Brandon Pelfrey <bpelfrey@nvidia.com>
Signed-off-by: Brandon Pelfrey <brandonpelfrey@gmail.com>
Signed-off-by: Nick Hill <nickhill123@gmail.com>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
-
- 20 Mar, 2026 2 commits
-
-
Itay Alroy authored
Signed-off-by: Itay Alroy <ialroy@nvidia.com>
Co-authored-by: Ron Tourgeman <rtourgeman@nvidia.com>
-
Itay Alroy authored
Signed-off-by: Itay Alroy <ialroy@nvidia.com>
-
- 19 Mar, 2026 1 commit
-
-
Aaron Hao authored
Signed-off-by: hao-aaron <ahao@anyscale.com>
-
- 18 Mar, 2026 1 commit
-
-
Itay Alroy authored
Signed-off-by: Itay Alroy <ialroy@nvidia.com>
-
- 13 Mar, 2026 3 commits
-
-
Nick Hill authored
Signed-off-by: Nick Hill <nickhill123@gmail.com>
-
Mark McLoughlin authored
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
Signed-off-by: Nick Hill <nickhill123@gmail.com>
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
-
Itay Alroy authored
Signed-off-by: Itay Alroy <ialroy@nvidia.com>
Co-authored-by: Yongji Wu <wuyongji317@gmail.com>
Co-authored-by: Ron Tourgeman <rtourgeman@nvidia.com>
-
- 12 Mar, 2026 2 commits
-
-
Kunshang Ji authored
Signed-off-by: Kunshang Ji <jikunshang95@gmail.com>
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nickhill123@gmail.com>
-
- 11 Mar, 2026 2 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
fangyuchu authored
[Bugfix] Surface exceptions from non-blocking execute_model in UniProcExecutor to avoid DP deadlocks (#35194)
Signed-off-by: fangyuchu <fangyuchu@qq.com>
-
- 10 Mar, 2026 5 commits
-
-
Nick Hill authored
Signed-off-by: Nick Hill <nickhill123@gmail.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
SoluMilken authored
Signed-off-by: SoluMilken <ypiheyn.imm02g@g2.nctu.edu.tw>
-
Mark McLoughlin authored
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
-
Wentao Ye authored
[Perf] Compute maxsim in worker side, reducing redundant copies, 2.7% E2E throughput improvement (#36159)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 09 Mar, 2026 2 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 08 Mar, 2026 1 commit
-
-
Sage authored
-
- 07 Mar, 2026 1 commit
-
-
Nick Hill authored
Signed-off-by: Nick Hill <nickhill123@gmail.com>
Co-authored-by: Mark McLoughlin <markmc@redhat.com>
-
- 06 Mar, 2026 3 commits
-
-
Nick Hill authored
-
Mark McLoughlin authored
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
-
Shiyan Deng authored
Signed-off-by: Shiyan Deng <dsy842974287@meta.com>
Co-authored-by: Lu Fang <30275821+houseroad@users.noreply.github.com>
-
- 05 Mar, 2026 1 commit
-
-
Jiayi Yan authored
Signed-off-by: 1195343015 <1195343015@qq.com>
Signed-off-by: Jiayi Yan <66017932+1195343015@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 04 Mar, 2026 2 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Hyunkyun Moon authored
Signed-off-by: HyunKyun Moon <mhg5303@gmail.com>
-