- 09 Apr, 2026 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
- 08 Apr, 2026 1 commit
-
-
triangleXIV authored
[BugFix] --max-model-len=-1 causes over-limit requests to hang and starve the entire service (#39102)
Signed-off-by:triangle14 <y1019026570@gmail.com>
Signed-off-by:mgoin <mgoin64@gmail.com>
Co-authored-by:mgoin <mgoin64@gmail.com>
-
- 18 Mar, 2026 1 commit
-
-
Itay Alroy authored
Signed-off-by:Itay Alroy <ialroy@nvidia.com>
-
- 13 Mar, 2026 1 commit
-
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
Signed-off-by:Nick Hill <nickhill123@gmail.com>
Co-authored-by:Claude Sonnet 4.5 <noreply@anthropic.com>
Co-authored-by:Nick Hill <nickhill123@gmail.com>
-
- 10 Mar, 2026 1 commit
-
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
- 09 Mar, 2026 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 06 Mar, 2026 1 commit
-
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
Co-authored-by:Claude Sonnet 4.5 <noreply@anthropic.com>
-
- 03 Mar, 2026 1 commit
-
-
aykoppol authored
Signed-off-by:aykoppol <aykoppol+git@gmail.com>
-
- 28 Feb, 2026 1 commit
-
-
Itay Alroy authored
Signed-off-by:Yongji Wu <wuyongji317@gmail.com>
Signed-off-by:Itay Alroy <ialroy@nvidia.com>
Signed-off-by:Tyler Michael Smith <tlrmchlsmth@gmail.com>
Signed-off-by:Ron Tourgeman <rtourgeman@nvidia.com>
Co-authored-by:Yongji Wu <wuyongji317@gmail.com>
Co-authored-by:Tyler Michael Smith <tlrmchlsmth@gmail.com>
Co-authored-by:Ron Tourgeman <rtourgeman@nvidia.com>
-
- 13 Feb, 2026 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 07 Feb, 2026 1 commit
-
-
Aaron Hao authored
Signed-off-by:ahao-anyscale <ahao@anyscale.com>
Signed-off-by:Aaron Hao <ahao@anyscale.com>
Co-authored-by:Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
- 05 Feb, 2026 1 commit
-
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 04 Feb, 2026 1 commit
-
-
zhanqiuhu authored
Add labeled Prometheus metrics to distinguish where prompt tokens come from in P/D disaggregated deployments.

In P/D disaggregation, decode instances receive KV cache from prefill instances. Currently, decode reports inflated prompt throughput because it counts all prompt tokens as "computed", even though most were transferred. This PR adds labeled metrics so users can distinguish actual compute work from transferred work:

vllm:prompt_tokens_by_source_total{source="local_compute"} # Tokens prefilled locally
vllm:prompt_tokens_by_source_total{source="external_kv_transfer"} # Tokens received via KV transfer
vllm:prompt_tokens_by_source_total{source="local_cache_hit"} # Tokens from local prefix cache
vllm:prompt_tokens_cached_total # Total cached (local + external, -1 when all
Signed-off-by:Zhanqiu Hu <zh338@cornell.edu>
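
The per-source accounting described in this commit message can be illustrated with a small sketch. It is not the vLLM implementation: the function names, the plain `Counter` standing in for a labeled Prometheus counter, and the assumption that the three sources partition the prompt are all hypothetical, chosen only to show how a labeled breakdown separates computed tokens from transferred or cached ones.

```python
from collections import Counter

# Label values taken from the commit message; everything else is illustrative.
SOURCES = ("local_compute", "local_cache_hit", "external_kv_transfer")


def split_prompt_tokens(
    prompt_len: int, local_cache_hit: int, external_kv_transfer: int
) -> dict[str, int]:
    """Split a prompt's tokens by source (hypothetical helper).

    Assumes every prompt token is either a local prefix-cache hit, received
    via KV transfer, or prefilled locally.
    """
    if min(prompt_len, local_cache_hit, external_kv_transfer) < 0:
        raise ValueError("token counts must be non-negative")
    cached = local_cache_hit + external_kv_transfer
    if cached > prompt_len:
        raise ValueError("cached tokens cannot exceed the prompt length")
    return {
        "local_compute": prompt_len - cached,
        "local_cache_hit": local_cache_hit,
        "external_kv_transfer": external_kv_transfer,
    }


def record(counters: Counter, breakdown: dict[str, int]) -> None:
    """Add one request's breakdown to the running per-source totals."""
    for source in SOURCES:
        counters[source] += breakdown[source]


# Stand-in for vllm:prompt_tokens_by_source_total, keyed by the source label.
counters: Counter = Counter()

# A decode-side request: most of the prompt arrived via KV transfer.
record(counters, split_prompt_tokens(1000, 200, 700))
# A request with no cache hits: the whole prompt is prefilled locally.
record(counters, split_prompt_tokens(500, 0, 0))

print(dict(counters))
```

Under these assumptions, an unlabeled counter would report 1500 prompt tokens for the two requests, while the labeled breakdown attributes only 600 of them to local compute.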
-
- 24 Jan, 2026 1 commit
-
-
Joshua Deng authored
Signed-off-by:Joshua Deng <joshuakdeng@gmail.com>
Signed-off-by:Patrick von Platen <patrick.v.platen@gmail.com>
Signed-off-by:Nick Hill <nickhill123@gmail.com>
Signed-off-by:Roger Wang <hey@rogerw.io>
Co-authored-by:Roger Wang <hey@rogerw.io>
Co-authored-by:Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by:Nick Hill <nickhill123@gmail.com>
-
- 12 Jan, 2026 1 commit
-
-
Hongxin Xu authored
Signed-off-by:xhx1022 <1737006628@qq.com>
Signed-off-by:Hongxin Xu <70438206+xhx1022@users.noreply.github.com>
Signed-off-by:arlenxu <arlenxu@tencent.com>
Co-authored-by:22quinn <33176974+22quinn@users.noreply.github.com>
Co-authored-by:arlenxu <arlenxu@tencent.com>
-
- 23 Dec, 2025 1 commit
-
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
Signed-off-by:Nick Hill <nhill@redhat.com>
Co-authored-by:Nick Hill <nhill@redhat.com>
-
- 10 Dec, 2025 1 commit
-
-
Will Eaton authored
Signed-off-by:Will Eaton <weaton@redhat.com>
Signed-off-by:Will Eaton <me@wseaton.com>
Signed-off-by:Nick Hill <nhill@redhat.com>
Co-authored-by:Mark McLoughlin <markmc@redhat.com>
Co-authored-by:Nick Hill <nhill@redhat.com>
Co-authored-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 25 Nov, 2025 1 commit
-
-
Avishek Goswami authored
Signed-off-by:GOavi101 <1704178@kiit.ac.in>
-
- 14 Nov, 2025 1 commit
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 05 Nov, 2025 1 commit
-
-
Snehlata authored
Signed-off-by:atalhens <sneh.lata@nutanix.com>
-
- 12 Oct, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 05 Oct, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 19 Sep, 2025 1 commit
-
-
Andrew Sansom authored
Signed-off-by:Andrew Sansom <andrew@protopia.ai>
Signed-off-by:Andrew Sansom <qthequartermasterman@gmail.com>
Co-authored-by:Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 12 Sep, 2025 1 commit
-
-
RichardoMu authored
Signed-off-by:Mu Huai <tianbowen.tbw@antgroup.com>
Signed-off-by:Ye Zhang <zhysishu@gmail.com>
Signed-off-by:RichardoMu <44485717+RichardoMrMu@users.noreply.github.com>
Signed-off-by:simon-mo <simon.mo@hey.com>
Signed-off-by:Aaron Pham <contact@aarnphm.xyz>
Signed-off-by:22quinn <33176974+22quinn@users.noreply.github.com>
Co-authored-by:Mu Huai <tianbowen.tbw@antgroup.com>
Co-authored-by:Ye Zhang <zhysishu@gmail.com>
Co-authored-by:Benjamin Bartels <benjamin@bartels.dev>
Co-authored-by:simon-mo <simon.mo@hey.com>
Co-authored-by:瑜琮 <ly186375@antfin.com>
Co-authored-by:Aaron Pham <contact@aarnphm.xyz>
Co-authored-by:22quinn <33176974+22quinn@users.noreply.github.com>
-
- 29 Aug, 2025 1 commit
-
-
Flora Feng authored
Signed-off-by:sfeng33 <4florafeng@gmail.com>
-
- 16 Aug, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 13 Aug, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 30 Jul, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 19 Jul, 2025 1 commit
-
-
Rui Qiao authored
Signed-off-by:Rui Qiao <ruisearch42@gmail.com>
-
- 23 Jun, 2025 1 commit
-
-
amit authored
Signed-off-by:amit <amit.man@gmail.com>
Co-authored-by:Roger Wang <Rogerw0108@gmail.com>
-
- 19 Jun, 2025 1 commit
-
-
Maximilien de Bayser authored
Signed-off-by:Max de Bayser <mbayser@br.ibm.com>
Signed-off-by:Max de Bayser <maxdebayser@gmail.com>
Signed-off-by:22quinn <33176974+22quinn@users.noreply.github.com>
Co-authored-by:22quinn <33176974+22quinn@users.noreply.github.com>
-
- 04 Jun, 2025 1 commit
-
-
jmswen authored
Signed-off-by:Jon Swenson <jmswen@gmail.com>
-
- 03 Jun, 2025 1 commit
-
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
- 30 May, 2025 1 commit
-
-
Nick Hill authored
-
- 23 May, 2025 1 commit
-
-
Chauncey authored
Co-authored-by:simon-mo <xmo@berkeley.edu>
-
- 12 May, 2025 1 commit
-
-
Robert Shaw authored
Signed-off-by:ApostaC <yihua98@uchicago.edu>
Signed-off-by:Tyler Michael Smith <tyler@neuralmagic.com>
Signed-off-by:rshaw@neuralmagic.com <robertgshaw2@gmail.com>
Signed-off-by:Robert Shaw <rshaw@neuralmagic.com>
Signed-off-by:mgoin <mgoin64@gmail.com>
Signed-off-by:Nick Hill <nhill@redhat.com>
Signed-off-by:Brent Salisbury <bsalisbu@redhat.com>
Co-authored-by:Tyler Michael Smith <tyler@neuralmagic.com>
Co-authored-by:ApostaC <yihua98@uchicago.edu>
Co-authored-by:Robert Shaw <rshaw@neuralmagic.com>
Co-authored-by:mgoin <mgoin64@gmail.com>
Co-authored-by:Nick Hill <nhill@redhat.com>
Co-authored-by:Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by:Brent Salisbury <bsalisbu@redhat.com>
-
- 30 Apr, 2025 1 commit
-
-
Marko Rosenmueller authored
Signed-off-by:Marko Rosenmueller <5467316+dr75@users.noreply.github.com>
-
- 26 Apr, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 23 Apr, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 17 Apr, 2025 1 commit
-
-
Robert Shaw authored
Signed-off-by:rshaw@neuralmagic.com <rshaw@neuralmagic.com>
Signed-off-by:Andrew Feldman <afeldman@neuralmagic.com>
Signed-off-by:Nick Hill <nhill@redhat.com>
Co-authored-by:rshaw@neuralmagic.com <rshaw@neuralmagic.com>
Co-authored-by:Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by:Russell Bryant <rbryant@redhat.com>
Co-authored-by:Andrew Feldman <afeldman@neuralmagic.com>
Co-authored-by:afeldman-nm <156691304+afeldman-nm@users.noreply.github.com>
Co-authored-by:Nick Hill <nhill@redhat.com>
-