- 20 Apr, 2026 5 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
larryli2-amd authored
[ROCm][Feature] Enable AITER MLA attention backend to work with Eagle3 speculative decoding on ROCm (#39616) Signed-off-by:
larryli2-amd <larryli2@amd.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com>
-
Ilya Markov authored
Signed-off-by:
ilmarkov <markovilya197@gmail.com> Signed-off-by:
Markov Ilya <markovilya19@gmail.com>
-
Or Ozeri authored
Signed-off-by:
Or Ozeri <oro@il.ibm.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
Chaojun Zhang authored
Signed-off-by:
chaojun-zhang <chaojun.zhang@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
- 19 Apr, 2026 2 commits
-
-
Andrew Barnes authored
Signed-off-by:Bortlesboat <bortstheboat@gmail.com>
-
omerpaz95 authored
Signed-off-by:omerpaz95 <omerpaz95@gmail.com>
-
- 18 Apr, 2026 1 commit
-
-
Dan Alistarh authored
Signed-off-by:Dan Alistarh <d.alistarh@gmail.com>
-
- 17 Apr, 2026 6 commits
-
-
aditi-amd authored
Signed-off-by:aditi <aditi.rana@amd.com>
-
Xinyu Chen authored
Signed-off-by:
Xinyu Chen <xinyu1.chen@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
Jing Wang authored
Signed-off-by:
Jing Wang <jingwang96@qq.com> Co-authored-by:
Copilot <175728472+Copilot@users.noreply.github.com>
-
sychen52 authored
Signed-off-by:Shiyang Chen <shiychen@nvidia.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 16 Apr, 2026 4 commits
-
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@inferact.ai>
-
roikoren755 authored
Signed-off-by:Roi Koren <roik@nvidia.com>
-
Nikita Shapovalov authored
[Bugfix] Fix Ray compiled-DAG SHM channel stalls by detaching zero-copy `np.ndarray` logprobs buffers (#35736) Signed-off-by:Nikita Shapovalov <nikita@poolside.ai>
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@inferact.ai>
-
- 15 Apr, 2026 5 commits
-
-
Yan Ma authored
Signed-off-by:
Yan Ma <yan.ma@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
Csrayz authored
[Metrics] Add request_id to FinishedRequestStats to enable correlation between metrics and requests (#39710) Enables external `StatLogger` plugins to correlate per-request metrics with request-level context. Also, this is a pre-requisite for Prometheus exemplars in #30972. Signed-off-by:Csrayz <33659823+Csrayz@users.noreply.github.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
Vibhav Agarwal authored
Signed-off-by:
vibhavagarwal5 <vibhavagarwal5@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Xinyu Chen <xinyu1.chen@intel.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@inferact.ai>
-
- 14 Apr, 2026 8 commits
-
-
Francesco Fusco authored
-
roikoren755 authored
Signed-off-by:Roi Koren <roik@nvidia.com>
-
omerpaz95 authored
Signed-off-by:
omerpaz95 <omerpaz95@gmail.com> Co-authored-by:
Or Ozeri <oro@il.ibm.com>
-
Andrew Barnes authored
Signed-off-by:Bortlesboat <bortstheboat@gmail.com>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Lucas Kabela authored
Signed-off-by:Lucas Kabela <lucaskabela@meta.com>
-
Shanshan Shen authored
Signed-off-by:
shen-shanshan <467638484@qq.com> Signed-off-by:
Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
Mark McLoughlin authored
[Core][Metrics][BugFix] Replace num_cached_tokens/num_external_computed_tokens with PrefillStats (#37460) Related to `Counters can only be incremented by non-negative amounts` error with the `vllm:prompt_tokens_by_source_total` metric. Signed-off-by:
Mark McLoughlin <markmc@redhat.com> Co-authored-by:
Or Ozeri <or@ozery.com>
-
- 13 Apr, 2026 3 commits
-
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@inferact.ai>
-
mukesh-hai authored
Signed-off-by:
Mukesh Baphna <mukesh@hippocraticai.com> Signed-off-by:
Mark McLoughlin <markmc@redhat.com> Co-authored-by:
Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by:
Mark McLoughlin <markmc@redhat.com>
-
Wentao Ye authored
Signed-off-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com>
-
- 12 Apr, 2026 3 commits
-
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Martin Hickey authored
Signed-off-by:
Martin Hickey <martin.hickey@ie.ibm.com> Co-authored-by:
Or Ozeri <or@ozery.com>
-
Xinyu Chen authored
Signed-off-by:
Xinyu Chen <xinyu1.chen@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
- 11 Apr, 2026 1 commit
-
-
Tianyu Guo authored
Signed-off-by:
Tianyu Guo <guoty9@mail2.sysu.edu.cn> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
- 10 Apr, 2026 2 commits
-
-
Fynn Schmitt-Ulms authored
Signed-off-by:
Rahul-Tuli <rtuli@redhat.com> Signed-off-by:
Fynn Schmitt-Ulms <fschmitt@redhat.com> Co-authored-by:
Rahul-Tuli <rtuli@redhat.com> Co-authored-by:
Claude <noreply@anthropic.com>
-
yzong-rh authored
Signed-off-by:Yifan Zong <yzong@redhat.com>
-