- 16 Apr, 2026 2 commits
-
-
Nikita Shapovalov authored
[Bugfix] Fix Ray compiled-DAG SHM channel stalls by detaching zero-copy `np.ndarray` logprobs buffers (#35736) Signed-off-by:Nikita Shapovalov <nikita@poolside.ai>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 15 Apr, 2026 9 commits
-
-
Harry Mellor authored
Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
khluu <khluu000@gmail.com> Signed-off-by:
Kevin H. Luu <khluu000@gmail.com> Signed-off-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
khluu <khluu000@gmail.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
jiang1.li <jiang1.li@intel.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
zhanqiuhu authored
Signed-off-by:Zhanqiu Hu <zhu@redhat.com>
-
zhanqiuhu authored
Signed-off-by:Zhanqiu Hu <zhu@redhat.com>
-
Zhewen Li authored
Signed-off-by:
Zhewen Li <zhewenli@inferact.ai> Co-authored-by:
Zhewen Li <zhewenli@inferact.ai> Co-authored-by:
OpenAI Codex <codex@openai.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Csrayz authored
[Metrics] Add request_id to FinishedRequestStats to enable correlation between metrics and requests (#39710) Enables external `StatLogger` plugins to correlate per-request metrics with request-level context. Also, this is a pre-requisite for Prometheus exemplars in #30972. Signed-off-by:Csrayz <33659823+Csrayz@users.noreply.github.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
wliao2 authored
[Test] Refactor hard coded device string in test files under compile/quantization/models/model_executor folders (#38901) Signed-off-by:Liao, Wei <wei.liao@intel.com>
-
- 14 Apr, 2026 4 commits
-
-
zhanqiuhu authored
[CI][KVConnector][Metrics] Update multi KV connector edge case according to prefill stats changes (#39808) Signed-off-by:Zhanqiu Hu <zhu@redhat.com>
-
omerpaz95 authored
Signed-off-by:
omerpaz95 <omerpaz95@gmail.com> Co-authored-by:
Or Ozeri <oro@il.ibm.com>
-
Shanshan Shen authored
Signed-off-by:
shen-shanshan <467638484@qq.com> Signed-off-by:
Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
Mark McLoughlin authored
[Core][Metrics][BugFix] Replace num_cached_tokens/num_external_computed_tokens with PrefillStats (#37460) Related to `Counters can only be incremented by non-negative amounts` error with the `vllm:prompt_tokens_by_source_total` metric. Signed-off-by:
Mark McLoughlin <markmc@redhat.com> Co-authored-by:
Or Ozeri <or@ozery.com>
-
- 13 Apr, 2026 3 commits
-
-
Monishver authored
Signed-off-by:
Monishver Chandrasekaran <monishverchandrasekaran@gmail.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
mukesh-hai authored
Signed-off-by:
Mukesh Baphna <mukesh@hippocraticai.com> Signed-off-by:
Mark McLoughlin <markmc@redhat.com> Co-authored-by:
Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by:
Mark McLoughlin <markmc@redhat.com>
-
zhanqiuhu authored
Signed-off-by:ZhanqiuHu <zhu@redhat.com>
-
- 12 Apr, 2026 3 commits
-
-
Nicolò Lucchesi authored
The number of features supported by the connector has grown substantially and the `nixl_connector.py` file has accumulated a lot of code. Creates a separate directory and isolates connector/scheduler code in the hope of improving clarity and maintainability. Further refactor of components aimed at improving clarity and simplifying code will follow soon. Signed-off-by:NickLucche <nlucches@redhat.com>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
Martin Hickey authored
Signed-off-by:
Martin Hickey <martin.hickey@ie.ibm.com> Co-authored-by:
Or Ozeri <or@ozery.com>
-
- 11 Apr, 2026 1 commit
-
-
Yan Ma authored
Signed-off-by:
Yan Ma <yan.ma@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
- 10 Apr, 2026 4 commits
-
-
zhanqiuhu authored
Signed-off-by:ZhanqiuHu <zhu@redhat.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Yan Ma authored
Signed-off-by:Yan Ma <yan.ma@intel.com>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
- 09 Apr, 2026 4 commits
-
-
Lucas Kabela authored
[Performance Improvement] Update `batched_count_greater_than` to handle batch size 1 without recompile (#38933) Signed-off-by:
Lucas Kabela <lucaskabela@meta.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
sihao_li authored
Signed-off-by:
sihao.li <sihao.li@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
Chendi.Xue authored
Signed-off-by:Chendi Xue <chendi.xue@intel.com>
-
- 08 Apr, 2026 7 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
triangleXIV authored
[BugFix] --max-model-len=-1 causes over-limit requests to hang and starve the entire service (#39102) Signed-off-by:
triangle14 <y1019026570@gmail.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Rishi Puri authored
Signed-off-by:
Rishi Puri <riship@nvidia.com> Signed-off-by:
Rishi Puri <puririshi98@berkeley.edu> Signed-off-by:
sfeng33 <4florafeng@gmail.com> Co-authored-by:
Benjamin Chislett <chislett.ben@gmail.com> Co-authored-by:
Flora Feng <4florafeng@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
haosdent authored
Signed-off-by:haosdent <haosdent@gmail.com>
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@inferact.ai>
-
- 07 Apr, 2026 3 commits
-
-
ibifrost authored
Signed-off-by:
wuchenxin <wuchenxin.wcx@alibaba-inc.com> Signed-off-by:
ibifrost <47308427+ibifrost@users.noreply.github.com> Co-authored-by:
Simon Mo <simon.mo@hey.com>
-
Ronen Schaffer authored
Signed-off-by:Ronen Schaffer <ronen.schaffer@ibm.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-