- 08 May, 2026 1 commit
-
-
王敏 authored
-
- 10 Apr, 2026 1 commit
-
-
xuxz authored
-
- 28 Jan, 2026 1 commit
-
-
Nick Hill authored
Signed-off-by:
Nick Hill <nickhill123@gmail.com> (cherry picked from commit 0cd259b2)
-
- 15 Jan, 2026 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 12 Jan, 2026 1 commit
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 09 Jan, 2026 1 commit
-
-
Max Hu authored
Signed-off-by:
Max Hu <maxhu@nvidia.com> Signed-off-by:
Max Hu <hyoung2991@gmail.com> Co-authored-by:
Max Hu <maxhu@nvidia.com>
-
- 06 Jan, 2026 1 commit
-
-
Benjamin Chislett authored
Signed-off-by:
Benjamin Chislett <bchislett@nvidia.com> Signed-off-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
- 04 Jan, 2026 1 commit
-
-
Nick Hill authored
Signed-off-by:njhill <nickhill123@gmail.com>
-
- 02 Jan, 2026 1 commit
-
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Signed-off-by:
njhill <nickhill123@gmail.com>
-
- 30 Dec, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:njhill <nickhill123@gmail.com>
-
- 24 Dec, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 19 Dec, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 18 Dec, 2025 2 commits
-
-
Alec authored
Signed-off-by:
alec-flowers <aflowers@nvidia.com> Signed-off-by:
Alec <35311602+alec-flowers@users.noreply.github.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
zhuwenwen authored
-
- 10 Dec, 2025 1 commit
-
-
Jialin Ouyang authored
[Perf] Enable environment cache in EngineCore to enable the feature for UniProcExecutor as well (#29289) Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
- 05 Dec, 2025 2 commits
-
-
Tova Movshovitz authored
Signed-off-by:
tovam <tovam@pliops.com> Signed-off-by:
Tova Movshovitz <tovam@pliops.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Nick Hill authored
Currently, when requests are cancelled while executing their final step, "completion" is handled based on normal stop processing (e.g. length or stop token), so the abort has no effect. This is typically not a problem, but when a kv connector is involved it thinks the request completed successfully rather than being aborted. This is problematic for disaggregated prefill which will free kv cache blocks if the request was aborted but not if it completed successfully—since the cancelled request will never be sent to the decode side, kv cache blocks remain pinned until the fall-back timeout expires. The problem is exacerbated when many requests are cancelled and/or there are large prefills whose forward pass takes a long time (since the window is bigger). This PR fixes the problem by processing pending aborts immediately prior to processing model output each step; we process only aborts, not new requests, since it's preferable for latency to process model outputs before new incoming requests. Fixes #26400. Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 02 Dec, 2025 1 commit
-
-
Zhuohan Li authored
Signed-off-by:
Zhuohan Li <zhuohan123@gmail.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
- 28 Nov, 2025 1 commit
-
-
wang.yuqi authored
Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by:
wang.yuqi <noooop@126.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 20 Nov, 2025 1 commit
-
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
- 19 Nov, 2025 1 commit
-
-
Qiu authored
Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by:
FENP <yuanyongjie.yyj@antgroup.com> Signed-off-by:
LookAround <lixushi@huawei.com> Signed-off-by:
Jingchun Gao <gaojingchun1@huawei.com> Signed-off-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
FENP <yuanyongjie.yyj@antgroup.com> Co-authored-by:
LookAround <lixushi@huawei.com> Co-authored-by:
Jingchun Gao <gaojingchun1@huawei.com> Co-authored-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
Jingchun Gao <63247409+gjc0824@users.noreply.github.com>
-
- 18 Nov, 2025 2 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 17 Nov, 2025 2 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Ronald authored
Signed-off-by:
Ronald1995 <ronaldautomobile@163.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Signed-off-by:
Benjamin Chislett <chislett.ben@gmail.com> Co-authored-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Benjamin Chislett <chislett.ben@gmail.com>
-
- 14 Nov, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 10 Nov, 2025 2 commits
-
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Wei Wei authored
Signed-off-by:Wei Wei <wwei6@meta.com>
-
- 07 Nov, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 01 Nov, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 31 Oct, 2025 1 commit
-
-
GuanLuo authored
Signed-off-by:
Guan Luo <gluo@nvidia.com> Signed-off-by:
GuanLuo <41310872+GuanLuo@users.noreply.github.com> Signed-off-by:
Guan Luo <41310872+GuanLuo@users.noreply.github.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
- 30 Oct, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 24 Oct, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 22 Oct, 2025 2 commits
-
-
dongbo910220 authored
Signed-off-by:
dongbo910220 <1275604947@qq.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 21 Oct, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 19 Oct, 2025 2 commits
-
-
iAmir97 authored
Signed-off-by:
iAmir97 <Amir.balwel@embeddedllm.com> Co-authored-by:
iAmir97 <Amir.balwel@embeddedllm.com>
-
dongbo910220 authored
Signed-off-by:dongbo910220 <1275604947@qq.com>
-
- 17 Oct, 2025 2 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-