- 05 Dec, 2025 2 commits
-
-
Tova Movshovitz authored
Signed-off-by:
tovam <tovam@pliops.com> Signed-off-by:
Tova Movshovitz <tovam@pliops.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Nick Hill authored
Currently, when requests are cancelled while executing their final step, "completion" is handled based on normal stop processing (e.g. length or stop token), so the abort has no effect. This is typically not a problem, but when a kv connector is involved it thinks the request completed successfully rather than being aborted. This is problematic for disaggregated prefill which will free kv cache blocks if the request was aborted but not if it completed successfully—since the cancelled request will never be sent to the decode side, kv cache blocks remain pinned until the fall-back timeout expires. The problem is exacerbated when many requests are cancelled and/or there are large prefills whose forward pass takes a long time (since the window is bigger). This PR fixes the problem by processing pending aborts immediately prior to processing model output each step; we process only aborts, not new requests, since it's preferable for latency to process model outputs before new incoming requests. Fixes #26400. Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 02 Dec, 2025 1 commit
-
-
Zhuohan Li authored
Signed-off-by:
Zhuohan Li <zhuohan123@gmail.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
- 28 Nov, 2025 1 commit
-
-
wang.yuqi authored
Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by:
wang.yuqi <noooop@126.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 20 Nov, 2025 1 commit
-
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
- 19 Nov, 2025 1 commit
-
-
Qiu authored
Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by:
FENP <yuanyongjie.yyj@antgroup.com> Signed-off-by:
LookAround <lixushi@huawei.com> Signed-off-by:
Jingchun Gao <gaojingchun1@huawei.com> Signed-off-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
FENP <yuanyongjie.yyj@antgroup.com> Co-authored-by:
LookAround <lixushi@huawei.com> Co-authored-by:
Jingchun Gao <gaojingchun1@huawei.com> Co-authored-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
Jingchun Gao <63247409+gjc0824@users.noreply.github.com>
-
- 18 Nov, 2025 2 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 17 Nov, 2025 2 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Ronald authored
Signed-off-by:
Ronald1995 <ronaldautomobile@163.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Signed-off-by:
Benjamin Chislett <chislett.ben@gmail.com> Co-authored-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Benjamin Chislett <chislett.ben@gmail.com>
-
- 14 Nov, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 10 Nov, 2025 2 commits
-
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Wei Wei authored
Signed-off-by:Wei Wei <wwei6@meta.com>
-
- 07 Nov, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 01 Nov, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 31 Oct, 2025 1 commit
-
-
GuanLuo authored
Signed-off-by:
Guan Luo <gluo@nvidia.com> Signed-off-by:
GuanLuo <41310872+GuanLuo@users.noreply.github.com> Signed-off-by:
Guan Luo <41310872+GuanLuo@users.noreply.github.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
- 30 Oct, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 24 Oct, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 22 Oct, 2025 2 commits
-
-
dongbo910220 authored
Signed-off-by:
dongbo910220 <1275604947@qq.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 21 Oct, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 19 Oct, 2025 2 commits
-
-
iAmir97 authored
Signed-off-by:
iAmir97 <Amir.balwel@embeddedllm.com> Co-authored-by:
iAmir97 <Amir.balwel@embeddedllm.com>
-
dongbo910220 authored
Signed-off-by:dongbo910220 <1275604947@qq.com>
-
- 17 Oct, 2025 2 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 14 Oct, 2025 1 commit
-
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
- 13 Oct, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 12 Oct, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 10 Oct, 2025 2 commits
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 08 Oct, 2025 1 commit
-
-
Ayush Satyam authored
Signed-off-by:Ayush Satyam <ayushsatyam146@gmail.com>
-
- 05 Oct, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 27 Sep, 2025 1 commit
-
-
Jialin Ouyang authored
Signed-off-by:
Jialin Ouyang <jialino@meta.com> Signed-off-by:
Jialin Ouyang <Jialin.Ouyang@gmail.com> Co-authored-by:
Jialin Ouyang <jialino@meta.com>
-
- 20 Sep, 2025 1 commit
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 15 Sep, 2025 1 commit
-
-
Chao Lei authored
Signed-off-by:
LCAIZJ <leichao139636@163.com> Co-authored-by:
leichao.lc <leichao.lc@antgroup.com>
-
- 14 Sep, 2025 1 commit
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 12 Sep, 2025 2 commits
-
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Signed-off-by:
Ronald1995 <ronaldautomobile@163.com> Co-authored-by:
Ronald1995 <ronaldautomobile@163.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
dongluw authored
Signed-off-by:donglu <donglu@cohere.com>
-
- 09 Sep, 2025 1 commit
-
-
22quinn authored
Signed-off-by:22quinn <33176974+22quinn@users.noreply.github.com>
-
- 06 Sep, 2025 1 commit
-
-
Didier Durand authored
Signed-off-by:
Didier Durand <durand.didier@gmail.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-