- 11 Mar, 2026 2 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
fangyuchu authored
[Bugfix] Surface exceptions from non-blocking execute_model in UniProcExecutor to avoid DP deadlocks (#35194) Signed-off-by:fangyuchu <fangyuchu@qq.com>
-
- 10 Mar, 2026 2 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
- 09 Mar, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 06 Mar, 2026 3 commits
-
-
Nick Hill authored
-
Mark McLoughlin authored
Signed-off-by:
Mark McLoughlin <markmc@redhat.com> Co-authored-by:
Claude Sonnet 4.5 <noreply@anthropic.com>
-
Shiyan Deng authored
Signed-off-by:
Shiyan Deng <dsy842974287@meta.com> Co-authored-by:
Lu Fang <30275821+houseroad@users.noreply.github.com>
-
- 05 Mar, 2026 1 commit
-
-
Jiayi Yan authored
Signed-off-by:
1195343015 <1195343015@qq.com> Signed-off-by:
Jiayi Yan <66017932+1195343015@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 04 Mar, 2026 3 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Qi Wang authored
Signed-off-by:Qi Wang <qiwa@nvidia.com>
-
Jaewon authored
Signed-off-by:Jaewon Lee <jaewon@meta.com>
-
- 28 Feb, 2026 1 commit
-
-
Itay Alroy authored
Signed-off-by:
Yongji Wu <wuyongji317@gmail.com> Signed-off-by:
Itay Alroy <ialroy@nvidia.com> Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Signed-off-by:
Ron Tourgeman <rtourgeman@nvidia.com> Co-authored-by:
Yongji Wu <wuyongji317@gmail.com> Co-authored-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by:
Ron Tourgeman <rtourgeman@nvidia.com>
-
- 26 Feb, 2026 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 25 Feb, 2026 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
- 20 Feb, 2026 2 commits
-
-
Lucas Wilkinson authored
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
- 19 Feb, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
- 13 Feb, 2026 3 commits
-
-
Aaron Hao authored
Signed-off-by:
ahao-anyscale <ahao@anyscale.com> Signed-off-by:
Aaron Hao <ahao@anyscale.com> Signed-off-by:
hao-aaron <ahao@anyscale.com> Signed-off-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
Jaewon authored
Signed-off-by:
Jaewon Lee <jaewon@meta.com> Co-authored-by:
Lu Fang <30275821+houseroad@users.noreply.github.com>
-
Jaewon authored
Signed-off-by:
Jaewon Lee <jaewon@meta.com> Co-authored-by:
Lu Fang <30275821+houseroad@users.noreply.github.com>
-
- 07 Feb, 2026 1 commit
-
-
Aaron Hao authored
Signed-off-by:
ahao-anyscale <ahao@anyscale.com> Signed-off-by:
Aaron Hao <ahao@anyscale.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
- 06 Feb, 2026 1 commit
-
-
emricksini-h authored
-
- 31 Jan, 2026 1 commit
-
-
jma99_2333 authored
Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
- 27 Jan, 2026 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
- 15 Jan, 2026 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 12 Jan, 2026 1 commit
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 09 Jan, 2026 1 commit
-
-
Max Hu authored
Signed-off-by:
Max Hu <maxhu@nvidia.com> Signed-off-by:
Max Hu <hyoung2991@gmail.com> Co-authored-by:
Max Hu <maxhu@nvidia.com>
-
- 06 Jan, 2026 1 commit
-
-
Benjamin Chislett authored
Signed-off-by:
Benjamin Chislett <bchislett@nvidia.com> Signed-off-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
- 04 Jan, 2026 1 commit
-
-
Nick Hill authored
Signed-off-by:njhill <nickhill123@gmail.com>
-
- 02 Jan, 2026 1 commit
-
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Signed-off-by:
njhill <nickhill123@gmail.com>
-
- 30 Dec, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:njhill <nickhill123@gmail.com>
-
- 24 Dec, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 19 Dec, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 18 Dec, 2025 1 commit
-
-
Alec authored
Signed-off-by:
alec-flowers <aflowers@nvidia.com> Signed-off-by:
Alec <35311602+alec-flowers@users.noreply.github.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
- 10 Dec, 2025 1 commit
-
-
Jialin Ouyang authored
[Perf] Enable environment cache in EngineCore to enable the feature for UniProcExecutor as well (#29289) Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
- 05 Dec, 2025 2 commits
-
-
Tova Movshovitz authored
Signed-off-by:
tovam <tovam@pliops.com> Signed-off-by:
Tova Movshovitz <tovam@pliops.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Nick Hill authored
Currently, when requests are cancelled while executing their final step, "completion" is handled based on normal stop processing (e.g. length or stop token), so the abort has no effect. This is typically not a problem, but when a kv connector is involved it thinks the request completed successfully rather than being aborted. This is problematic for disaggregated prefill which will free kv cache blocks if the request was aborted but not if it completed successfully—since the cancelled request will never be sent to the decode side, kv cache blocks remain pinned until the fall-back timeout expires. The problem is exacerbated when many requests are cancelled and/or there are large prefills whose forward pass takes a long time (since the window is bigger). This PR fixes the problem by processing pending aborts immediately prior to processing model output each step; we process only aborts, not new requests, since it's preferable for latency to process model outputs before new incoming requests. Fixes #26400. Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 02 Dec, 2025 1 commit
-
-
Zhuohan Li authored
Signed-off-by:
Zhuohan Li <zhuohan123@gmail.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Nick Hill <nhill@redhat.com>
-
- 28 Nov, 2025 1 commit
-
-
wang.yuqi authored
Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by:
wang.yuqi <noooop@126.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-