- 22 Feb, 2026 1 commit
-
-
qizixi authored
[Spec Decode] Defer clearing KV connector metadata for EAGLE3 speculative decode + prefill / decode disagg setup (#34529) Signed-off-by:
qizixi <qizixi@meta.com> Co-authored-by:
Lu Fang <30275821+houseroad@users.noreply.github.com>
-
- 20 Feb, 2026 2 commits
-
-
Lucas Wilkinson authored
-
Vadim Gimpelson authored
Signed-off-by:Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
- 19 Feb, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
- 14 Feb, 2026 1 commit
-
-
Wei Zhao authored
Signed-off-by:wzhao18 <wzhao18.sz@gmail.com>
-
- 13 Feb, 2026 1 commit
-
-
Harry Huang authored
Signed-off-by:huanghaoyan.hhy <huanghaoyan.hhy@alibaba-inc.com>
-
- 10 Feb, 2026 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
Benjamin Chislett <bchislett@nvidia.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Benjamin Chislett <bchislett@nvidia.com>
-
- 09 Feb, 2026 1 commit
-
-
Reagan Lee authored
Signed-off-by:
Reagan Lee <“reaganjlee@gmail.com”> Co-authored-by:
Reagan Lee <“reaganjlee@gmail.com”>
-
- 08 Feb, 2026 1 commit
-
-
Reagan Lee authored
Signed-off-by:
Reagan Lee <“reaganjlee@gmail.com”> Signed-off-by:
Reagan Lee <reaganjlee@gmail.com> Signed-off-by:
Reagan Lee <96998476+reaganjlee@users.noreply.github.com> Co-authored-by:
Reagan Lee <“reaganjlee@gmail.com”> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 07 Feb, 2026 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 06 Feb, 2026 1 commit
-
-
emricksini-h authored
-
- 05 Feb, 2026 2 commits
-
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
Cyrus Leung authored
Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Co-authored-by:
wang.yuqi <yuqi.wang@daocloud.io>
-
- 04 Feb, 2026 1 commit
-
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 02 Feb, 2026 2 commits
-
-
yugong333 authored
Reduce the kernel overhead when num of active loras is smaller than max loras. Multiple cuda graphs are captured for each num of active-loras. (#32005) Signed-off-by:Yu Gong <yu3.gong@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 01 Feb, 2026 2 commits
-
-
Komal Kumar Teru authored
Signed-off-by:kkt-cohere <komal@cohere.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 31 Jan, 2026 4 commits
-
-
jma99_2333 authored
Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by:Hollow Man <hollowman@opensuse.org>
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
- 30 Jan, 2026 2 commits
-
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
Patrick von Platen authored
Signed-off-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
- 29 Jan, 2026 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 28 Jan, 2026 2 commits
-
-
Wentao Ye authored
[Feature] Fully support for async scheduling + PP, 30.8% E2E throughput improvement, 31.8% TPOT improvement (#32618) Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Signed-off-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
22quinn authored
Signed-off-by:22quinn <33176974+22quinn@users.noreply.github.com>
-
- 27 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 25 Jan, 2026 1 commit
-
-
Itay Etelis authored
Signed-off-by:
Itay Etelis <itay.etelis@ibm.com> Co-authored-by:
Itay Etelis <itay.etelis@ibm.com>
-
- 24 Jan, 2026 3 commits
-
-
Joshua Deng authored
Signed-off-by:
Joshua Deng <joshuakdeng@gmail.com> Signed-off-by:
Patrick von Platen <patrick.v.platen@gmail.com> Signed-off-by:
Nick Hill <nickhill123@gmail.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
Reagan Lee authored
Signed-off-by:
Reagan <reaganjlee@gmail.com> Signed-off-by:
Reagan Lee <96998476+reaganjlee@users.noreply.github.com> Co-authored-by:
Hiroken. <105287758+HirokenOvo@users.noreply.github.com>
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Luka Govedič <luka.govedic@gmail.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Signed-off-by:
Luka Govedič <lgovedic@redhat.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Varun Sundar Rabindranath <varunsundar08@gmail.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Luka Govedič <luka.govedic@gmail.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Luka Govedič <lgovedic@redhat.com>
-
- 23 Jan, 2026 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Harry Huang authored
Signed-off-by:
huanghaoyan.hhy <huanghaoyan.hhy@alibaba-inc.com> Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
- 20 Jan, 2026 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 19 Jan, 2026 2 commits
-
-
Tomas Ruiz authored
Signed-off-by:Tomas Ruiz <tomas.ruiz.te@gmail.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 16 Jan, 2026 1 commit
-
-
Hongxin Xu authored
Signed-off-by:
xhx1022 <1737006628@qq.com> Co-authored-by:
arlenxu <arlenxu@tencent.com>
-
- 15 Jan, 2026 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-