- 03 Feb, 2026 1 commit
-
Dezhan authored
Co-authored-by: Dezhan Tu <dztu@meta.com>
-
- 02 Feb, 2026 32 commits
-
Patrick von Platen authored
Signed-off-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Lain authored
Signed-off-by: Siyuan Fu <siyuanf@nvidia.com>
Co-authored-by: Pavani Majety <pmajety@nvidia.com>
-
Vasiliy Kuznetsov authored
Signed-off-by: vasiliy <vasiliy@fb.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
yugong333 authored
Reduce the kernel overhead when num of active loras is smaller than max loras. Multiple cuda graphs are captured for each num of active-loras. (#32005)
Signed-off-by: Yu Gong <yu3.gong@gmail.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Yang Liu authored
Signed-off-by: Yang <lymailforjob@gmail.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Kebe authored
Signed-off-by: Kebe <mail@kebe7jun.com>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: Thomas Vegas <tvegas@nvidia.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
-
shanjiaz authored
Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
danielafrimi authored
Signed-off-by: dafrimi <dafrimi@nvidia.com>
-
Rabi Mishra authored
Signed-off-by: rabi <ramishra@redhat.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Borushiki authored
Signed-off-by: Borushiki <38628261+Otsutsukii@users.noreply.github.com>
-
Grzegorz K. Karch authored
Signed-off-by: Grzegorz Karch <gkarch@nvidia.com>
-
Nicolò Lucchesi authored
[CI][Bugfix] Fix flaky `tests/v1/kv_connector/unit/test_multi_connector.py::test_multi_example_connector_consistency` (#33555)
Signed-off-by: NickLucche <nlucches@redhat.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Komal Kumar Teru authored
Signed-off-by: kkt-cohere <komal@cohere.com>
Signed-off-by: Komal Kumar Teru <162363718+kkt-cohere@users.noreply.github.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
R3hankhan authored
Signed-off-by: Rehan Khan <Rehan.Khan7@ibm.com>
-
RED authored
Signed-off-by: liuli <ll407707@alibaba-inc.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: liuli <ll407707@alibaba-inc.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Andy Lo authored
Signed-off-by: Andy Lo <andy@mistral.ai>
-
Sawyer Bowerman authored
Signed-off-by: Sawyer Bowerman <sbowerma@redhat.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Paco Xu authored
Signed-off-by: Paco Xu <paco.xu@daocloud.io>
-
jack authored
Signed-off-by: QwertyJack <7554089+QwertyJack@users.noreply.github.com>
Co-authored-by: QwertyJack <7554089+QwertyJack@users.noreply.github.com>
-
Robert Shaw authored
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
-
csy0225 authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
Co-authored-by: i-zhangmingming <i-zhangmingming@stepfun.com>
Co-authored-by: xiewuxun <xiewuxun@stepfun.com>
Co-authored-by: zetaohong <i-hongzetao@stepfun.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
-
Yifan Qiao authored
Signed-off-by: Yifan Qiao <yifanqiao@berkeley.edu>
-
Runkai Tao authored
Signed-off-by: Runkai Tao <rt572@physics.rutgers.edu>
-
Nick Hill authored
Signed-off-by: Nick Hill <nickhill123@gmail.com>
-
- 01 Feb, 2026 7 commits
-
Nick Hill authored
Signed-off-by: Nick Hill <nickhill123@gmail.com>
-
Komal Kumar Teru authored
Signed-off-by: kkt-cohere <komal@cohere.com>
-
will b. authored
Signed-off-by: Eduardo Salinas <edus@microsoft.com>
Signed-off-by: catswe <212922539+catswe@users.noreply.github.com>
Co-authored-by: Eduardo Salinas <edus@microsoft.com>
-
shaharmor98 authored
-
JartX authored
[BUGFIX] Fix hipErrorIllegalState in Qwen3-Omni during startup profiling allow inference Omni on ROCM (#33077)
Signed-off-by: JartX <sagformas@epdcenter.es>
-
Maral authored
Signed-off-by: maral <maralbahari.98@gmail.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Co-authored-by: YunzhuLu <lucia.yunzhu@gmail.com>
-