- 06 Jan, 2026 30 commits
-
-
roikoren755 authored
Signed-off-by:Roi Koren <roik@nvidia.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
Jzz1943 authored
Signed-off-by:Zhongze Jiang <jiangzhongze.jzz@ant-intl.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
kzwrime authored
Signed-off-by:kunzh <zhikun.wu@outlook.com>
-
Lucas Wilkinson authored
[Attention][1/n] Remove usage of deprecated `seq_lens_cpu` and `num_computed_tokens_cpu` CommonAttentionMetadata properties (#31773) Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
BlankR authored
Signed-off-by:BlankR <hjyblanche@gmail.com>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Fadi Arafeh authored
Signed-off-by:Fadi Arafeh <fadi.arafeh@arm.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Isotr0py authored
Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by:
Roger Wang <hey@rogerw.io> Signed-off-by:
h100 <h100@inferact.ai> Co-authored-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
h100 <h100@inferact.ai>
-
vllmellm authored
[Bugfix][ROCm] Fix Unsupported attention metadata type for speculative decoding in `eagle.py` (#31714) Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Kevin McKay authored
Signed-off-by:
c0de128 <kevin.mckay@outlook.com> Co-authored-by:
Claude Opus 4.5 <noreply@anthropic.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
maang authored
Signed-off-by:maang <maang_h@163.com>
-
Wentao Ye authored
[Perf] Optimize additional `fill(0)` in cutlass moe, 2.9% E2E throughput improvement, 10.8% TTFT improvement (#31754) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
maang authored
Signed-off-by:
maang <maang_h@163.com> Signed-off-by:
maang <55082429+maang-h@users.noreply.github.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by:Michael Goin <mgoin64@gmail.com>
-
John Calderon authored
Signed-off-by:
John Calderon <jcalderon@nvidia.com> Co-authored-by:
Benjamin Chislett <bchislett@nvidia.com>
-
- 05 Jan, 2026 10 commits
-
-
Seiji Eicher authored
Signed-off-by:
Seiji Eicher <seiji@anyscale.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Yongye Zhu authored
Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Nick Hill authored
Signed-off-by:njhill <nickhill123@gmail.com>
-
amitz-nv authored
Signed-off-by:
amitz-nv <203509407+amitz-nv@users.noreply.github.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Qidong Su authored
Signed-off-by:Qidong Su <soodoshll@gmail.com>
-
gnovack authored
Signed-off-by:gnovack <gnovack@amazon.com>
-