- 26 Mar, 2026 1 commit
-
-
laibao authored
feat(v1 attention): 为 ROCm FlashAttention 接入 unified kv layout,并打通 mm_prefix、qq_bias 与 use_alibi_sqrt 透传 在 ROCm FlashAttention 后端增加 unified KV layout 选择逻辑 接入 unified varlen kernel 调用路径 在 FlashAttention metadata 中补充 mm_prefix_range 与 qq_bias 透传
-
- 09 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 15 Dec, 2025 1 commit
-
-
Isotr0py authored
[Platform] Refactor Platform attention backend selection to avoid breakpoint for OOT platform (#30212) Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 11 Dec, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 07 Dec, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 05 Dec, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
- 28 Nov, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 24 Nov, 2025 1 commit
-
-
Roger Wang authored
Signed-off-by:Roger Wang <hey@rogerw.io>
-
- 19 Nov, 2025 1 commit
-
-
Shanshan Shen authored
[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487) Signed-off-by:shen-shanshan <467638484@qq.com>
-
- 13 Nov, 2025 1 commit
-
-
Huamin Li authored
Signed-off-by:Huamin Li <3ericli@gmail.com>
-
- 12 Nov, 2025 1 commit
-
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
- 11 Nov, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 05 Nov, 2025 1 commit
-
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
- 17 Oct, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 12 Oct, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 09 Oct, 2025 1 commit
-
-
Wenzheng Bi authored
Signed-off-by:iwzbi <wzbi@zju.edu.cn>
-
- 08 Oct, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 06 Oct, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 05 Oct, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 03 Oct, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 01 Oct, 2025 1 commit
-
-
Yongye Zhu authored
Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Signed-off-by:
youkaichao <youkaichao@gmail.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
NickLucche <nlucches@redhat.com> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Signed-off-by:
Barry Kang <43644113+Barry-Delaney@users.noreply.github.com> Signed-off-by:
Lucia Fang <fanglu@meta.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Lucia Fang <116399278+luccafong@users.noreply.github.com> Co-authored-by:
Lucia Fang <fanglu@meta.com> Co-authored-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
Siyuan Fu <siyuanf@nvidia.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Xiaozhu Meng <mxz297@gmail.com> Co-authored-by:
Barry Kang <43644113+Barry-Delaney@users.noreply.github.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
-
- 30 Sep, 2025 1 commit
-
-
Yongye Zhu authored
Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Signed-off-by:
youkaichao <youkaichao@gmail.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
NickLucche <nlucches@redhat.com> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Signed-off-by:
Barry Kang <43644113+Barry-Delaney@users.noreply.github.com> Signed-off-by:
Lucia Fang <fanglu@meta.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Lucia Fang <116399278+luccafong@users.noreply.github.com> Co-authored-by:
Lucia Fang <fanglu@meta.com> Co-authored-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
Siyuan Fu <siyuanf@nvidia.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Xiaozhu Meng <mxz297@gmail.com> Co-authored-by:
Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>
-
- 25 Sep, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com>
-
- 23 Sep, 2025 1 commit
-
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 12 Aug, 2025 1 commit
-
-
Yongye Zhu authored
Signed-off-by:Yongye Zhu <zyy1102000@gmail.com>
-
- 07 Aug, 2025 1 commit
-
-
Lucas Wilkinson authored
[Attention] Support multiple attention metadata builders per kv_cache_spec + proper local attention no hybrid kv cache fix (#21588) Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 05 Aug, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <michael@neuralmagic.com>
-
- 19 Jul, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 14 Jul, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 06 Jul, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 03 Jun, 2025 1 commit
-
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
- 10 Mar, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 02 Feb, 2025 1 commit
-
-
Russell Bryant authored
- **Add SPDX license headers to python source files** - **Check for SPDX headers using pre-commit** commit 9d7ef44c3cfb72ca4c32e1c677d99259d10d4745 Author: Russell Bryant <rbryant@redhat.com> Date: Fri Jan 31 14:18:24 2025 -0500 Add SPDX license headers to python source files This commit adds SPDX license headers to python source files as recommended to the project by the Linux Foundation. These headers provide a concise way that is both human and machine readable for communicating license information for each source file. It helps avoid any ambiguity about the license of the code and can also be easily used by tools to help manage license compliance. The Linux Foundation runs license scans against the codebase to help ensure we are in compliance with the licenses of the code we use, including dependencies. Having these headers in place helps that tool do its job. More information can be found on ...
-
- 31 Jan, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
simon-mo <simon.mo@hey.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Zhuohan Li <zhuohan123@gmail.com> Co-authored-by:
Tyler Michael Smith <tysmith@redhat.com> Co-authored-by:
Alexander Matveev <59768536+alexm-neuralmagic@users.noreply.github.com> Co-authored-by:
simon-mo <xmo@berkeley.edu>
-
- 28 Jan, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 15 Jan, 2025 1 commit
-
-
wangxiyuan authored
Signed-off-by:
wangxiyuan <wangxiyuan1007@gmail.com> Signed-off-by:
Mengqing Cao <cmq0113@163.com> Co-authored-by:
Mengqing Cao <cmq0113@163.com>
-
- 09 Jan, 2025 1 commit
-
-
wangxiyuan authored
Signed-off-by:
wangxiyuan <wangxiyuan1007@gmail.com> Signed-off-by:
Mengqing Cao <cmq0113@163.com> Co-authored-by:
Mengqing Cao <cmq0113@163.com>
-
- 05 Dec, 2024 1 commit
-
-
zhuwenwen authored
-
- 19 Nov, 2024 1 commit
-
-
Mengqing Cao authored
Signed-off-by:Mengqing Cao <cmq0113@163.com>
-
- 06 Nov, 2024 1 commit
-
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-