- 31 Aug, 2025 1 commit
-
-
Didier Durand authored
Signed-off-by:Didier Durand <durand.didier@gmail.com>
-
- 27 Aug, 2025 1 commit
-
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
- 25 Aug, 2025 1 commit
-
-
Chaojun Zhang authored
Signed-off-by:chzhang <chaojun.zhang@intel.com>
-
- 22 Aug, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni001@gmail.com>
-
- 19 Aug, 2025 1 commit
-
-
Nikhil Suryawanshi authored
Signed-off-by:Nikhil Suryawanshi <suryawanshin74@gmail.com>
-
- 15 Aug, 2025 1 commit
-
-
fhl2000 authored
[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059) Signed-off-by:
fhl <2410591650@qq.com> Signed-off-by:
fhl2000 <63384265+fhl2000@users.noreply.github.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
- 12 Aug, 2025 1 commit
-
-
Yongye Zhu authored
Signed-off-by:Yongye Zhu <zyy1102000@gmail.com>
-
- 05 Aug, 2025 1 commit
-
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@meta.com>
-
- 04 Aug, 2025 1 commit
-
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@meta.com>
-
- 02 Aug, 2025 1 commit
-
-
vllmellm authored
[FEAT][ROCm] Enable running Flash Attention as ViT attn backend for Qwen-VL models on ROCm platform. (#22069) Signed-off-by:
tjtanaavllm <tunjian.tan@amd.com> Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
tjtanaavllm <tunjian.tan@amd.com>
-
- 31 Jul, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 22 Jul, 2025 1 commit
-
-
Konrad Zawora authored
Signed-off-by:
Konrad Zawora <kzawora@habana.ai> Signed-off-by:
Chendi.Xue <chendi.xue@intel.com> Co-authored-by:
Chendi.Xue <chendi.xue@intel.com>
-
- 19 Jul, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 17 Jul, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 12 Jul, 2025 1 commit
-
-
Congcong Chen authored
Signed-off-by:Congcong Chen <congcongchen@microsoft.com>
-
- 09 Jul, 2025 1 commit
-
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
- 07 Jul, 2025 1 commit
-
-
Yang Yang authored
[Refactor]Abstract Platform Interface for Distributed Backend and Add xccl Support for Intel XPU (#19410) Signed-off-by:
dbyoung18 <yang5.yang@intel.com> Signed-off-by:
Kunshang Ji <kunshang.ji@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
- 04 Jul, 2025 1 commit
-
-
Gabriel Marinho authored
Signed-off-by:Gabriel Marinho <gmarinho@ibm.com>
-
- 26 Jun, 2025 1 commit
-
-
Seiji Eicher authored
Signed-off-by:Seiji Eicher <seiji@anyscale.com>
-
- 20 Jun, 2025 1 commit
-
-
kourosh hakhamaneshi authored
-
- 18 Jun, 2025 1 commit
-
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
- 07 Jun, 2025 1 commit
-
-
Driss Guessous authored
Signed-off-by:drisspg <drisspguessous@gmail.com>
-
- 05 Jun, 2025 2 commits
-
-
Michael Goin authored
-
Nicolò Lucchesi authored
-
- 04 Jun, 2025 1 commit
-
-
Kaixi Hou authored
-
- 03 Jun, 2025 2 commits
-
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
Nicolò Lucchesi authored
Signed-off-by:nicklucche <nlucches@redhat.com>
-
- 28 May, 2025 1 commit
-
-
Mengqing Cao authored
Signed-off-by:Mengqing Cao <cmq0113@163.com>
-
- 27 May, 2025 1 commit
-
-
Hyogeun Oh (오효근) authored
[Doc] Convert Sphinx directives ( `{class}`, `{meth}`, `{attr}`, ...) to MkDocs format for better documentation linking (#18663) Signed-off-by:Zerohertz <ohg3417@gmail.com>
-
- 23 May, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 22 May, 2025 1 commit
-
-
Mengqing Cao authored
Signed-off-by:
Mengqing Cao <cmq0113@163.com> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-
- 14 May, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 13 May, 2025 1 commit
-
-
Tao He authored
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)
-
- 12 May, 2025 1 commit
-
-
Jade Zheng authored
Signed-off-by:Jade Zheng <zheng.shoujian@outlook.com>
-
- 09 May, 2025 2 commits
-
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
vllmellm authored
Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
qli88 <qiang.li2@amd.com> Co-authored-by:
Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>
-
- 07 May, 2025 2 commits
-
-
Akshat Tripathi authored
Signed-off-by:
Akshat Tripathi <akshat@krai.ai> Signed-off-by:
Chengji Yao <chengjiyao@google.com> Co-authored-by:
Chengji Yao <chengjiyao@google.com>
-
Bowen Bao authored
-
- 04 May, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 03 May, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-