- 09 Jan, 2026 1 commit
-
-
vllmellm authored
[Bugfix][ROCm]Fix Qwen3-Next-80B-A3B-Thinking inference and optimize non-standard block size (544) support under rocm_atten (#31380) Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
- 08 Jan, 2026 2 commits
-
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 07 Jan, 2026 1 commit
-
-
BlankR authored
Signed-off-by:
BlankR <hjyblanche@gmail.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
- 05 Jan, 2026 1 commit
-
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 30 Dec, 2025 1 commit
-
-
yt0428 authored
Signed-off-by:
yuantao <2422264527@qq.com> Signed-off-by:
yt0428 <51468697+yt0428@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 22 Dec, 2025 1 commit
-
-
Kevin McKay authored
Signed-off-by:c0de128 <kevin.mckay@outlook.com>
-
- 19 Dec, 2025 2 commits
-
-
Thomas Parnell authored
[Bugfix] [Kernel] Triton attention kernels: mask out V blocks that fall outside sliding window (#30887) Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 18 Dec, 2025 4 commits
-
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Isotr0py authored
[MM Encoder]: Migrate legacy ViT `MultiHeadAttention` to new `MMEncoderAttention` interface (#30684) Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Andreas Karatzas authored
[ROCm][Bugfix] Fix `fa_version` argument error in `flash_attn_maxseqlen_wrapper` for ROCm without aiter (#30909) Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
Isotr0py authored
-
- 16 Dec, 2025 2 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
- 15 Dec, 2025 1 commit
-
-
Shanshan Shen authored
[CustomOp][MM] Extract MMEncoderAttention as CustomOp and replace the backend of QwenVisionAttention with it. (#30125) Signed-off-by:
shen-shanshan <467638484@qq.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
tjtanaa <tunjian.tan@embeddedllm.com>
-
- 12 Dec, 2025 1 commit
-
-
jvlunteren authored
Signed-off-by:
Jan van Lunteren <jvl@zurich.ibm.com> Signed-off-by:
jvlunteren <161835099+jvlunteren@users.noreply.github.com> Co-authored-by:
Thomas Parnell <tom.parnell@gmail.com> Co-authored-by:
Thomas Parnell <tpa@zurich.ibm.com>
-
- 08 Dec, 2025 1 commit
-
-
Dazhi Jiang authored
Signed-off-by:Dazhi Jiang <dazhi_jiang@163.com>
-
- 28 Nov, 2025 3 commits
-
-
Augusto Yao authored
Signed-off-by:augusto.yjh <augusto.yjh@antgroup.com>
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
Mingyuan Ma authored
Signed-off-by:
mingyuanm <mingyuanm@nvidia.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
- 26 Nov, 2025 1 commit
-
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
- 25 Nov, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 24 Nov, 2025 1 commit
-
-
Roger Wang authored
Signed-off-by:Roger Wang <hey@rogerw.io>
-
- 20 Nov, 2025 1 commit
-
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
- 19 Nov, 2025 1 commit
-
-
Qiu authored
Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by:
FENP <yuanyongjie.yyj@antgroup.com> Signed-off-by:
LookAround <lixushi@huawei.com> Signed-off-by:
Jingchun Gao <gaojingchun1@huawei.com> Signed-off-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
FENP <yuanyongjie.yyj@antgroup.com> Co-authored-by:
LookAround <lixushi@huawei.com> Co-authored-by:
Jingchun Gao <gaojingchun1@huawei.com> Co-authored-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
Jingchun Gao <63247409+gjc0824@users.noreply.github.com>
-
- 14 Nov, 2025 1 commit
-
-
rasmith authored
[Bugfix][CI/Test][Spec Decode] Fix illegal memory access in offline_inference/spec_decode.py (Issue 27619) (#28432) Signed-off-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com>
-
- 12 Nov, 2025 1 commit
-
-
Andreas Karatzas authored
Signed-off-by:
Andreas Karatzas <akaratza@amd.com> Signed-off-by:
Andreas Karatzas <Andreas.Karatzas@amd.com>
-
- 11 Nov, 2025 2 commits
-
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 10 Nov, 2025 1 commit
-
-
vllmellm authored
[RFC][ROCm][AITER] Keep all AITER kernels in `_aiter_ops` class like `_custom_ops` and `_ipex_ops` (#24490) Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 08 Nov, 2025 1 commit
-
-
zhangsicheng5 authored
Signed-off-by:
zhangsicheng5 <zhangsicheng5@huawei.com> Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by:
Qiu <qiuchunshuo@huawei.com> Co-authored-by:
QiuChunshuo <qiuchunshuo@huawei.com>
-
- 03 Nov, 2025 1 commit
-
-
Lucas Kabela authored
Signed-off-by:Lucas Kabela <lucaskabela@meta.com>
-
- 01 Nov, 2025 1 commit
-
-
Yan Ma authored
Signed-off-by:
Yan Ma <yan.ma@intel.com> Signed-off-by:
Kunshang Ji <kunshang.ji@intel.com> Co-authored-by:
Yejing Lai <yejing.lai@intel.com> Co-authored-by:
Guancheng Fu <110874468+gc-fu@users.noreply.github.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
- 28 Oct, 2025 1 commit
-
-
Lucas Kabela authored
[Misc][qwen2_5_vl][torch.compile] Enable `supports_torch_compile` on generic nn.Module and demonstrate speedup on Qwen Vision model (#23207) Signed-off-by:
Lucas Kabela <lucaskabela@meta.com> Signed-off-by:
Lucas Kabela <lucasakabela@gmail.com>
-
- 26 Oct, 2025 1 commit
-
-
Yeshwanth N authored
Signed-off-by:
Yeshwanth Surya <yeshsurya@gmail.com> Signed-off-by:
Yeshwanth N <yeshsurya@gmail.com> Signed-off-by:
yeshsurya <yeshsurya@gmail.com>
-
- 21 Oct, 2025 1 commit
-
-
Tao He authored
Signed-off-by:
Tao He <linzhu.ht@alibaba-inc.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com>
-
- 18 Oct, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by:
isotr0py <2037008807@qq.com>
-
- 14 Oct, 2025 1 commit
-
-
Jaya Yuan authored
Signed-off-by:
yuanyongjie.yyj <yuanyongjie.yyj@antgroup.com> Signed-off-by:
FENP <32334296+FENP@users.noreply.github.com> Signed-off-by:
Jaya Yuan <yuanyongjie.yyj@antgroup.com>
-
- 12 Oct, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-