- 13 Nov, 2025 4 commits
-
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
wangxiyuan authored
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
- 12 Nov, 2025 36 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Alexander Matveev authored
Signed-off-by: Alexander Matveev <amatveev@redhat.com>
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by: Hollow Man <hollowman@opensuse.org>
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
Co-authored-by: tjtanaa <tunjian.tan@embeddedllm.com>
Co-authored-by: kliuae <kuanfu.liu@embeddedllm.com>
-
Michael Goin authored
-
QiliangCui authored
Signed-off-by: Qiliang Cui <derrhein@gmail.com>
-
vllmellm authored
[ROCM] Fix ROCm warnings, environment flag access, and GEMM kernel naming for consistency in `_aiter_ops.py` (#28464)
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Wei Wei authored
Signed-off-by: Wei Wei <wwei6@meta.com>
-
Andy Lo authored
Signed-off-by: Andy Lo <andy@mistral.ai>
-
Yihua Cheng authored
Signed-off-by: ApostaC <yihua98@uchicago.edu>
-
Harry Mellor authored
[CI] Skip "Multi-Modal Models Test (Extended) 3" test that's broken in current Transformers (#28559)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
-
PerryZhang01 authored
Signed-off-by: Perry Zhang <perzhang@amd.com>
Co-authored-by: Perry Zhang <perzhang@amd.com>
-
alberto authored
Signed-off-by: Alberto Perdomo <aperdomo@redhat.com>
Signed-off-by: alberto <aperdomo@redhat.com>
Co-authored-by: Or Ozeri <or@ozery.com>
-
Benjamin Chislett authored
[Perf] Refactor cudagraph_support to enable full CUDA graphs for spec decoding with FlashInfer (#28479)
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Nicolò Lucchesi authored
Signed-off-by: NickLucche <nlucches@redhat.com>
Signed-off-by: Mark McLoughlin <markmc@redhat.com>
Co-authored-by: Mark McLoughlin <markmc@redhat.com>
-
Canlin Guo authored
Signed-off-by: gcanlin <canlinguosdu@gmail.com>
-
Alexander Matveev authored
[Performance][Hopper] Avoid M dim padding to 4x for most cases (due to cuda graphs paddings) (#28492)
Signed-off-by: Alexander Matveev <amatveev@redhat.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
wangxiyuan authored
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
TJian authored
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
ZhengHongming888 authored
Signed-off-by: Hongming Zheng <hongming.zheng@intel.com>
-
ziruiliu authored
Signed-off-by: Zirui Liu <ziliu@ddn.com>
Signed-off-by: ziruiliu <ziliu@ddn.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Nicolò Lucchesi <nlucches@redhat.com>
-
Chaojun Zhang authored
Signed-off-by: chaojun-zhang <chaojun.zhang@intel.com>
-
wuyaoxuehun authored
Signed-off-by: wuao.scotty <wuao.scotty@bytedance.com>
Co-authored-by: wuao.scotty <wuao.scotty@bytedance.com>
-
yyzxw authored
Signed-off-by: zxw <1020938856@qq.com>
-
Huamin Li authored
Signed-off-by: Huamin Li <3ericli@gmail.com>
-
Chenguang Zheng authored
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Co-authored-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Co-authored-by: Roger Wang <hey@rogerw.io>
-
Lukas Geiger authored
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
-
ai-jz authored
-
Fanli Lin authored
Signed-off-by: Lin, Fanli <fanli.lin@intel.com>
-
Chenguang Zheng authored
Signed-off-by: n00909098 <nguyen.kha.long@huawei.com>
Signed-off-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Signed-off-by: herotai214 <herotai214@gmail.com>
Signed-off-by: Khuong Le <khuong.le.manh@huawei.com>
Signed-off-by: Khuong Le <lemanhkhuong2611@gmail.com>
Co-authored-by: n00909098 <nguyen.kha.long@huawei.com>
Co-authored-by: knlnguyen1802 <knlnguyen1802@gmail.com>
Co-authored-by: herotai214 <herotai214@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Khuong Le <khuong.le.manh@huawei.com>
Co-authored-by: Khuong Le <lemanhkhuong2611@gmail.com>
-
Lukas Geiger authored
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
-
Andreas Karatzas authored
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
Signed-off-by: Andreas Karatzas <Andreas.Karatzas@amd.com>
-