- 14 Nov, 2025 2 commits
-
-
haoyangli-amd authored
Signed-off-by:Haoyang Li <lihaoyang0109@gmail.com>
-
Hank_ authored
Signed-off-by:
Hank <hcc.mayday@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 13 Nov, 2025 4 commits
-
-
zofia authored
Signed-off-by:Zhu, Zufang <zufang.zhu@intel.com>
-
Zijing Liu authored
Signed-off-by:Zijing Liu <liuzijing2014@gmail.com>
-
Lucia Fang authored
Support DeepEP for Kimi-k2-thinking through enabling gemm selection for compressed-tensor marlin wna16 (#28574) Signed-off-by:Lu Fang <fanglu@fb.com>
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
- 12 Nov, 2025 5 commits
-
-
vllmellm authored
[ROCM] Fix ROCm warnings, environment flag access, and GEMM kernel naming for consistency in `_aiter_ops.py` (#28464) Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
PerryZhang01 authored
Signed-off-by:
Perry Zhang <perzhang@amd.com> Co-authored-by:
Perry Zhang <perzhang@amd.com>
-
Alexander Matveev authored
[Performance][Hopper] Avoid M dim padding to 4x for most cases (due to cuda graphs paddings) (#28492) Signed-off-by:Alexander Matveev <amatveev@redhat.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 11 Nov, 2025 8 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
xuebwang-amd authored
Signed-off-by:xuebwang-amd <xuebwang@amd.com>
-
xuebwang-amd authored
Signed-off-by:
xuebwang-amd <xuebwang@amd.com> Co-authored-by:
fxmarty-amd <felmarty@amd.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
bnellnm authored
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 10 Nov, 2025 4 commits
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
jiahanc authored
Signed-off-by:jiahanc <173873397+jiahanc@users.noreply.github.com>
-
vllmellm authored
[RFC][ROCm][AITER] Keep all AITER kernels in `_aiter_ops` class like `_custom_ops` and `_ipex_ops` (#24490) Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
zejunchen-zejun authored
[Rocm][fused_moe][fp4] view weight to torch.float4_e2m1fn_x2 when running aiter fused moe for fp4 model (#27474) Signed-off-by:zejunchen-zejun <zejun.chen@amd.com>
-
- 08 Nov, 2025 1 commit
-
-
Kunshang Ji authored
Signed-off-by:
Yan Ma <yan.ma@intel.com> Signed-off-by:
Kunshang Ji <kunshang.ji@intel.com>
-
- 07 Nov, 2025 2 commits
-
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
smit kadvani authored
Signed-off-by:
Smit Kadvani <smit.kadvani@gmail.com> Co-authored-by:
Smit Shaileshbhai Kadvani <kadvani@meta.com>
-
- 06 Nov, 2025 1 commit
-
-
Aleksandr Malyshev authored
Signed-off-by:
Aleksandr Malyshev <maleksan@amd.com> Co-authored-by:
Aleksandr Malyshev <maleksan@amd.com> Co-authored-by:
Gregory Shtrasberg <156009573+gshtras@users.noreply.github.com>
-
- 05 Nov, 2025 2 commits
-
-
amirkl94 authored
Signed-off-by:
Amir Klein <203507526+amirkl94@users.noreply.github.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
- 04 Nov, 2025 3 commits
-
-
bnellnm authored
-
Jerry Zhang authored
Signed-off-by:Jerry Zhang <jerryzh168@gmail.com>
-
Varun Sundar Rabindranath authored
-
- 31 Oct, 2025 1 commit
-
-
Shu Wang authored
Signed-off-by:Shu Wang. <shuw@nvidia.com>
-
- 30 Oct, 2025 1 commit
-
-
Paul Zhang authored
Signed-off-by:
PaulZhang12 <paulzhan@fb.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
- 27 Oct, 2025 2 commits
-
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 24 Oct, 2025 3 commits
-
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
fhl2000 authored
Signed-off-by:
fhl <2410591650@qq.com> Signed-off-by:
fhl2000 <63384265+fhl2000@users.noreply.github.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Xiangyu Li authored
[Kernel] Add GPTQv2 format support for low-bit or asymmetric quantization, by adapting gptq_gemm (#26092)
-
- 23 Oct, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-