- 09 Jan, 2026 3 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Wentao Ye authored
[Perf] Optimize cutlass moe problem size calculation, 5.3% E2E Throughput improvement, 2.2% TTFT improvement (#31830) Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com>
-
- 24 Dec, 2025 1 commit
-
-
rongfu.leng authored
Signed-off-by:rongfu.leng <rongfu.leng@daocloud.io>
-
- 19 Dec, 2025 1 commit
-
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
- 12 Dec, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 09 Dec, 2025 1 commit
-
-
czhu-cohere authored
Signed-off-by:czhu-cohere <conway.zhu@cohere.com>
-
- 08 Dec, 2025 1 commit
-
-
Daniel Cámpora authored
Signed-off-by:
Daniel Campora <961215+dcampora@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 07 Dec, 2025 2 commits
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Signed-off-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
[Perf] Deepgemm fused layout kernel for activations, 4.3% throughput improvement, 10.7% TTFT improvement. (#29546) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 29 Nov, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by:
Jinzhen Lin <jinzhen.ljz@antgroup.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Signed-off-by:
Jinzhen Lin <linjinzhen@hotmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Michael Goin <mgoin@redhat.com>
-
- 26 Nov, 2025 1 commit
-
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
- 25 Nov, 2025 1 commit
-
-
Pleaplusone authored
[Perf][Deepseek] optimize gather_and_maybe_dequant_cache kernel's perf for extremely long sequence (#28029) Signed-off-by:ganyi <ygan@amd.com>
-
- 20 Nov, 2025 1 commit
-
-
Boyuan Feng authored
Signed-off-by:
Boyuan Feng <boyuan@meta.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 12 Nov, 2025 1 commit
-
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
- 11 Nov, 2025 1 commit
-
-
zhrrr authored
Signed-off-by:zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>
-
- 02 Nov, 2025 1 commit
-
-
Asaf Joseph Gardin authored
Signed-off-by:asafg <39553475+Josephasafg@users.noreply.github.com>
-
- 24 Oct, 2025 1 commit
-
-
Xiangyu Li authored
[Kernel] Add GPTQv2 format support for low-bit or asymmetric quantization, by adapting gptq_gemm (#26092)
-
- 21 Oct, 2025 2 commits
-
-
Lain authored
Signed-off-by:
Siyuan Fu <siyuanf@nvidia.com> Signed-off-by:
Daniel Campora <961215+dcampora@users.noreply.github.com> Signed-off-by:
Lain <siyuanf@nvidia.com> Co-authored-by:
Daniel Campora <961215+dcampora@users.noreply.github.com>
-
Daniel Cámpora authored
Signed-off-by:Daniel Campora <961215+dcampora@users.noreply.github.com>
-
- 17 Oct, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 10 Oct, 2025 1 commit
-
-
Elvir Crnčević authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
elvircrn <elvircrn@gmail.com> Signed-off-by:
Elvir Crnčević <elvircrn@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Varun Sundar Rabindranath <varunsundar08@gmail.com>
-
- 08 Oct, 2025 1 commit
-
-
Barry Kang authored
Signed-off-by:
Barry Kang <43644113+Barry-Delaney@users.noreply.github.com> Signed-off-by:
Simon Mo <simon.mo@hey.com> Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Simon Mo <simon.mo@hey.com> Co-authored-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
- 07 Oct, 2025 1 commit
-
-
Daniel Cámpora authored
Signed-off-by:
Daniel Campora <961215+dcampora@users.noreply.github.com> Signed-off-by:
Daniel Cámpora <961215+dcampora@users.noreply.github.com> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-
- 03 Oct, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 30 Sep, 2025 1 commit
-
-
Yongye Zhu authored
Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Signed-off-by:
youkaichao <youkaichao@gmail.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
NickLucche <nlucches@redhat.com> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Signed-off-by:
Barry Kang <43644113+Barry-Delaney@users.noreply.github.com> Signed-off-by:
Lucia Fang <fanglu@meta.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Lucia Fang <116399278+luccafong@users.noreply.github.com> Co-authored-by:
Lucia Fang <fanglu@meta.com> Co-authored-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
Siyuan Fu <siyuanf@nvidia.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Xiaozhu Meng <mxz297@gmail.com> Co-authored-by:
Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>
-
- 17 Sep, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com>
-
- 15 Sep, 2025 1 commit
-
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
- 13 Sep, 2025 2 commits
-
-
Elvir Crnčević authored
Signed-off-by:elvircrn <elvircrn@gmail.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 11 Sep, 2025 1 commit
-
-
TaehyunKim authored
Signed-off-by:
ca1207 <ca1207zzz@gmail.com> Signed-off-by:
TaehyunKim <73943231+ca1207@users.noreply.github.com> Co-authored-by:
WyldeCat <skan1543@gmail.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
- 08 Sep, 2025 1 commit
-
-
Ming Yang authored
Signed-off-by:
Ming Yang <minos.future@gmail.com> Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-
- 06 Sep, 2025 1 commit
-
-
yzds authored
Signed-off-by:
hongchao <hongchao@msh.team> Signed-off-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
hongchao <hongchao@msh.team> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-
- 04 Sep, 2025 1 commit
-
-
elvischenv authored
[Bugfix][Misc] Fix silu_and_mul_nvfp4_quant issue and extract common utils for nvfp4 kernel source files (#23727) Signed-off-by:
elvischenv <219235043+elvischenv@users.noreply.github.com> Signed-off-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 29 Aug, 2025 1 commit
-
-
yzds authored
Signed-off-by:
hongchao <hongchao@msh.team> Signed-off-by:
Richard Zou <zou3519@gmail.com> Co-authored-by:
hongchao <hongchao@msh.team> Co-authored-by:
Richard Zou <zou3519@gmail.com> Co-authored-by:
Richard Zou <zou3519@users.noreply.github.com>
-
- 28 Aug, 2025 2 commits
-
-
elvischenv authored
Signed-off-by:
jindih <jindih@nvidia.com> Signed-off-by:
elvischenv <219235043+elvischenv@users.noreply.github.com> Co-authored-by:
jindih <jindih@nvidia.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Luka Govedic <lgovedic@redhat.com>
-
yzds authored
Co-authored-by:hongchao <hongchao@msh.team>
-
- 24 Aug, 2025 1 commit
-
-
czhu-cohere authored
Signed-off-by:czhu-cohere <conway.zhu@cohere.com>
-
- 22 Aug, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni001@gmail.com>
-
- 20 Aug, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-