- 10 Apr, 2026 1 commit
-
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
- 08 Apr, 2026 2 commits
-
-
Roberto L. Castro authored
[Perf][Kernel] Persistent TopK scheduler: unified CUDAGraph-safe kernel with dynamic per-row dispatch - DeepSeek-V3.2 DSA decode (#37421) Signed-off-by:
LopezCastroRoberto <rocastro@redhat.com> Signed-off-by:
Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com> Co-authored-by:
Claude Sonnet 4.5 <noreply@anthropic.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 03 Apr, 2026 2 commits
-
-
Itay Etelis authored
Signed-off-by:
Itay Etelis <itay.etelis@ibm.com> Co-authored-by:
Itay Etelis <itay.etelis@ibm.com> Co-authored-by:
Or Ozeri <oro@il.ibm.com>
-
Carl Y authored
Signed-off-by:Carl You <4531192+carlyou@users.noreply.github.com>
-
- 02 Apr, 2026 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Signed-off-by:
Gregory Shtrasberg <156009573+gshtras@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
tjtanaavllm <tunjian.tan@amd.com>
-
- 01 Apr, 2026 1 commit
-
-
Monishver authored
Signed-off-by:Monishver Chandrasekaran <monishverchandrasekaran@gmail.com>
-
- 31 Mar, 2026 2 commits
-
-
Olya Kozlova authored
Signed-off-by:Olya Kozlova <okozlova@nvidia.com>
-
mikaylagawarecki authored
Signed-off-by:Mikayla Gawarecki <mikaylagawarecki@gmail.com>
-
- 30 Mar, 2026 2 commits
-
-
SandishKumarHN authored
Signed-off-by:
SandishKumarHN <sandish@fb.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>
-
mikaylagawarecki authored
Signed-off-by:Mikayla Gawarecki <mikaylagawarecki@gmail.com>
-
- 25 Mar, 2026 1 commit
-
-
mikaylagawarecki authored
Signed-off-by:Mikayla Gawarecki <mikaylagawarecki@gmail.com>
-
- 23 Mar, 2026 1 commit
-
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
- 19 Mar, 2026 1 commit
-
-
mikaylagawarecki authored
Signed-off-by:Mikayla Gawarecki <mikaylagawarecki@gmail.com>
-
- 17 Mar, 2026 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 16 Mar, 2026 1 commit
-
-
Terry Gao authored
Signed-off-by:tianrengao <terrygao87@gmail.com>
-
- 09 Mar, 2026 1 commit
-
-
Roberto L. Castro authored
[Attention][Perf][Kernel] Replace torch.cat with vectorized CUDA kernel MLA query concat - DeepSeek-V3.2 (#34917) Signed-off-by:
LopezCastroRoberto <rocastro@redhat.com> Signed-off-by:
Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com>
-
- 02 Mar, 2026 1 commit
-
-
EdalatiAli authored
Signed-off-by:
EdalatiAli <aliedalati@cohere.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 01 Mar, 2026 1 commit
-
-
Asaf Gardin authored
Signed-off-by:Josephasafg <ajgard7@gmail.com>
-
- 26 Feb, 2026 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
- 24 Feb, 2026 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 18 Feb, 2026 2 commits
-
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 10 Feb, 2026 1 commit
-
-
Roberto L. Castro authored
Signed-off-by:
LopezCastroRoberto <rocastro@redhat.com> Signed-off-by:
Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com> Co-authored-by:
Claude Sonnet 4.5 <noreply@anthropic.com>
-
- 28 Jan, 2026 1 commit
-
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 25 Jan, 2026 1 commit
-
-
Roberto L. Castro authored
Signed-off-by:LopezCastroRoberto <rocastro@redhat.com>
-
- 23 Jan, 2026 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 22 Jan, 2026 1 commit
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 18 Jan, 2026 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 10 Jan, 2026 1 commit
-
-
PatrykSaffer authored
Signed-off-by:
Patryk Saffer <patryk.saffer99@gmail.com> Signed-off-by:
PatrykSaffer <patryk.saffer@mistral.ai> Co-authored-by:
Patryk Saffer <patryk.saffer99@gmail.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 09 Jan, 2026 3 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Wentao Ye authored
[Perf] Optimize cutlass moe problem size calculation, 5.3% E2E Throughput improvement, 2.2% TTFT improvement (#31830) Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com>
-
- 24 Dec, 2025 1 commit
-
-
rongfu.leng authored
Signed-off-by:rongfu.leng <rongfu.leng@daocloud.io>
-
- 19 Dec, 2025 1 commit
-
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
- 12 Dec, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 09 Dec, 2025 1 commit
-
-
czhu-cohere authored
Signed-off-by:czhu-cohere <conway.zhu@cohere.com>
-
- 08 Dec, 2025 1 commit
-
-
Daniel Cámpora authored
Signed-off-by:
Daniel Campora <961215+dcampora@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 07 Dec, 2025 2 commits
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Signed-off-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
[Perf] Deepgemm fused layout kernel for activations, 4.3% throughput improvement, 10.7% TTFT improvement. (#29546) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-