- 23 Mar, 2026 1 commit
-
-
Chuan (Richard) Li authored
Signed-off-by:Li <chuali@amd.com>
-
- 20 Mar, 2026 3 commits
-
-
Kaihang Jiang authored
Signed-off-by:Kaihang Jiang <kaihangj@nvidia.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Divakar Verma authored
Signed-off-by:Divakar Verma <divakar.verma@amd.com>
-
- 19 Mar, 2026 1 commit
-
-
Elvir Crnčević authored
Signed-off-by:
Elvir Crncevic <elvircrn@gmail.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>
-
- 18 Mar, 2026 2 commits
-
-
Andy Lo authored
[Bugfix] Fix KV scales inconsistency in fp8 MLA & FlashInfer kv_cache_dtype "auto" leading to gibberish (#37054) Signed-off-by:Andy Lo <andy@mistral.ai>
-
Divakar Verma authored
Signed-off-by:Divakar Verma <divakar.verma@amd.com>
-
- 17 Mar, 2026 3 commits
-
-
Andrey Talman authored
Signed-off-by:atalman <atalman@fb.com>
-
Benjamin Chislett authored
-
Vadim Gimpelson authored
Signed-off-by:
Vadim Gimpelson <vadim.gimpelson@gmail.com> Signed-off-by:
Vadim Gimpelson <156319763+vadiklyutiy@users.noreply.github.com> Co-authored-by:
Pavani Majety <pavanimajety@gmail.com>
-
- 16 Mar, 2026 6 commits
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
haosdent authored
Signed-off-by:
haosdent <haosdent@gmail.com> Co-authored-by:
Or Ozeri <oro@il.ibm.com>
-
Itay Etelis authored
Signed-off-by:
Itay Etelis <itay.etelis@ibm.com> Co-authored-by:
Itay Etelis <itay.etelis@ibm.com>
-
haosdent authored
Signed-off-by:haosdent <haosdent@gmail.com>
-
Vadim Gimpelson authored
Signed-off-by:Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
- 13 Mar, 2026 2 commits
-
-
Dimitrios Bariamis authored
Signed-off-by:
Dimitrios Bariamis <12195802+dbari@users.noreply.github.com> Co-authored-by:
Dimitrios Bariamis <12195802+dbari@users.noreply.github.com>
-
Rohan Potdar authored
Signed-off-by:Rohan138 <rohanpotdar138@gmail.com>
-
- 12 Mar, 2026 2 commits
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
grimulkan authored
Signed-off-by:grimulkan <grimulkan@gmail.com>
-
- 11 Mar, 2026 5 commits
-
-
Wuxun Zhang authored
Signed-off-by:Zhang, Wuxun <wuxun.zhang@intel.com>
-
JartX authored
[Bugfix] Add Multiple of 16 block_size to triton fallback on rocm Attention to support qwen3_5 (#35923) Signed-off-by:
JartX <sagformas@epdcenter.es> Co-authored-by:
akaratza <akaratza@amd.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com>
-
pschlan-amd authored
Signed-off-by:Patrick Schlangen <pschlan@amd.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
- 10 Mar, 2026 4 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk@inferact.ai>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk@inferact.ai>
-
- 09 Mar, 2026 4 commits
-
-
Andreas Karatzas authored
[ROCm][CI] Fix ROCm attention backend validation for head sizes, block sizes, and compute capability checks (#36292) Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
Roberto L. Castro authored
[Attention][Perf][Kernel] Replace torch.cat with vectorized CUDA kernel MLA query concat - DeepSeek-V3.2 (#34917) Signed-off-by:
LopezCastroRoberto <rocastro@redhat.com> Signed-off-by:
Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
cong-or authored
Signed-off-by:cong-or <conchubhar.gannon@gmail.com>
-
- 07 Mar, 2026 2 commits
-
-
Wei Zhao authored
-
Mengtao (Martin) Yuan authored
Signed-off-by:
Martin Yuan <myuan@meta.com> Co-authored-by:
Martin Yuan <myuan@meta.com>
-
- 06 Mar, 2026 4 commits
-
-
Chuan (Richard) Li authored
Signed-off-by:Li <chuali@amd.com>
-
Andreas Karatzas authored
[ROCm][CI] Fix tool use test stability - disable skinny GEMM, prefix caching, eliminate batch variance (#35553) Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
Rohan Potdar authored
Signed-off-by:Rohan138 <rohanpotdar138@gmail.com>
-
Dor Huri authored
Signed-off-by:dorhuri123 <dor.huri1@live.biu.ac.il>
-
- 05 Mar, 2026 1 commit
-
-
Frank Wang authored
Signed-off-by:frankwang28 <frank.wbb@hotmail.com>
-