- 03 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 27 Jan, 2026 1 commit
-
-
rasmith authored
Signed-off-by:
Randall Smith <ransmith@amd.com> Signed-off-by:
Randall Smith <Randall.Smith@amd.com> Co-authored-by:
Randall Smith <ransmith@amd.com>
-
- 26 Jan, 2026 1 commit
-
-
dolpm authored
Signed-off-by:dolpm <34420038+dolpm@users.noreply.github.com>
-
- 25 Jan, 2026 1 commit
-
-
Roberto L. Castro authored
Signed-off-by:LopezCastroRoberto <rocastro@redhat.com>
-
- 23 Jan, 2026 3 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
rasmith authored
[CI][AMD][BugFix] Update wvSplitK (and other skinny_gemm wrappers) to ensure tensors passed will be made contiguous for the kernel (#32831) Signed-off-by:
Randall Smith <ransmith@amd.com> Co-authored-by:
Randall Smith <ransmith@amd.com>
-
Li, Jiang authored
Signed-off-by:
jiang1.li <jiang1.li@intel.com> Signed-off-by:
Li, Jiang <bigpyj64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 22 Jan, 2026 3 commits
-
-
Eldar Kurtić authored
Signed-off-by:
Eldar Kurtic <8884008+eldarkurtic@users.noreply.github.com> Signed-off-by:
eldarkurtic <8884008+eldarkurtic@users.noreply.github.com>
-
Fadi Arafeh authored
[CPU Backend] [Perf] Accelerate tensor-parallel/data-parallel inference across NUMA domains on Arm (#32792) Signed-off-by:Fadi Arafeh <fadi.arafeh@arm.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 21 Jan, 2026 4 commits
-
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
zhuwenwen authored
-
zhuwenwen authored
-
- 20 Jan, 2026 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 18 Jan, 2026 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 16 Jan, 2026 3 commits
-
-
Hashem Hashemi authored
Signed-off-by:Hashem Hashemi <hashem.hashemi@amd.com>
-
zhuwenwen authored
区分pcie和hglink custom allreduce的使用 vllm:export VLLM_CUSTOM_CACHE=1 dtk:export HIP_KERNEL_EVENT_SYSTENFENCE=1 set VLLM_USE_FUSED_RMS_ROPE=1 add SUPPORT_MOE_MARLIN_W16A16 to use moe marlin on bw support fa kvcache fp8 (todo: add VLLM_USE_QUERY_QUANT to not use q quant) update moe_align_block_size
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 13 Jan, 2026 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 10 Jan, 2026 2 commits
-
-
Kevin McKay authored
Signed-off-by:c0de128 <kevin.mckay@outlook.com>
-
PatrykSaffer authored
Signed-off-by:
Patryk Saffer <patryk.saffer99@gmail.com> Signed-off-by:
PatrykSaffer <patryk.saffer@mistral.ai> Co-authored-by:
Patryk Saffer <patryk.saffer99@gmail.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 09 Jan, 2026 4 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Wentao Ye authored
[Perf] Optimize cutlass moe problem size calculation, 5.3% E2E Throughput improvement, 2.2% TTFT improvement (#31830) Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com>
-
R3hankhan authored
Signed-off-by:Rehan Khan <Rehan.Khan7@ibm.com>
-
- 08 Jan, 2026 1 commit
-
-
zhuwenwen authored
-
- 07 Jan, 2026 2 commits
-
-
Xin Yang authored
Signed-off-by:
Xin Yang <xyangx@amazon.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 06 Jan, 2026 3 commits
-
-
Jinzhen Lin authored
Signed-off-by:
Jinzhen Lin <jinzhen.ljz@antgroup.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
zhuwenwen authored
-
zhuwenwen authored
-
- 29 Dec, 2025 1 commit
-
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 24 Dec, 2025 3 commits
-
-
skaraban3807 authored
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
rongfu.leng authored
Signed-off-by:rongfu.leng <rongfu.leng@daocloud.io>
-
- 23 Dec, 2025 1 commit
-
-
danielafrimi authored
Signed-off-by: <> Co-authored-by:
root <root@gpu-193.slurm-workers-slurm.slurm.svc.cluster.local> Co-authored-by:
root <root@gpu-951.slurm-workers-slurm.slurm.svc.cluster.local>
-
- 22 Dec, 2025 2 commits
-
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
- 21 Dec, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-