- 03 Jun, 2025 1 commit
-
-
Simon Mo authored
Signed-off-by: simon-mo <simon.mo@hey.com>
-
- 29 May, 2025 2 commits
-
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 28 May, 2025 1 commit
-
-
Hongxia Yang authored
[Bugfix][ROCm] fix the power of 2 exception from triton_unified_attention.py when running llama4 models and unit test fix (#18100)
Signed-off-by: Hongxia Yang <hongxia.yang@amd.com>
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
Co-authored-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
- 21 May, 2025 1 commit
-
-
Hosang authored
Signed-off-by: Hosang Yoon <hosang.yoon@amd.com>
-
- 20 May, 2025 1 commit
-
-
Percy authored
Signed-off-by: haochengxia <xhc_1007@163.com>
-
- 15 May, 2025 2 commits
-
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 14 May, 2025 1 commit
-
-
qli88 authored
Signed-off-by: Qiang Li <qiang.li2@amd.com>
-
- 11 May, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 09 May, 2025 3 commits
-
-
Michael Goin authored
-
qli88 authored
Signed-off-by: Qiang Li <qiang.li2@amd.com>
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Co-authored-by: qli88 <qiang.li2@amd.com>
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>
-
- 08 May, 2025 1 commit
-
-
Agata Dobrzyniewicz authored
Signed-off-by: Agata Dobrzyniewicz <adobrzyniewicz@habana.ai>
-
- 06 May, 2025 2 commits
-
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Mengqing Cao authored
Signed-off-by: Mengqing Cao <cmq0113@163.com>
-
- 02 May, 2025 1 commit
-
-
Hui Liu authored
Signed-off-by: Hui Liu <96135754+hliuca@users.noreply.github.com>
-
- 01 May, 2025 1 commit
-
-
Hongxia Yang authored
Signed-off-by: Hongxia Yang <hongxia.yang@amd.com>
-
- 30 Apr, 2025 1 commit
-
-
Huy Do authored
-
- 27 Apr, 2025 1 commit
-
-
rasmith authored
[Kernel][Triton][FP8] Adding fp8 and variable length sequence support to Triton FAv2 kernel (#12591)
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
- 26 Apr, 2025 1 commit
-
-
Agata Dobrzyniewicz authored
Signed-off-by: Agata Dobrzyniewicz <adobrzyniewicz@habana.ai>
-
- 23 Apr, 2025 1 commit
-
-
Aleksandr Malyshev authored
Signed-off-by: Sage Moore <sage@neuralmagic.com>
Signed-off-by: root <root@banff-cyxtera-s73-5.ctr.dcgpu>
Signed-off-by: Aleksandr Malyshev <maleksan@amd.com>
Signed-off-by: root <root@banff-cyxtera-s65-4.amd.com>
Signed-off-by: maleksan85 <maleksan@amd.com>
Signed-off-by: <>
Co-authored-by: Sage Moore <sage@neuralmagic.com>
Co-authored-by: root <root@banff-cyxtera-s73-5.ctr.dcgpu>
Co-authored-by: Aleksandr Malyshev <maleksan@amd.com>
Co-authored-by: qli88 <qiang.li2@amd.com>
Co-authored-by: root <root@banff-cyxtera-s65-4.amd.com>
-
- 22 Apr, 2025 2 commits
-
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Co-authored-by: qli88 <qiang.li2@amd.com>
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
Co-authored-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
- 15 Apr, 2025 1 commit
-
-
DefTruth authored
Signed-off-by: DefTruth <qiustudent_r@163.com>
-
- 14 Apr, 2025 1 commit
-
-
DefTruth authored
Signed-off-by: DefTruth <qiustudent_r@163.com>
-
- 11 Apr, 2025 1 commit
-
-
DefTruth authored
Signed-off-by: DefTruth <qiustudent_r@163.com>
-
- 09 Apr, 2025 1 commit
-
-
yihong authored
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
-
- 04 Apr, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 03 Apr, 2025 2 commits
-
-
Liangfu Chen authored
Signed-off-by: Liangfu Chen <liangfc@amazon.com>
-
Aleksandr Malyshev authored
Signed-off-by: Aleksandr Malyshev <maleksan@amd.com>
Signed-off-by: root <root@banff-cyxtera-s65-4.amd.com>
Co-authored-by: Aleksandr Malyshev <maleksan@amd.com>
Co-authored-by: root <root@banff-cyxtera-s65-4.amd.com>
-
- 26 Mar, 2025 1 commit
-
-
Lucas Wilkinson authored
[BugFix] Fix nightly MLA failure (FA2 + MLA chunked prefill, i.e. V1, producing bad results) (#15492)
Signed-off-by: LucasWilkinson <lwilkinson@neuralmagic.com>
-
- 14 Mar, 2025 1 commit
-
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Co-authored-by: Jan van Lunteren <jvl@zurich.ibm.com>
Co-authored-by: Burkhard Ringlein <ngl@zurich.ibm.com>
Co-authored-by: Chih-Chieh Yang <chih.chieh.yang@ibm.com>
-
- 12 Mar, 2025 1 commit
-
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
-
- 11 Mar, 2025 1 commit
-
-
Liangfu Chen authored
-
- 06 Mar, 2025 2 commits
-
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Co-authored-by: Burkhard Ringlein <ngl@zurich.ibm.com>
Co-authored-by: Jan van Lunteren <jvl@zurich.ibm.com>
-
Thomas Parnell authored
-
- 27 Feb, 2025 2 commits
-
-
qli88 authored
Signed-off-by: qli88 <qiang.li2@amd.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
- 22 Feb, 2025 1 commit
-
-
Sage Moore authored
[V1][Kernel] Refactor the prefix_prefill kernel so that the caller no longer has to pass in the context lengths (#13095)
-