- 02 May, 2025 3 commits
-
-
Hui Liu authored
Signed-off-by:Hui Liu <96135754+hliuca@users.noreply.github.com>
-
Andrew Sansom authored
Signed-off-by:
Andrew Sansom <andrew@protopia.ai> Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Co-authored-by:
临景 <linjing.yx@alibaba-inc.com> Co-authored-by:
Bryce1010 <bryceyx@gmail.com> Co-authored-by:
Nan2018 <nan@protopia.ai> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
DarkLight1337 <tlleungac@connect.ust.hk>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 01 May, 2025 2 commits
-
-
Hongxia Yang authored
Signed-off-by:Hongxia Yang <hongxia.yang@amd.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 30 Apr, 2025 2 commits
-
-
Kunshang Ji authored
Signed-off-by:
Kunshang Ji <kunshang.ji@intel.com> Co-authored-by:
Qiming Zhang <qiming1.zhang@intel.com>
-
Huy Do authored
-
- 28 Apr, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
Aaron Pham <contact@aarnphm.xyz>
-
- 27 Apr, 2025 2 commits
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
rasmith authored
[Kernel][Triton][FP8] Adding fp8 and variable length sequence support to Triton FAv2 kernel (#12591) Signed-off-by:Randall Smith <Randall.Smith@amd.com>
-
- 26 Apr, 2025 2 commits
-
-
Agata Dobrzyniewicz authored
Signed-off-by:Agata Dobrzyniewicz <adobrzyniewicz@habana.ai>
-
Shu Wang authored
Signed-off-by:shuw <shuw@nvidia.com>
-
- 25 Apr, 2025 1 commit
-
-
rasmith authored
[Quantization][FP8] Add support for FP8 models with input_scale for output projection and QK quantization (#15734) Signed-off-by:
Randall Smith <Randall.Smith@amd.com> Signed-off-by:
Luka Govedič <lgovedic@redhat.com> Co-authored-by:
Luka Govedič <lgovedic@redhat.com>
-
- 23 Apr, 2025 2 commits
-
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Aleksandr Malyshev authored
Signed-off-by:
Sage Moore <sage@neuralmagic.com> Signed-off-by:
root <root@banff-cyxtera-s73-5.ctr.dcgpu> Signed-off-by:
Aleksandr Malyshev <maleksan@amd.com> Signed-off-by:
root <root@banff-cyxtera-s65-4.amd.com> Signed-off-by:
maleksan85 <maleksan@amd.com> Signed-off-by: <> Co-authored-by:
Sage Moore <sage@neuralmagic.com> Co-authored-by:
root <root@banff-cyxtera-s73-5.ctr.dcgpu> Co-authored-by:
Aleksandr Malyshev <maleksan@amd.com> Co-authored-by:
qli88 <qiang.li2@amd.com> Co-authored-by:
root <root@banff-cyxtera-s65-4.amd.com>
-
- 22 Apr, 2025 3 commits
-
-
vllmellm authored
Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
qli88 <qiang.li2@amd.com>
-
Zhengyuan Su (苏政渊) authored
Signed-off-by:
苏政渊 <suzhengyuan@moonshot.cn> Co-authored-by:
苏政渊 <suzhengyuan@moonshot.cn>
-
vllmellm authored
Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
tjtanaa <tunjian.tan@embeddedllm.com>
-
- 21 Apr, 2025 1 commit
-
-
Yan Ma authored
Signed-off-by:yan ma <yan.ma@intel.com>
-
- 18 Apr, 2025 1 commit
-
-
Luka Govedič authored
Signed-off-by:Luka Govedič <lgovedic@redhat.com>
-
- 17 Apr, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Yihua Cheng authored
Signed-off-by:
ApostaC <yihua98@uchicago.edu> Signed-off-by:
rshaw@neuralmagic.com <robertgshaw2@gmail.com> Signed-off-by:
remi <remi@mistral.ai> Co-authored-by:
rshaw@neuralmagic.com <robertgshaw2@gmail.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Rémi Delacourt <54138269+Flechman@users.noreply.github.com> Co-authored-by:
Tyler Michael Smith <tysmith@redhat.com>
-
- 15 Apr, 2025 1 commit
-
-
DefTruth authored
Signed-off-by:DefTruth <qiustudent_r@163.com>
-
- 14 Apr, 2025 1 commit
-
-
DefTruth authored
Signed-off-by:DefTruth <qiustudent_r@163.com>
-
- 11 Apr, 2025 1 commit
-
-
DefTruth authored
Signed-off-by:DefTruth <qiustudent_r@163.com>
-
- 09 Apr, 2025 3 commits
-
-
yihong authored
Signed-off-by:yihong0618 <zouzou0208@gmail.com>
-
yihong authored
Signed-off-by:yihong0618 <zouzou0208@gmail.com>
-
TJian authored
[Bug] [ROCm] Fix Llama 4 Enablement Bug on ROCm: V0 ROCmFlashAttentionImpl and Triton Fused MoE bugs (#16198) Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Signed-off-by:
kliuae <kuanfu.liu@embeddedllm.com> Co-authored-by:
Hongxia Yang <hongxia.yang@amd.com> Co-authored-by:
kliuae <kuanfu.liu@embeddedllm.com>
-
- 08 Apr, 2025 1 commit
-
-
Yong Hoon Shin authored
-
- 06 Apr, 2025 1 commit
-
-
Lucia Fang authored
Signed-off-by:Lu Fang <fanglu@fb.com>
-
- 04 Apr, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 03 Apr, 2025 2 commits
-
-
Liangfu Chen authored
Signed-off-by:Liangfu Chen <liangfc@amazon.com>
-
Aleksandr Malyshev authored
Signed-off-by:
Aleksandr Malyshev <maleksan@amd.com> Signed-off-by:
root <root@banff-cyxtera-s65-4.amd.com> Co-authored-by:
Aleksandr Malyshev <maleksan@amd.com> Co-authored-by:
root <root@banff-cyxtera-s65-4.amd.com>
-
- 27 Mar, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 26 Mar, 2025 2 commits
-
-
cyyever authored
Signed-off-by:cyy <cyyever@outlook.com>
-
Lucas Wilkinson authored
[BugFix] Fix nightly MLA failure (FA2 + MLA chunked prefill, i.e. V1, producing bad results) (#15492) Signed-off-by:LucasWilkinson <lwilkinson@neuralmagic.com>
-
- 25 Mar, 2025 1 commit
-
-
Thien Tran authored
Signed-off-by:Thien Tran <gau.nernst@yahoo.com.sg>
-
- 23 Mar, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com>
-
- 22 Mar, 2025 1 commit
-
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
- 21 Mar, 2025 1 commit
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-