- 19 May, 2025 10 commits
-
-
Satyajith Chilappagari authored
Signed-off-by:Satyajith Chilappagari <satchill@amazon.com>
-
sunyicode0012 authored
Add files via uploadAdd fused MoE kernel tuning configs (fp8_w8a8) for DeepSeek V3/R1 on a single-node 8x NVIDIA H20 96GB setup (#18337)
-
Wenhua Cheng authored
Signed-off-by:wenhuach21 <wenhua.cheng@intel.com>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Nicolò Lucchesi <nicolo.lucchesi@gmail.com>
-
Shaoyu Yang authored
Signed-off-by:
cascade812 <cascade812@outlook.com> Signed-off-by:
shaoyuyoung <shaoyuyoung@gmail.com> Co-authored-by:
cascade <cascade812@outlook.com>
-
CYJiang authored
-
Nan Qin authored
Signed-off-by:
Andrew Sansom <andrew@protopia.ai> Signed-off-by:
Nan2018 <nan@protopia.ai> Co-authored-by:
临景 <linjing.yx@alibaba-inc.com> Co-authored-by:
Bryce1010 <bryceyx@gmail.com> Co-authored-by:
Andrew Sansom <andrew@protopia.ai> Co-authored-by:
Andrew Sansom <qthequartermasterman@gmail.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
wwl2755 authored
Signed-off-by:wwl2755 <wangwenlong2755@gmail.com>
-
- 18 May, 2025 2 commits
-
-
Lifu Huang authored
Signed-off-by:Lifu Huang <lifu.hlf@gmail.com>
-
22quinn authored
Signed-off-by:22quinn <33176974+22quinn@users.noreply.github.com>
-
- 17 May, 2025 8 commits
-
-
cascade authored
Signed-off-by:cascade812 <cascade812@outlook.com>
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-
rongfu.leng authored
-
Siyuan Liu authored
Signed-off-by:
Siyuan Liu <lsiyuan@google.com> Signed-off-by:
Jade Zheng <zheng.shoujian@outlook.com> Co-authored-by:
Carol Zheng <cazheng@google.com> Co-authored-by:
Jade Zheng <zheng.shoujian@outlook.com> Co-authored-by:
Hongmin Fan <fanhongmin@google.com>
-
David Ben-David authored
Signed-off-by:
David Ben-David <davidb@pliops.com> Co-authored-by:
David Ben-David <davidb@pliops.com>
-
汪志鹏 authored
Signed-off-by:
汪志鹏 <wangzhipeng628@gmail.com> Co-authored-by:
tracelogfb <48808670+tracelogfb@users.noreply.github.com> Co-authored-by:
Stephen Chen <tracelog@meta.com>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 16 May, 2025 15 commits
-
-
Woosuk Kwon authored
-
Bowen Wang authored
Signed-off-by:
Bowen Wang <abmfy@icloud.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
learner0810 authored
Signed-off-by:learner0810 <zhongjun.li@daocloud.io>
-
fxmarty-amd authored
Signed-off-by:Felix Marty <felmarty@amd.com>
-
Lain authored
-
Seiji Eicher authored
Signed-off-by:Seiji Eicher <seiji@anyscale.com>
-
Vadim Gimpelson authored
Signed-off-by:Vadim Gimpelson <vadim.gimpelson@centml.ai>
-
Lucia Fang authored
Signed-off-by:Lucia Fang <fanglu@fb.com>
-
Will Eaton authored
Signed-off-by:Will Eaton <weaton@redhat.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Sky Lee authored
Signed-off-by:
lisiqi23 <lisiqi23@xiaomi.com> Signed-off-by:
skylee-01 <497627264@qq.com> Co-authored-by:
lisiqi23 <lisiqi23@xiaomi.com>
-
kliuae authored
Signed-off-by:kf <kuanfu.liu@embeddedllm.com>
-
- 15 May, 2025 5 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
TJian authored
[Bugfix] [ROCm]: Remove assertion logic when using AITER fused moe in unquantizedMethod to reenable LLama4 BF16 (#18205) Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
Zhonghua Deng authored
Signed-off-by:Abatom <abzhonghua@gmail.com>
-
Sebastian Schoennenbeck authored
Signed-off-by:Sebastian Schönnenbeck <sebastian.schoennenbeck@comma-soft.com>
-
Thomas Parnell authored
Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com>
-