- 04 Sep, 2025 1 commit
-
-
elvischenv authored
[Bugfix][Misc] Fix silu_and_mul_nvfp4_quant issue and extract common utils for nvfp4 kernel source files (#23727) Signed-off-by:
elvischenv <219235043+elvischenv@users.noreply.github.com> Signed-off-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 29 Aug, 2025 1 commit
-
-
yzds authored
Signed-off-by:
hongchao <hongchao@msh.team> Signed-off-by:
Richard Zou <zou3519@gmail.com> Co-authored-by:
hongchao <hongchao@msh.team> Co-authored-by:
Richard Zou <zou3519@gmail.com> Co-authored-by:
Richard Zou <zou3519@users.noreply.github.com>
-
- 28 Aug, 2025 2 commits
-
-
elvischenv authored
Signed-off-by:
jindih <jindih@nvidia.com> Signed-off-by:
elvischenv <219235043+elvischenv@users.noreply.github.com> Co-authored-by:
jindih <jindih@nvidia.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Luka Govedic <lgovedic@redhat.com>
-
yzds authored
Co-authored-by:hongchao <hongchao@msh.team>
-
- 24 Aug, 2025 1 commit
-
-
czhu-cohere authored
Signed-off-by:czhu-cohere <conway.zhu@cohere.com>
-
- 22 Aug, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni001@gmail.com>
-
- 20 Aug, 2025 2 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
shixianc authored
Signed-off-by:Shixian Cui <shixian@amazon.com>
-
- 17 Aug, 2025 1 commit
-
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
- 16 Aug, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
- 15 Aug, 2025 3 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Simon Mo authored
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
- 14 Aug, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by:
rongfu.leng <rongfu.leng@daocloud.io> Signed-off-by:
Jinzhen Lin <linjinzhen@hotmail.com> Signed-off-by:
Huzaifa Sidhpurwala <huzaifas@redhat.com> Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
Jee Jee Li <pandaleefree@gmail.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Animesh Jain <anijain@umich.edu> Signed-off-by:
Rui Qiao <ruisearch42@gmail.com> Signed-off-by:
Xiongfei Wei <isaacwxf23@gmail.com> Signed-off-by:
Nick Hill <nhill@redhat.com> Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
kf <kuanfu.liu@embeddedllm.com> Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Signed-off-by:
NickLucche <nlucches@redhat.com> Signed-off-by:
Dipika Sikka <dipikasikka1@gmail.com> Signed-off-by:
Sage Moore <sage@neuralmagic.com> Signed-off-by:
tjtanaavllm <tunjian.tan@amd.com> Signed-off-by:
Yong Hoon Shin <yhshin@meta.com> Signed-off-by:
Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com> Signed-off-by:
Roger Wang <hey@rogerw.me> Signed-off-by:
Vadim Gimpelson <vadim.gimpelson@centml.ai> Signed-off-by:
Isotr0py <2037008807@qq.com> Signed-off-by:
zRzRzRzRzRzRzR <2448370773@qq.com> Signed-off-by:
Chih-Chieh Yang <7364402+cyang49@users.noreply.github.com> Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Signed-off-by:
yan <yan.ma@intel.com> Signed-off-by:
Yan Ma <yan.ma@intel.com> Signed-off-by:
Xiao Liu <xiszishu@gmail.com> Signed-off-by:
jiahanc <173873397+jiahanc@users.noreply.github.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by:
Ye (Charlotte) Qi <yeq@meta.com> Signed-off-by:
LopezCastroRoberto <roberto.lopez.castro@udc.es> Signed-off-by:
Andy Xie <andy.xning@gmail.com> Signed-off-by:
Haibin Lin <haibin.lin@bytedance.com> Signed-off-by:
David Ben-David <davidb@pliops.com> Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by:
jiang1.li <jiang1.li@intel.com> Signed-off-by:
Seiji Eicher <seiji@anyscale.com> Signed-off-by:
zitian.zhao <zitian.zhao@tencentmusic.com> Signed-off-by:
22quinn <33176974+22quinn@users.noreply.github.com> Signed-off-by:
Abirdcfly <fp544037857@gmail.com> Signed-off-by:
Giancarlo Delfin <gdelfin@meta.com> Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com> Signed-off-by:
huangweixiao <huangweixiao@msh.team> Signed-off-by:
alyosha-swamy <raghav@arcee.ai> Signed-off-by:
Eric Hanley <ericehanley@google.com> Signed-off-by:
Abatom <abzhonghua@gmail.com> Signed-off-by:
CLFutureX <775523362@qq.com> Signed-off-by:
Linkun Chen <github@lkchen.net> Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Signed-off-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Signed-off-by:
tlipoca9 <tlipoca9@gmail.com> Signed-off-by:
elvischenv <219235043+elvischenv@users.noreply.github.com> Signed-off-by:
zitian zhao <zitian.zhao@tencentmusic.com> Signed-off-by:
mgoin <michael@neuralmagic.com> Signed-off-by:
wang.yuqi <noooop@126.com> Signed-off-by:
Benji Beck <benjibeck@meta.com> Signed-off-by:
Siyuan Liu <lsiyuan@google.com> Signed-off-by:
Benjamin Chislett <benjamin.chislett@centml.ai> Signed-off-by:
isotr0py <2037008807@qq.com> Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Signed-off-by:
simon-mo <xmo@berkeley.edu> Signed-off-by:
LucasWilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
Zhang Jason <ning.zhang2@amd.com> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Signed-off-by:
asafg <asafg@ai21.com> Signed-off-by:
Siyuan Fu <siyuanf@nvidia.com> Signed-off-by:
Lain <fusiyuan2000@hotmail.com> Signed-off-by:
Max de Bayser <mbayser@br.ibm.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Kunshang Ji <kunshang.ji@intel.com> Signed-off-by:
Tao He <linzhu.ht@alibaba-inc.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Signed-off-by:
QscQ <qscqesze@gmail.com> Signed-off-by:
qingjun <qingjun@minimaxi.com> Signed-off-by:
Syed Muhammad Bin Asif <syedmba7@connect.hku.hk> Signed-off-by:
Lionel Villard <villard@us.ibm.com> Signed-off-by:
ycyaw66 <497410282@qq.com> Signed-off-by:
David Chen <530634352@qq.com> Signed-off-by:
Linkun <github@lkchen.net> Signed-off-by:
Moritz Sanft <58110325+msanft@users.noreply.github.com> Signed-off-by:
Ming Yang <minos.future@gmail.com> Signed-off-by:
Adrian Garcia <adrian.garcia@inceptionai.ai> Signed-off-by:
shaojunqi <shaojunqi.sjq@alibaba-inc.com> Signed-off-by:
Ricardo Decal <rdecal@anyscale.com> Signed-off-by:
Andrew Chan <andrewkchan.akc@gmail.com> Signed-off-by:
Felix Marty <Felix.Marty@amd.com> Signed-off-by:
Andrew Sansom <andrew@protopia.ai> Signed-off-by:
Zhiyu Cheng <zhiyuc@nvidia.com> Signed-off-by:
Shu Wang <shuw@nvidia.com> Signed-off-by:
Po-Han Huang <pohanh@nvidia.com> Signed-off-by:
Shu Wang. <shuw@nvidia.com> Signed-off-by:
XIn Li <xinli@nvidia.com> Signed-off-by:
Junhao Li <junhao@ubicloud.com> Signed-off-by:
chaunceyjiang <chaunceyjiang@gmail.com> Signed-off-by:
iAmir97 <Amir.balwel@embeddedllm.com> Signed-off-by:
iAmir97 <71513472+iAmir97@users.noreply.github.com> Signed-off-by: <zyy1102000@gmail.com> Signed-off-by:
Guy Stone <guys@spotify.com> Signed-off-by: <yyweiss@gmail.com> Signed-off-by:
yyw <yyweiss@gmail.com> Signed-off-by:
Russell Bryant <rbryant@redhat.com> Signed-off-by:
Pradyun Ramadorai <pradyunr@amazon.com> Signed-off-by:
Pradyun92 <142861237+Pradyun92@users.noreply.github.com> Signed-off-by:
Jinzhen Lin <jinzhen.ljz@antgroup.com> Co-authored-by:
rongfu.leng <rongfu.leng@daocloud.io> Co-authored-by:
Huzaifa Sidhpurwala <huzaifas@redhat.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Russell Bryant <rbryant@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <varunsundar08@gmail.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Animesh Jain <jainanimesh2305@yahoo.com> Co-authored-by:
Rui Qiao <161574667+ruisearch42@users.noreply.github.com> Co-authored-by:
XiongfeiWei <isaacwxf23@gmail.com> Co-authored-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
JartX <sagformas@gmail.com> Co-authored-by:
fhl2000 <63384265+fhl2000@users.noreply.github.com> Co-authored-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
kf <kuanfu.liu@embeddedllm.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com> Co-authored-by:
Dipika Sikka <dipikasikka1@gmail.com> Co-authored-by:
Sage Moore <sage@neuralmagic.com> Co-authored-by:
tjtanaavllm <tunjian.tan@amd.com> Co-authored-by:
Yong Hoon Shin <48474650+sarckk@users.noreply.github.com> Co-authored-by:
Chih-Chieh Yang <7364402+cyang49@users.noreply.github.com> Co-authored-by:
Roger Wang <hey@rogerw.me> Co-authored-by:
Vadim Gimpelson <156319763+vadiklyutiy@users.noreply.github.com> Co-authored-by:
Yuxuan Zhang <2448370773@qq.com> Co-authored-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk> Co-authored-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Yan Ma <yan.ma@intel.com> Co-authored-by:
Xiao <xiszishu@gmail.com> Co-authored-by:
jiahanc <173873397+jiahanc@users.noreply.github.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Ye (Charlotte) Qi <yeq@meta.com> Co-authored-by:
Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com> Co-authored-by:
Ning Xie <andy.xning@gmail.com> Co-authored-by:
H <linhaibin.eric@gmail.com> Co-authored-by:
David Ben-David <sdavidbd@gmail.com> Co-authored-by:
David Ben-David <davidb@pliops.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
Li, Jiang <jiang1.li@intel.com> Co-authored-by:
TankNee <nee@tanknee.cn> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Seiji Eicher <58963096+eicherseiji@users.noreply.github.com> Co-authored-by:
ZiTian.Zhao <zitian.zhao@tencentmusic.com> Co-authored-by:
22quinn <33176974+22quinn@users.noreply.github.com> Co-authored-by:
Abirdcfly <fp544037857@gmail.com> Co-authored-by:
Giancarlo Delfin <32987265+TheEpicDolphin@users.noreply.github.com> Co-authored-by:
Chenxi Yang <cxyang@cs.utexas.edu> Co-authored-by:
Chenxi Yang <cxyang@meta.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Weixiao Huang <hwx.simle@gmail.com> Co-authored-by:
Raghav Ravishankar <113712354+alyosha-swamy@users.noreply.github.com> Co-authored-by:
ericehanley <ericehanley@google.com> Co-authored-by:
Zhonghua Deng <abzhonghua@gmail.com> Co-authored-by:
Po-Han Huang (NVIDIA) <53919306+nvpohanh@users.noreply.github.com> Co-authored-by:
PiteXChen <44110731+CLFutureX@users.noreply.github.com> Co-authored-by:
lkchen <github@lkchen.net> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com> Co-authored-by:
Gregory Shtrasberg <156009573+gshtras@users.noreply.github.com> Co-authored-by:
tlipoca9 <160737620+tlipoca9@users.noreply.github.com> Co-authored-by:
elvischenv <219235043+elvischenv@users.noreply.github.com> Co-authored-by:
wang.yuqi <noooop@126.com> Co-authored-by:
Benji Beck <benjibeck@meta.com> Co-authored-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Siyuan Liu <lsiyuan@google.com> Co-authored-by:
Benjamin Chislett <chislett.ben@gmail.com> Co-authored-by:
LiuXiaoxuanPKU <lilyliupku@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Co-authored-by:
Minseok Lee <47620120+minseokl@users.noreply.github.com> Co-authored-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Zhang Jason <ning.zhang2@amd.com> Co-authored-by:
Asaf Joseph Gardin <39553475+Josephasafg@users.noreply.github.com> Co-authored-by:
asafg <asafg@ai21.com> Co-authored-by:
Lain <siyuanf@nvidia.com> Co-authored-by:
tc-mb <157115220+tc-mb@users.noreply.github.com> Co-authored-by:
imning3 <hbning@pku.edu.cn> Co-authored-by:
Maximilien de Bayser <mbayser@br.ibm.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com> Co-authored-by:
Tao He <linzhu.ht@alibaba-inc.com> Co-authored-by:
qscqesze <qingjun@minimaxi.com> Co-authored-by:
Syed Muhammad Bin Asif <92625830+syedmba@users.noreply.github.com> Co-authored-by:
Lionel Villard <villard@us.ibm.com> Co-authored-by:
WeiQing Chen <40507679+david6666666@users.noreply.github.com> Co-authored-by:
ycyaw66 <497410282@qq.com> Co-authored-by:
Moritz Sanft <58110325+msanft@users.noreply.github.com> Co-authored-by:
Ming Yang <minos.future@gmail.com> Co-authored-by:
Adrián García García <adrigarvk8@gmail.com> Co-authored-by:
Michael Goin <mgoin@redhat.com> Co-authored-by:
JaceyShao <65159281+JaceyShao@users.noreply.github.com> Co-authored-by:
shaojunqi <shaojunqi.sjq@alibaba-inc.com> Co-authored-by:
Ricardo Decal <crypdick@users.noreply.github.com> Co-authored-by:
Andrew Chan <andrewkchan.akc@gmail.com> Co-authored-by:
fxmarty-amd <felmarty@amd.com> Co-authored-by:
Andrew Sansom <andrew@protopia.ai> Co-authored-by:
Zhiyu <zhiyuc@nvidia.com> Co-authored-by:
Shu Wang <shuw@nvidia.com> Co-authored-by:
XIn Li <xinli@nvidia.com> Co-authored-by:
Junhao Li <streaver91@gmail.com> Co-authored-by:
Chauncey <chaunceyjiang@gmail.com> Co-authored-by:
iAmir97 <71513472+iAmir97@users.noreply.github.com> Co-authored-by:
iAmir97 <Amir.balwel@embeddedllm.com> Co-authored-by:
Hong Hanh <hanh.usth@gmail.com> Co-authored-by:
Daniel Serebrenik <74646983+pliops-daniels@users.noreply.github.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Guy Stone <guys@spotify.com> Co-authored-by:
yyweiss <70619747+yyweiss@users.noreply.github.com> Co-authored-by:
Pradyun92 <142861237+Pradyun92@users.noreply.github.com> Co-authored-by:
Pradyun Ramadorai <pradyunr@amazon.com> Co-authored-by:
Nicolò Lucchesi <nicolo.lucchesi@gmail.com>
-
- 26 Jul, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 23 Jul, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 22 Jul, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 18 Jul, 2025 1 commit
-
-
Richard Zou authored
Signed-off-by:rzou <zou3519@gmail.com>
-
- 16 Jul, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 15 Jul, 2025 1 commit
-
-
Alexander Matveev authored
Signed-off-by:Alexander Matveev <amatveev@redhat.com>
-
- 09 Jul, 2025 1 commit
-
-
Tuan, Hoang-Trong authored
Signed-off-by:
Tuan M. Hoang-Trong <tmhoangt@us.ibm.com> Co-authored-by:
Tuan M. Hoang-Trong <tmhoangt@us.ibm.com> Co-authored-by:
Tyler Michael Smith <tysmith@redhat.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
- 06 Jul, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 04 Jul, 2025 1 commit
-
-
Duncan Moss authored
Signed-off-by:
Duncan Moss <djm.moss@gmail.com> Co-authored-by:
Duncan Moss <dmoss@nvidia.com>
-
- 27 Jun, 2025 1 commit
-
-
li haoyang authored
Signed-off-by:
ilmarkov <imarkov@redhat.com> Signed-off-by:
Haoyang Li <Haoyang.Li@amd.com> Co-authored-by:
ilmarkov <imarkov@redhat.com>
-
- 07 Jun, 2025 1 commit
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
- 05 Jun, 2025 1 commit
-
-
Chiyue Wei authored
Signed-off-by:
Chiyue Wei <chiyuew@nvidia.com> Co-authored-by:
Chiyue Wei <chiyuew@nvidia.com>
-
- 04 Jun, 2025 1 commit
-
-
Vadim Gimpelson authored
-
- 27 May, 2025 1 commit
-
-
almersawi authored
Signed-off-by:
Islam Almersawi <islam.almersawi@openinnovation.ai> Co-authored-by:
Islam Almersawi <islam.almersawi@openinnovation.ai>
-
- 13 May, 2025 1 commit
-
-
Tao He authored
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)
-
- 11 May, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by:Jinzhen Lin <linjinzhen@hotmail.com>
-
- 09 May, 2025 1 commit
-
-
Pavani Majety authored
-
- 07 May, 2025 2 commits
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Szymon Ożóg authored
Signed-off-by:
SzymonOzog <szymon.ozog@aleph-alpha.com> Signed-off-by:
SzymonOzog <szymon.ozog@gmail.com> Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <2037008807@qq.com>
-
- 05 May, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by:Jinzhen Lin <linjinzhen@hotmail.com>
-
- 01 May, 2025 1 commit
-
-
Sage Moore authored
[torch.compile] Add torch inductor pass for fusing silu_and_mul with subsequent scaled_fp8_quant operations (#10867) Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
- 29 Apr, 2025 1 commit
-
-
TY-AMD authored
Signed-off-by:Tianyuan Wu <Tianyuan.Wu@amd.com>
-
- 27 Apr, 2025 1 commit
-
-
Kaixi Hou authored
Signed-off-by:kaixih <kaixih@nvidia.com>
-
- 11 Apr, 2025 1 commit
-
-
DefTruth authored
Signed-off-by:DefTruth <qiustudent_r@163.com>
-
- 02 Apr, 2025 1 commit
-
-
LukasBluebaum authored
Signed-off-by:lukas.bluebaum <lukas.bluebaum@aleph-alpha.com>
-
- 01 Apr, 2025 1 commit
-
-
Ilya Markov authored
Signed-off-by:
ilmarkov <imarkov@redhat.com> Co-authored-by:
ilmarkov <imarkov@redhat.com>
-