- 28 Apr, 2025 24 commits
-
-
Charlie Fu authored
Signed-off-by:charlifu <charlifu@amd.com>
-
Lucas Wilkinson authored
[BugFix] Fix cascade attention - RuntimeError: scheduler_metadata must have shape (metadata_size) (#17283) Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Aaron Pham authored
Signed-off-by:
Aaron Pham <contact@aarnphm.xyz> Co-authored-by:
Russell Bryant <rbryant@redhat.com>
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Russell Bryant authored
Signed-off-by:
Russell Bryant <rbryant@redhat.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
idouba authored
-
Alex Brooks authored
Signed-off-by:
Alex-Brooks <Alex.brooks@ibm.com> Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Wanrui Dai authored
Signed-off-by:
evian <eviantai@u.nus.edu> Co-authored-by:
evian <eviantai@u.nus.edu>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
Kuntai Du authored
Signed-off-by:KuntaiDu <kuntai@uchicago.edu>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
TherLF authored
Signed-off-by:Ther-LF <2639852836@qq.com>
-
Lennart K. M. Schulz authored
Signed-off-by:lkm-schulz <44176356+lkm-schulz@users.noreply.github.com>
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
Aaron Pham <contact@aarnphm.xyz>
-
- 27 Apr, 2025 12 commits
-
-
Lily Liu authored
Signed-off-by:LiuXiaoxuanPKU <lilyliupku@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
cascade authored
Signed-off-by:
cascade812 <cascade812@outlook.com> Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
Kaixi Hou authored
Signed-off-by:kaixih <kaixih@nvidia.com>
-
Alex Brooks authored
Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
Flex Wang authored
[Misc] Change buckets of histogram_iteration_tokens to [1, 8, 16, 32, 64, 128, 256, 512, 1024, 2048, 4096, 8096] to represent number of tokens (#17033) Signed-off-by:sfc-gh-zhwang <flex.wang@snowflake.com>
-
Jade Zheng authored
Signed-off-by:Jade Zheng <zheng.shoujian@outlook.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
rasmith authored
[Kernel][Triton][FP8] Adding fp8 and variable length sequence support to Triton FAv2 kernel (#12591) Signed-off-by:Randall Smith <Randall.Smith@amd.com>
-
- 26 Apr, 2025 4 commits
-
-
Happy authored
Signed-off-by:ShuaibinLi <lishuaibin@live.cn>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Kero Liang authored
Signed-off-by:imkero <kerorek@outlook.com>
-