- 28 Apr, 2025 19 commits
-
-
Alex Wu authored
Signed-off-by:Alex <alexwu@character.ai>
-
Simon Mo authored
Signed-off-by:simon-mo <xmo@berkeley.edu>
-
Charlie Fu authored
Signed-off-by:charlifu <charlifu@amd.com>
-
Lucas Wilkinson authored
[BugFix] Fix cascade attention - RuntimeError: scheduler_metadata must have shape (metadata_size) (#17283) Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Russell Bryant authored
Signed-off-by:Russell Bryant <rbryant@redhat.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
idouba authored
-
Alex Brooks authored
Signed-off-by:
Alex-Brooks <Alex.brooks@ibm.com> Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Wanrui Dai authored
Signed-off-by:
evian <eviantai@u.nus.edu> Co-authored-by:
evian <eviantai@u.nus.edu>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
Aaron Pham <contact@aarnphm.xyz>
-
- 27 Apr, 2025 11 commits
-
-
Lily Liu authored
Signed-off-by:LiuXiaoxuanPKU <lilyliupku@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
cascade authored
Signed-off-by:
cascade812 <cascade812@outlook.com> Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
Kaixi Hou authored
Signed-off-by:kaixih <kaixih@nvidia.com>
-
Alex Brooks authored
Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
Flex Wang authored
[Misc] Change buckets of histogram_iteration_tokens to [1, 8, 16, 32, 64, 128, 256, 512, 1024, 2048, 4096, 8096] to represent number of tokens (#17033) Signed-off-by:sfc-gh-zhwang <flex.wang@snowflake.com>
-
Jade Zheng authored
Signed-off-by:Jade Zheng <zheng.shoujian@outlook.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
rasmith authored
[Kernel][Triton][FP8] Adding fp8 and variable length sequence support to Triton FAv2 kernel (#12591) Signed-off-by:Randall Smith <Randall.Smith@amd.com>
-
- 26 Apr, 2025 10 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Kero Liang authored
Signed-off-by:imkero <kerorek@outlook.com>
-
Ning Xie authored
-
Lu Fang authored
Signed-off-by:Lu Fang <lufang@fb.com>
-
changjun.lee authored
[Bugfix] fix error due to an uninitialized tokenizer when using `skip_tokenizer_init` with `num_scheduler_steps` (#9276) Signed-off-by:changjun.lee <pord7457@gmail.com>
-
Aaron Pham authored
Signed-off-by:Aaron Pham <contact@aarnphm.xyz>
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-
Russell Bryant authored
-
Agata Dobrzyniewicz authored
Signed-off-by:Agata Dobrzyniewicz <adobrzyniewicz@habana.ai>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-